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Philosophies and Goals of the Text: 
A Message to the Student 

A book is a very special link between author and reader. In a mystery novel, the 
author presents the reader with a maze of uncertainty, perplexity, and general 
quicksand. Intellectual smokescreens are set up all the way to the "whodunit" 
ending. Unfortunately, many business statistics texts seem to be written the same 
way — except for the "whodunit" section. This text is specifically designed to be 
different. Its goals are: (1) to be a clear and friendly guide as you learn about busi- 
ness statistics, (2) to avoid quicksand that could inhibit either your interest or 
your learning, and (3) to earn and retain your trust in our ability to accomplish 
goals 1 and 2. 

Business statistics is not only relevant to your present academic program, it is 
also relevant to your future personal and professional life. As a citizen, you will 
be exposed to, and perhaps may even help generate, statistical descriptions and 
analyses of data that are of vital importance to your local, state, national, and 
world communities. As a business professional, you will constantly be dealing with 
statistical measures of performance and success, as well as with employers who 
will expect you to be able to utilize the latest statistical techniques and computer 
software tools — including spreadsheet programs like Excel and statistical soft- 
ware packages like Minitab — in working with these measures. 

The chapters that follow are designed to be both informal and informative, as 
befits an introductory text in business statistics. You will not be expected to have 
had mathematical training beyond simple algebra, and mathematical symbols and 
notations will be explained as they become relevant to our discussion. Following an 
introductory explanation of the purpose and the steps involved in each technique, 
you will be provided with several down-to-earth examples of its use. Each section 
has a set of exercises based on the section contents. At the end of each chapter you'll 
find a summary of what you've read and a listing of equations that have been intro- 
duced, as well as chapter exercises, an interesting minicase or two, and in most of 
the chapters — a realistic business case to help you practice your skills. 

Features New to the Sixth Edition 
Data Analysis Plus™ 5.0 

The Sixth Edition makes extensive use of Data Analysis Plus™ 5.0, an updated 
version of the outstanding add-in that enables Microsoft Excel to carry out prac- 
tically all of the statistical tests and procedures covered in the text. This excellent 
software is easy to use, and is on the CD that accompanies each textbook. 



xvi 



Preface 



Test Statistics.xls and Estimators.xls 

Test Statistics.xls and Estimators.xls accompany and are an important comple- 
ment to Data Analysis Plus™ 5.0. These workbooks enable Excel users to 
quickly perform statistical tests and interval-estimation procedures by simply en- 
tering the relevant summary statistics. These workbooks are terrific for solving 
exercises, checking solutions, or even playing "what-if" by trying different inputs 
to see how they would affect the results. These workbooks, along with Beta- 
mean. xls and three companion workbooks to determine the power of a hypothe- 
sis test, accompany Data Analysis Plus™ 5.0 and are on the text CD. 

Updated Set of 82 Computer Solutions Featuring 
Complete Printouts and Step-By-Step Instructions 
for Obtaining Them 

Featuring the very latest versions of both Excel and Minitab — Excel 2003 and 
Minitab 15, respectively — these pieces are located in most of the major sections of 
the book. Besides providing relevant computer printouts for most of the text ex- 
amples, they are accompanied by friendly step-by-step instructions. 

Updated Exercises and Content 

The Sixth Edition includes a total of nearly 1600 section and chapter exercises, 
and approximately 150 of them are new or updated. Altogether, there are about 
1800 chapter, case, and applet exercises, with about 450 data sets on the text CD 
for greater convenience in using the computer. The datasets are in Excel, Minitab, 
and other popular formats. Besides numerous new or updated chapter examples, 
vignettes, and Statistics in Action items, coverage of the hypergeometric distribu- 
tion (Chapter 6) and the Spearman Coefficient of Rank Correlation (Chapter 14) 
have also been added, as has a CD Appendix to Chapter 19, Decision Making. 

Continuing Features of Introduction 
to Business Statistics 

Chapter-Opening Vignettes and 
Statistics In Action Items 

Each chapter begins with a vignette that's both interesting and relevant to the ma- 
terial ahead. Within each chapter, there are also Statistics In Action items that 
provide further insights into the issues and applications of business statistics in 
the real world. They include a wide range of topics, including using the consumer 
price index to time-travel to the (were they really lower?) prices in days gone by, 
and surprisingly-relevant discussion of an odd little car in which the rear passen- 
gers faced to the rear. Some of the vignette and Statistics in Action titles: 

Get That Cat off the Poll! (p, 116) 
Synergy, ANOVA, and the Thorndikes (p. 409) 
Proportions Testing and the Restroom Police (p. 465) 
Time-Series-Based Forecasting and the Zundapp (p. 685) 
The CPI Time Machine (p. 726) 

A Sample of Sampling By Giving Away Samples (p. 126) 
Gender Stereotypes and Asking for Directions (p. 361) 
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Extensive Use of Examples and Analogies 

The chapters continue to be packed with examples to illustrate the techniques 
being discussed. In addition to describing a technique and presenting a small-scale 
example of its application, we will typically present one or more Excel and 
Minitab printouts showing how the analysis can be handled with popular statis- 
tical software. This pedagogical strategy is used so the reader will better appreci- 
ate what's going on inside the computer when it's applied to problems of a larger 
scale. 

The Use of Real Data 

The value of statistical techniques becomes more apparent through the consistent 
use of real data in the text. Data sets gathered from such publications as USA 
Today, Fortune, Newsweek, and The Wall Street Journal are used in more than 
400 exercises and examples to make statistics both relevant and interesting. 

Computer Relevance 

The text includes nearly 200 computer printouts generated by Excel and Minitab, 
and the text CD contains data sets for section and chapter exercises, integrated 
and business cases, and chapter examples. In addition to the new Data Analysis 
Plus™ 5.0 software and the handy Test Statistics.xls and Estimators.xls work- 
books that accompany it, the Sixth Edition offers the separate collection of 26 
Excel worksheet templates generated by the author specifically for exercise solu- 
tions and "what-if" analyses based on summary data. 

Seeing Statistics Applets 

The Sixth Edition continues with the 21 popular interactive java applets, many 
customized by their author to specific content and examples in this textbook, and 
they include a total of 85 applet exercises. The applets are from the award- 
winning Seeing Statistics, authored by Gary McClelland of the University of 
Colorado, and they bring life and action to many of the most important statisti- 
cal concepts in the text. 

Integrated Cases 

At the end of each chapter, you'll find one or both of these case scenarios helpful 
in understanding and applying concepts presented within the chapter: 

(1) Thorndike Sports Equipment Company 

The text continues to follow the saga of Grandfather (Luke) and Grandson (Ted) 
Thorndike as they apply chapter concepts to the diverse opportunities, interesting 
problems, and assorted dilemmas faced by the Thorndike Sports Equipment 
Company. At the end of each chapter, the reader has the opportunity to help Luke 
and Ted apply statistics to their business. The text CD includes seven Thorndike 
video units that are designed to accompany and enhance selected written cases. 
Viewers should find them to enhance the relevance of the cases as well as to pro- 
vide some entertaining background for the Thorndikes' statistical adventures. 

(2) Springdale Shopping Survey 

The Springdale Shopping Survey cases provide the opportunity to apply chapter 
concepts and the computer to real numbers representing the opinions and behav- 
iors of real people in a real community. The only thing that isn't real is the name 
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of the community. The entire database contains 30 variables for 150 respondents. 
This database is also on the text CD. 

Business Cases 

The Sixth Edition also provides a set of 12 real-world business cases in 10 differ- 
ent chapters of the text. These interesting and relatively extensive cases feature 
disguised organizations, but include real data pertaining to real business problems 
and situations. In each case, the company or organization needs statistical assis- 
tance in analyzing their database to help them make more money, make better de- 
cisions, or simply make it to the next fiscal year. The organizations range all the 
way from an MBA program, to a real estate agency, to a pizza delivery service, 
and these cases and their variants are featured primarily among the chapters in 
the latter half of the text. The cases have been adapted from the excellent presen- 
tations in Business Cases in Statistical Decision Making, by Lawrence H. Peters, 
of Texas Christian University and J. Brian Gray, of the University of Alabama. 
Just as answers to problems in the real world are not always simple, obvious, and 
straightforward, neither are some of the solutions associated with the real prob- 
lems faced by these real (albeit disguised) companies and organizations. However, 
in keeping with the "Introduction to ... " title of this text, we do provide a few 
guidelines in the form of specific questions or issues the student may wish to ad- 
dress while using business statistics in helping to formulate observations and rec- 
ommendations that could be informative or helpful to his or her "client." 

Organization of the Text 

The text can be used in either a one-term or a two-term course. For one-term ap- 
plications, Chapters 1 through 11 are suggested. For two-term use, it is recom- 
mended that the first term include Chapters 1 through 11, and that the second 
term include at least Chapters 12 through 18. In either one- or two-term use, the 
number and variety of chapters allow for instructor flexibility in designing either 
a course or a sequence of courses that will be of maximum benefit to the student. 
This flexibility includes the possibility of including one or more of the two re- 
maining chapters, which are in the Special Topics section of the text. 

Chapter 1 provides an introductory discussion of business statistics and its 
relevance to the real world. Chapters 2 and 3 cover visual summarization meth- 
ods and descriptive statistics used in presenting statistical information. Chapter 4 
discusses popular approaches by which statistical data are collected or generated, 
including relevant sampling methods. In Chapters 5 through 7, we discuss the 
basic notions of probability and go on to introduce the discrete and continuous 
probability distributions upon which many statistical analyses depend. In Chap- 
ters 8 and 9, we discuss sampling distributions and the vital topic of making esti- 
mates based on sample findings. 

Chapters 10 through 14 focus on the use of sample data to reach conclusions 
regarding the phenomena that the data represent. In these chapters, the reader 
will learn how to use statistics in deciding whether to reject statements that have 
been made concerning these phenomena. Chapters 15 and 16 introduce methods 
for obtaining and using estimation equations in describing how one variable tends 
to change in response to changes in one or more others. 

Chapter 17 extends the discussion in the two previous chapters to examine 
the important issue of model building. Chapter 18 discusses time series, forecast- 
ing, and index number concepts used in analyzing data that occur over a period 
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of time. Chapter 19 discusses the role of statistics in decision theory, while Chap- 
ter 20 explores total quality management and its utilization of statistics. 

At the end of the text, there is a combined index and glossary of key terms, a 
set of statistical tables, and answers to selected odd exercises. For maximum 
convenience, immediately preceding the back cover of the text are pages contain- 
ing the two statistical tables to which the reader will most often be referring: the 
^-distribution and the standard normal, or ^-distribution. 

Ancillary Items 

To further enhance the usefulness of the text, a complete package of complemen- 
tary ancillary items has been assembled: 

Student's Suite CD-ROM 

This CD is packaged with each textbook and contains Data Analysis Plus™ S.O 
Excel add-in software and accompanying workbooks, including Test Statistics.xls 
and Estimators.xls; Seeing Statistics applets, datasets for exercises, cases, and text 
examples; author-developed Excel worksheet templates for exercise solutions and 
"what-if" analyses; and the Thorndike Sports Equipment video cases. Also in- 
cluded, in pdf format, are Chapter 21, Ethics in Statistical Analysis and Report- 
ing, and a Chapter 19 appendix on the expected value of imperfect information. 

Instructor's Resource CD-ROM 

This CD is available to qualified adopters, and contains author-generated com- 
plete and detailed solutions to all section, chapter, and applet exercises, integrated 
cases and business cases; a test bank in Microsoft Word format that includes test 
questions by section; ExamView testing software, which allows a professor to cre- 
ate exams in minutes; PowerPoint presentations featuring concepts and examples 
for each chapter; and a set of display Seeing Statistics applets based on those in 
the text and formatted for in-class projection. 

Also Available from the Publisher 

Available separately from the publisher are other items for enhancing students' 
learning experience with the textbook. Among them are the following: 

Student Solutions Manual (Weiers) 

This hard-copy manual is author-generated and contains complete, detailed solu- 
tions to all odd-numbered exercises in the text. It is available separately, or it can 
be pre-packaged with the textbook. 

Instructor's Solutions Manual (Weiers) 

The Instructor's Solutions manual contains author-generated complete and de- 
tailed solutions to all section, chapter, and applet exercises, integrated cases and 
business cases. It is available on the Instructor's Resource CD in Word format. 

Test Bank (Bob Donnelly) 

Containing over 2600 test questions, including true-false, multiple-choice, and 
problems similar to those at the ends of the sections and chapters of the text, the 



XX 



Preface 



computerized Test Bank makes test creation a cinch. The ExamView program is 
available on the Instructor's Resource CD. 

Minitab, Student Version for Windows (Minitab, Inc.) 

The student version of this popular statistical software package. Available at a 
discount when bundled with the text. 
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Using the Computer 

In terms of software capability, this edition is the best yet. Besides incorporating 
Excel's standard Data Analysis module and Toolbar Function capability, we fea- 
ture Data Analysis Plus™ S.O and its primary workbook partners, Test Statis- 
tics.xls and Estimators.xls. The text includes 82 Computer Solutions pieces that 
show Excel and Minitab printouts relevant to chapter examples, plus friendly 
step-by-step instructions showing how to carry out each analysis or procedure in- 
volved. The Excel materials have been tested with Microsoft Office 2003, but the 
printouts and instructions will be either identical or very similar to those for ear- 
lier versions of this spreadsheet software package. The Minitab printouts and 
instructions pertain to Minitab Release 15, but will be either identical or very sim- 
ilar to those for earlier versions of this dedicated statistical software package. 

Minitab 

This statistical software is powerful, popular, easy to use, and offers little in the 
way of surprises — a pleasant departure in an era when we too-often see software 
crashes and the dreaded "blue screen of death." As a result, there's not much else to 
be said about this dedicated statistical software. If you use either the full version 
or the student version of Minitab, you should be able to navigate the Minitab 
portions of the 82 Computer Solutions pieces in the text with ease. Note that 
Minitab 15 has excellent graphics that will not be nearly so attractive in some 
earlier versions. 

Excel 

This popular spreadsheet software offers a limited number of statistical tests and 
procedures, but it delivers excellent graphics and it seems to be installed in nearly 
every computer on the planet. As a result, it gets featured coverage in many of the 
newer statistics textbooks, including this one. Some special sections with regard 
to Excel appear below. 

Data Analysis/Analysis ToolPak 

This is the standard data analysis module within Excel. When you click Tools, 
you should see Data Analysis on the menu list that appears. If it is not present, 
you will need to install it using this procedure: 1. Click Tools. 2. Click Add-Ins. 3. 
Click to select Analysis ToolPak. 4. Click OK. If the Analysis ToolPak choice does 
not appear in step 3, you'll have to install it using your Microsoft Office CD and 
setup program. 

Paste Function (f x ) 

The symbol f x appears as one of the buttons in the Excel toolbar near the top of 
the screen. It provides many kinds of functions, including math (e.g., placing the 
square of one cell into another cell) and statistical (e.g., finding areas under the 
normal curve.) This toolbar item is employed in a number of computer-assisted 
analyses and procedures in the text, and its usage will be explained within the 
context of each Computer Solutions piece in which it appears. 

Data Analysis Plus™ 5.0 

This outstanding software greatly extends Excel's capabilities to include practi- 
cally every statistical test and procedure covered in the text, and it is very easy to 
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use. It is on the CD that accompanies the text and is automatically installed when 
you follow the setup instructions on the CD. Typically, a file called STATS.xls will 
be inserted into the XLstart folder in the Excel portion of your computer's Win- 
dows directory. This software is featured in nearly one-third of the Computer So- 
lutions sets of printouts and instructions that appear in the text. After installation 
using the text CD, when you click Tools, the Data Analysis Plus item will be 
among those appearing on the menu below. In the unlikely case of difficulties, 
refer to the "readme.txt" file on the CD for manual-installation instructions or to 
the toll-free number on the CD. 

Test Statistics.xls and Estimators.xls 

These Excel workbooks are among those accompanying Data Analysis Plus™ 5.0 
on the text CD. They contain worksheets that enable us to carry out procedures 
or obtain solutions based only on summary information about the problem or sit- 
uation. This is a real work-saver for solving chapter exercises, checking solutions 
that have been hand-calculated, or for playing "what-if" by trying different in- 
puts to instantaneously see how they affect the results. These workbooks are typ- 
ically installed into the same directory where the data files are located. 

Other Excel Worksheet Templates 

There are 26 Excel worksheet templates generated by the author and carried over 
from the previous edition. As with the worksheets within Test Statistics.xls and 
Estimators.xls, they provide solutions based on summary information about a 
problem or situation. The instructions for using each template are contained 
within the template itself. When applicable, they are cited within the Computer 
Solutions items in which the related analyses or procedures appear. 
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Statistics Can Entertain, Enlighten, Alarm 

Today's statistics applications range from the inane to the highly germane. Sometimes 
statistics provides nothing more than entertainment — e.g., a study found that 54% of 
U.S. adults celebrate their birthday by dining out. 1 Regarding an actual entertainer, 
another study found that the public's "favorable" rating for actor Tom Cruise had 
dropped from 58% to 35% between 2005 and 2006. 2 

On the other hand, statistical descriptors can be of great importance to managers 
and decision makers. For example, 5% of workers say they use the Internet too much 
at work, and that decreases their productivity. 3 In the governmental area, U.S. census 
data can mean millions of dollars to big cities. According to the Los Angeles city 
council, that city will have lost over $180 million in federal aid because the 2000 census 
had allegedly missed 76,800 residents, most of whom were urban, minority, and poor. 4 

At a deadly extreme, statistics can also describe the growing toll on persons living 
near or downwind of Chernobyl, site of the world's worst nuclear accident. Just 1 0 years 
following this 1 986 disaster, cancer rates in the fallout zone had already nearly doubled, 
and researchers are now concerned about the possibility of even higher rates with the 
greater passage of time. 5 In general, statistics can be useful in examining any geographic 
"cluster" of disease incidence, helping us to decide whether the higher incidence could 
be due simply to chance variation, or whether some environmental 
agent or pollutant may have played a role. 
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'Source: Mary Cadden and Robert W. 
Ahrens, "Taking a Holiday from the Kitchen," 
USA Today, March 23, 2006, p. 1D. 
2 Source: Susan Wloszczyna, "In Public's Eyes, 
Tom's Less of a Top Gun," USA Today, May 1 0, 
2006, p. ID. 

3 Source: Jae Yang and Marcy Mullins, "Internet 
Usage's Impact on Productivity," USA Today, 
March 21, 2006, p. 1B. 
4 Source: www.cd13.com, letter from Los 
Angeles City Council to U.S. House of 
Representatives, April 1 1, 2006. 
5 Source: Allison M. Heinrichs, "Study to Examine 
Breast Cancer in Europeans," Pittsburgh Tribune- 
Review, April 23, 2006. 
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Part 1: Business Statistics: Introduction and Background 




Timely Topic, Tattered Image 



At this point in your college career, toxic dumping, armed robbery, fortune telling, 
and professional wrestling may all have more positive images than business statistics. 
If so, this isn't unusual, since many students approach the subject believing that it will 
be either difficult or irrelevant. In a study of 105 beginning students' attitudes toward 
statistics, 56% either strongly or moderately agreed with the statement, "I am afraid 
of statistics." 6 (Sorry to have tricked you like that, but you've just been introduced to 
a statistic, one that you'll undoubtedly agree is neither difficult nor irrelevant.) 

Having recognized such possibly negative first impressions, let's go on to discuss 
statistics in a more positive light. First, regarding ease of learning, the only thing 
this book assumes is that you have a basic knowledge of algebra. Anything else 
you need will be introduced and explained as we go along. Next, in terms of 
relevance, consider the unfortunates of Figure 1.1 and how just the slight change 
of a single statistic might have considerably influenced each individual's fortune. 



Briefly defined, business statistics can be described as the collection, summarization, 
analysis, and reporting of numerical findings relevant to a business decision or situ- 
ation. Naturally, given the great diversity of business itself, it's not surprising that 
statistics can be applied to many kinds of business settings. We will be examining a 
wide spectrum of such applications and settings. Regardless of your eventual career 
destination, whether it be accounting or marketing, finance or politics, information 
science or human resource management, you'll find the statistical techniques 
explained here are supported by examples and problems relevant to your own field. 



As a businessperson, you may find yourself involved with statistics in at least one 
of the following ways: (1) as a practitioner collecting, analyzing, and presenting 



What Is Business Statistics? 



For the Consumer as Well as the Practitioner 





Some have the notion that 
statistics can be irrelevant. 
As the plight of these indi- 
viduals suggests, nothing 
could be further from the 
truth. 



Lefty "H.R." Jones, former 
professional baseball pitcher. 
Had an earned-run average of 
12.4 last season, which turned 
out to be his last season. 




n of readers weren't even aware 
) of his column. 



newspaper columnist. Survey 
by publisher showed that 43% 



Walter Wickerbin, former 




tire manufacturer. The company 
r T advertised a 45,000-mile tread life, 
W but tests by a leading consumer 
— magazine found most tires wore 
out in less than 20,000 miles. 



Rhonda Rhodes, former vice 
president of engineering for a 



6 Source: Eleanor W. Jordan and Donna F. Stroup, "The Image of Statistics," Collegiate News and 
Views, Spring 1984, p. 11. 
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findings based on statistical data or (2) as a consumer of statistical claims and 
findings offered by others, some of whom may be either incompetent or unethical. 

As you might expect, the primary orientation of this text will be toward the 
"how-to," or practitioner, dimension of business statistics. After finishing this 
book, you should be both proficient and conversant in most of the popular tech- 
niques used in statistical data collection, analysis, and reporting. As a secondary 
goal, this book will help you protect yourself and your company as a statistical 
consumer. In particular, it's important that you be able to deal with individuals 
who arrive at your office bearing statistical advice. Chances are, they'll be one of 
the following: 

1. Dr. Goodstat The good doctor has painstakingly employed the correct 
methodology for the situation and has objectively analyzed and reported on 
the information he's collected. Trust him, he's OK. 

2. Stanley Stumbler Stanley means well, but doesn't fully understand what he's 
doing. He may have innocently employed an improper methodology and 
arrived at conclusions that are incorrect. In accepting his findings, you may 
join Stanley in flying blind. 

3. Dr. Unethicus This character knows what he's doing, but uses his knowledge 
to sell you findings that he knows aren't true. In short, he places his own 
selfish interests ahead of both scientific objectivity and your informational 
needs. He varies his modus operandi and is sometimes difficult to catch. One 
result is inevitable: when you accept his findings, he wins and you lose. 



STATISTICS: YESTERDAY AND TODAY 1 .2 



Yesterday 

Although statistical data have been collected for thousands of years, very early 
efforts typically involved simply counting people or possessions to facilitate 
taxation. This record-keeping and enumeration function remained dominant 
well into the 20th century, as this 1925 observation on the role of statistics in 
the commercial and political world of that time indicates: 

It is coming to be the rule to use statistics and to think statistically. The larger 
business units not only have their own statistical departments in which they col- 
lect and interpret facts about their own affairs, but they themselves are consumers 
of statistics collected by others. The trade press and government documents are 
largely statistical in character, and this is necessarily so, since only by the use of 
statistics can the affairs of business and of state be intelligently conducted. 

Business needs a record of its past history with respect to sales, costs, sources 
of materials, market facilities, etc. Its condition, thus reflected, is used to measure 
progress, financial standing, and economic growth. A record of business 
changes — of its rise and decline and of the sequence of forces influencing it — is 
necessary for estimating future developments. 7 

Note the brief reference to "estimating future developments" in the preced- 
ing quotation. In 1925, this observation was especially pertinent because a transition 
was in process. Statistics was being transformed from a relatively passive record 

7 Source: Horace Secrist, An Introduction to Statistical Methods, rev. ed. New York: Macmillan 
Company, 1925, p. 1. 
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keeper and descriptor to an increasingly active and useful business tool, which would 
influence decisions and enable inferences to be drawn from sample information. 

Today 

Today, statistics and its applications are an integral part of our lives. In such 
diverse settings as politics, medicine, education, business, and the legal arena, 
human activities are both measured and guided by statistics. 

Our behavior in the marketplace generates sales statistics that, in turn, help 
companies make decisions on products to be retained, dropped, or modified. 
Likewise, auto insurance firms collect data on age, vehicle type, and accidents, 
and these statistics guide the companies toward charging extremely high premiums 
for teenagers who own or drive high-powered cars like the Chevrolet Corvette. In 
turn, the higher premiums influence human behavior by making it more difficult 
for teens to own or drive such cars. The following are additional examples where 
statistics are either guiding or measuring human activities. 

• Well beyond simply counting how many people live in the United States, the 
U.S. Census Bureau uses sampling to collect extensive information on income, 
housing, transportation, occupation, and other characteristics of the popu- 
lace. The Bureau used to do this by means of a "long form" sent to 1 in 6 
Americans every 10 years. Today, the same questions are asked in a 67-question 
monthly survey that is received by a total of about 3 million households each 
year. The resulting data are more recent and more useful than the decennial 
sampling formerly employed, and the data have a vital effect on billions of 
dollars in business decisions and federal funding. 8 

• According to the International Dairy Foods Association, ice cream and related 
frozen desserts are consumed by more than 90% of the households in the 
United States. The most popular flavor is vanilla, which accounts for 26% of 
sales. Chocolate is a distant second, at 13% of sales. 9 

• On average, U.S. stores lose $25 million each day to shoplifters. The problem 
becomes even worse when the national economy is weak, and more than half 
of those arrested for shoplifting are under the age of 25. Every day, 5400 
people are detained for shoplifting. 10 

Throughout this text, we will be examining the multifaceted role of statistics 
as a descriptor of information, a tool for analysis, a means of reaching conclu- 
sions, and an aid to decision making. In the next section, after introducing the 
concept of descriptive versus inferential statistics, we'll present further examples 
of the relevance of statistics in today's world. 



exercises 




■ .la 




1.1 What was the primary use of statistics in ancient 
times? 



1.2 In what ways can business statistics be useful in 
today's business environment? 



8 Source: Haya El Nasser, "Rolling Survey for 2010 Census Keeps Data Up to Date," USA Today, 

January 17, 2005, p. 4A. 

'Source: http://www.idfa.org, June 14, 2006. 

10 Source: http://witn.psu.edu/articles (show #2516 news summary), June 14, 2006. 
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DESCRIPTIVE VERSUS INFERENTIAL STATISTICS 1 .3 



As we have seen, statistics can refer to a set of individual numbers or numerical 
facts, or to general or specific statistical techniques. A further breakdown of the 
subject is possible, depending on whether the emphasis is on (1) simply describing 
the characteristics of a set of data or (2) proceeding from data characteristics to 
making generalizations, estimates, forecasts, or other judgments based on the 
data. The former is referred to as descriptive statistics, while the latter is called 
inferential statistics. As you might expect, both approaches are vital in today's 
business world. 

Descriptive Statistics 

In descriptive statistics, we simply summarize and describe the data we've col- 
lected. For example, upon looking around your class, you may find that 35% of 
your fellow students are wearing Casio watches. If so, the figure "35%" is a 
descriptive statistic. You are not attempting to suggest that 35% of all college 
students in the United States, or even at your school, wear Casio watches. You're 
merely describing the data that you've recorded. In the year 1900, the U.S. Postal 
Service operated 76,688 post offices, compared to just 27,505 in 2004. 11 In 2005, 
the 1.26 billion common shares of McDonald's Corporation each received a $0.67 
dividend on net income of $2.04 per common share. 12 Table 1.1 (page 6) provides 
additional examples of descriptive statistics. Chapters 2 and 3 will present a num- 
ber of popular visual and statistical approaches to expressing the data we or oth- 
ers have collected. For now, however, just remember that descriptive statistics are 
used only to summarize or describe. 

Inferential Statistics 

In inferential statistics, sometimes referred to as inductive statistics, we go beyond 
mere description of the data and arrive at inferences regarding the phenomena or 
phenomenon for which sample data were obtained. For example, based partially 
on an examination of the viewing behavior of several thousand television house- 
holds, the ABC television network may decide to cancel a prime-time television 
program. In so doing, the network is assuming that millions of other viewers 
across the nation are also watching competing programs. 

Political pollsters are among the heavy users of inferential statistics, typically 
questioning between 1000 and 2000 voters in an effort to predict the voting behav- 
ior of millions of citizens on election day. If you've followed recent presidential 
elections, you may have noticed that, although they contact only a relatively small 
number of voters, the pollsters are quite often "on the money" in predicting both 
the winners and their margins of victory. This accuracy, and the fact that it's not 
simply luck, is one of the things that make inferential statistics a fascinating and 
useful topic. (For more examples of the relevance and variety of inferential statis- 
tics, refer to Table 1.1.) As you might expect, much of this text will be devoted to 
the concept and methods of inferential statistics. 

n Source: Bureau of the Census, U.S. Department of Commerce, Statistical Abstract of the United 
States 2006, p. 729. 

12 Source: McDonald's Corporation, Inc., 2005 Summary Annual Report. 
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Descriptive Statistics 

• Members of Congress accepted 372 privately financed trips during the first half 
of 2006. [p. 5A] 

• During the second quarter of 2006, media giant Time Warner reported revenues 
of $10.7 billion, [p. 3B] 

• Ford's F-series pickup trucks remain the best-selling vehicles in America, but July 
sales were down 45.6% compared with the same period a year before, [p. 3B] 

• U.S. shipments of digital cameras totaled 6.3 million units during the first quarter 
of 2006, up 17% over the first quarter of 2005. [p. 1 B] 

Inferential Statistics 

• A survey by the American Automobile Association found the average price for 
premium gasoline was $3,302, slightly down from $3,307 the previous day. [p. 1 A] 

• According to a Hartford Financial Services Group survey of 2245 parents of high 
school and college students, 20% of the parents said they didn't expect their child 
to be responsible for any of his or her financial costs of going to college, [p. 1 B] 

• Survey results indicated that 13.5% of persons under 18 keep a personal blog, 
display photos on the Web, or maintain their own website, [p. 1 D] 

• In a survey of environmental responsibility, 37.8% of the respondents said environ- 
mentally friendly products are "very important" to them and their family, [p. 1 B] 



Source; USA Today, August 3, 2006. The page references are shown in brackets. 



Key Terms for Inferential Statistics 

In surveying the political choices of a small number of eligible voters, political 
pollsters are using a sample of voters selected from the population of all eligible 
voters. Based on the results observed in the sample, the researchers then proceed 
to make inferences on the political choices likely to exist in this larger population 
of eligible voters. A sample result (e.g., 46% of the sample favor Charles Grady 
for president) is referred to as a sample statistic and is used in an attempt to esti- 
mate the corresponding population parameter (e.g., the actual, but unknown, 
national percentage of voters who favor Mr. Grady). These and other important 
terms from inferential statistics may be defined as follows: 

• Population Sometimes referred to as the universe, this is the entire set of 
people or objects of interest. It could be all adult citizens in the United States, 
all commercial pilots employed by domestic airlines, or every roller bearing 
ever produced by the Timken Company. 

A population may refer to things as well as people. Before beginning a study, it is 
important to clearly define the population involved. For example, in a given study, 
a retailer may decide to define "customer" as all those who enter her store 
between 9 A.M. and 5 p.m. next Wednesday. 

• Sample This is a smaller number (a subset) of the people or objects that exist 
within the larger population. The retailer in the preceding definition may 
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decide to select her sample by choosing every 10th person entering the store 
between 9 A.M. and 5 p.m. next Wednesday. 

A sample is said to be representative if its members tend to have the same charac- 
teristics (e.g., voting preference, shopping behavior, age, income, educational 
level) as the population from which they were selected. For example, if 45% of 
the population consists of female shoppers, we would like our sample to also 
include 45% females. When a sample is so large as to include all members of the 
population, it is referred to as a complete census. 

• Statistic This is a measured characteristic of the sample. For example, our 
retailer may find that 73% of the sample members rate the store as having 
higher-quality merchandise than the competitor across the street. The sample 
statistic can be a measure of typicalness or central tendency, such as the mean, 
median, mode, or proportion, or it may be a measure of spread or dispersion, 
such as the range and standard deviation: 

The sample mean is the arithmetic average of the data. This is the sum of the data 
divided by the number of values. For example, the mean of $4, $3, and $8 can be 
calculated as ($4 + $3 + $8)/3, or $5. 

The sample median is the midpoint of the data. The median of $4, $3, and $8 
would be $4, since it has just as many values above it as below it. 

The sample mode is the value that is most frequently observed. If the data consist 
of the numbers 12, 15, 10, 15, 18, and 21, the mode would be 15 because it oc- 
curs more often than any other value. 

The sample proportion is simply a percentage expressed as a decimal fraction. For 
example, if 75.2% is converted into a proportion, it becomes 0.752. 

The sample range is the difference between the highest and lowest values. For ex- 
ample, the range for $4, $3, and $8 is ($8 — $3), or $5. 

The sample standard deviation, another measure of dispersion, is obtained by 
applying a standard formula to the sample values. The formula for the standard 
deviation is covered in Chapter 3, as are more detailed definitions and examples of 
the other measures of central tendency and dispersion. 

• Parameter This is a numerical characteristic of the population. If we were to 
take a complete census of the population, the parameter could actually be 
measured. As discussed earlier, however, this is grossly impractical for most 
business research. The purpose of the sample statistic is to estimate the value 
of the corresponding population parameter (e.g., the sample mean is used to 
estimate the population mean). Typical parameters include the population 
mean, median, proportion, and standard deviation. As with sample statistics, 
these will be discussed in Chapter 3. 

For our retailer, the actual percentage of the population who rate her 
store's merchandise as being of higher quality is unknown. (This unknown 
quantity is the parameter in this case.) However, she may use the sample 
statistic (73%) as an estimate of what this percentage would have been had 
she taken the time, expense, and inconvenience to conduct a census of all cus- 
tomers on the day of the study. 
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1 .3 What is the difference between descriptive statistics 
and inferential statistics? Which branch is involved when 
a state senator surveys some of her constituents in order 
to obtain guidance on how she should vote on a piece of 
legislation? 

1.4 In 2002, the Cinergy Corporation sold 35,615 million 
cubic feet of gas to residential customers, an increase of 
1.1% over the previous year. Does this information repre- 
sent descriptive statistics or inferential statistics? Why? 
source: Cinergy Corporation, Annual Report 2002, p. 110. 



1.5 An article in Runner's World magazine described a 
study that compared the cardiovascular responses of 
20 adult subjects for exercises on a treadmill, on a mini- 
trampoline, and jogging in place on a carpeted surface. 
Researchers found average heart rates were significantly 
less on the minitrampoline than for the treadmill and 
stationary jogging. Does this information represent 
descriptive statistics or inferential statistics? Why? 
source: Kate Delhagen, "Health Watch," Runner's World, 
August 1987, p. 21. 



1 .4 TYPES OF VARIABLES AND SCALES 

OF MEASUREMENT 



Qualitative Variables 

Some of the variables associated with people or objects are qualitative in nature, 
indicating that the person or object belongs in a category. For example: (1) you 
are either male or female; (2) you have either consumed Dad's Root Beer within 
the past week or you have not; (3) your next television set will be either color or 
black and white; and (4) your hair is likely to be brown, black, red, blonde, or 
gray. While some qualitative variables have only two categories, others may have 
three or more. Qualitative variables, also referred to as attributes, typically 
involve counting how many people or objects fall into each category. 

In expressing results involving qualitative variables, we describe the percentage 
or the number of persons or objects falling into each of the possible categories. 
For example, we may find that 35% of grade-school children interviewed recognize 
a photograph of Ronald McDonald, while 65% do not. Likewise, some of the 
children may have eaten a Big Mac hamburger at one time or another, while 
others have not. 

Quantitative Variables 

Quantitative variables enable us to determine how much of something is possessed, 
not just whether it is possessed. There are two types of quantitative variables: 
discrete and continuous. 

Discrete quantitative variables can take on only certain values along an interval, 
with the possible values having gaps between them. Examples of discrete quantita- 
tive variables would be the number of employees on the payroll of a manufacturing 
firm, the number of patrons attending a theatrical performance, or the number of 
defectives in a production sample. Discrete variables in business statistics usually 
consist of observations that we can count and often have integer values. Fractional 
values are also possible, however. For example, in observing the number of gallons 
of milk that shoppers buy during a trip to a U.S. supermarket, the possible values 
will be 0.25, 0.50, 0.75, 1.00, 1.25, 1.50, and so on. This is because milk is typically 
sold in 1 -quart containers as well as gallons. A shopper will not be able to purchase 
a container of milk labeled "0.835 gallons." The distinguishing feature of discrete 
variables is that gaps exist between the possible values. 
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Continuous quantitative variables can take on a value at any point along an 
interval. For example, the volume of liquid in a water tower could be any quan- 
tity between zero and its capacity when full. At a given moment, there might be 
325,125 gallons, 325,125.41 gallons, or even 325,125.413927 gallons, depend- 
ing on the accuracy with which the volume can be measured. The possible values 
that could be taken on would have no gaps between them. Other examples of 
continuous quantitative variables are the weight of a coal truck, the Dow Jones 
Industrial Average, the driving distance from your school to your home town, and 
the temperature outside as you're reading this book. The exact values each of 
these variables could take on would have no gaps between them. 



Scales of Measurement 

Assigning a numerical value to a variable is a process called measurement. For 
example, we might look at the thermometer and observe a reading of 72.5 degrees 
Fahrenheit or examine a box of lightbulbs and find that 3 are broken. The 
numbers 72.5 and 3 would constitute measurements. When a variable is mea- 
sured, the result will be in one of the four levels, or scales, of measurement — 
nominal, ordinal, interval, or ratio — summarized in Figure 1.2. The scale to 
which the measurements belong will be important in determining appropriate 
methods for data description and analysis. 

The Nominal Scale 

The nominal scale uses numbers only for the purpose of identifying membership 
in a group or category. Computer statistical analysis is greatly facilitated by the 
use of numbers instead of names. For example, Louisiana's Entergy Corporation 
lists four types of domestic electric customers. 13 In its computer records, the 
company might use "1" to identify residential customers, "2" for commercial 
customers, "3" for industrial customers, and "4" for government customers. 
Aside from identification, these numbers have no arithmetic meaning. 

The Ordinal Scale 

In the ordinal scale, numbers represent "greater than" or "less than" measure- 
ments, such as preferences or rankings. For example, consider the following 



Nominal 



Each number represents a category 



Ordinal Greater than and less than relationships 



Interval 



Ratio 




Units of measurement 



and Absolute zero point 



I 



FIGURE 1.2 



The methods through 
which statistical data can 
be analyzed depend on 
the scale of measurement 
of the data. Each of the 
four scales has its own 
characteristics. 



13 Source: Entergy Corporation, 2005 Annual Report. 
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Association of Tennis Professionals singles rankings for female tennis players: 14 

1. Amelie Mauresmo 

2. Kim Clijsters 

3. Justine Henin-Hardenne 

4. Maria Sharapova 

In the ordinal scale, numbers are viewed in terms of rank (i.e., greater than, 
less than), but do not represent distances between objects. For example, we can- 
not say that the distance between Amelie Mauresmo and Kim Clijsters is the same 
as the distance between Kim Clijsters and Justine Henin-Hardenne. This is because 
the ordinal scale has no unit of measurement. 



The Interval Scale 

The interval scale not only includes "greater than" and "less than" relationships, 
but also has a unit of measurement that permits us to describe bow much more or 
less one object possesses than another. The Fahrenheit temperature scale represents 
an interval scale of measurement. We not only know that 90 degrees Fahrenheit is 
hotter than 70 degrees, and that 70 degrees is hotter than 60 degrees, but can also 
state that the distance between 90 and 70 is twice the distance between 70 and 60. 
This is because degree markings serve as the unit of measurement. 

In an interval scale, the unit of measurement is arbitrary, and there is no 
absolute zero level where none of a given characteristic is present. Thus, multiples 
of measured values are not meaningful — e.g., 2 degrees Fahrenheit is not twice as 
warm as 1 degree. On questionnaire items like the following, business research 
practitioners typically treat the data as interval scale since the same physical and 
numerical distances exist between alternatives: 

[ ] [ ] [ ] [ ] [ ] 
Kmart prices are 1 2 3 4 5 
low high 



The Ratio Scale 

The ratio scale is similar to the interval scale, but has an absolute zero and 
multiples are meaningful. Election votes, natural gas consumption, return on 
investment, the speed of a production line, and FedEx Corporation's average 
daily delivery of 5,868,000 packages during 2005 15 are all examples of the ratio 
scale of measurement. 




1 .6 What is the difference between a qualitative vari- 
able and a quantitative variable? When would each be 
appropriate? 

1 .7 What is the difference between discrete and continu- 
ous variables? Under what circumstances would each be 
applicable? 



1.8 The Acme School of Locksmithing has been 
accredited for the past 15 years. Discuss how this 
information might be interpreted as a 

a. qualitative variable. 

b. quantitative variable. 



14 Source: ESPN.com, June 14, 2006. 

15 Source: FedEx Corporation, 2005 Annual Report, p. 3. 
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1.9 Jeff Bowlen, a labor relations expert, has collected 
information on strikes in various industries. 

a. Jeff says, "Industry A has been harder hit by strikes 
than Industry B." In what scale of measurement is 
this information? Why? 

b. Industry C has lost 10.8 days per worker, while 
Industry D has lost 14.5 days per worker. In what 
scale of measurement is this information? Why? 



1.10 The Snowbird Ski Lodge attracts skiers from several 
New England states. For each of the following scales of 
measurement, provide one example of information that 
might be relevant to the lodge's business, 
a. Nominal b. Ordinal 
c. Interval d. Ratio 



STATISTICS IN BUSINESS DECISIONS 1 .5 



One aspect of business in which statistics plays an especially vital role is decision 
making. Every year, U.S. businesses risk billions of dollars in important decisions 
involving plant expansions, new product development, personnel selection, qual- 
ity assurance, production techniques, supplier choices, and many others. These 
decisions almost always involve an element of uncertainty. Competitors, govern- 
ment, technology, and the social and economic environment, along with 
sometimes capricious consumers and voters, constitute largely uncontrollable 
factors that can sometimes foil the best-laid plans. 

Prior to making decisions, companies often collect information through a 
series of steps called the research process. The steps include: (1) defining the 
problem in specific terms that can be answered by research, (2) deciding on the 
type of data required, (3) determining through what means the data will be 
obtained, (4) planning for the collection of data and, if necessary, selection of a 
sample, (5) collecting and analyzing the data, (6) drawing conclusions and report- 
ing the findings, and (7) following through with decisions that take the findings 
into consideration. Business and survey research, discussed more fully in Chapter 
4, provides both descriptive and inferential statistics that can improve business 
decisions in many kinds of situations. 




1.11 Restaurants sometimes provide "customer reaction" 
cards so that customers can evaluate their dining experi- 
ence at the establishment. What kinds of decisions might 
be made on the basis of this information? 



1.12 What kinds of statistical data might a burglar alarm 
company employ in trying to convince urban homeown- 
ers to purchase its product? 



BUSINESS STATISTICS: TOOLS VERSUS TRICKS 1 .6 



The techniques of business statistics are a valuable tool for the enhancement of 
business operations and success. Appropriately, the major emphasis of this text will be 
to acquaint you with these techniques and to develop your proficiency in using them 
and interpreting their results. 

On the other hand, as suggested earlier, these same techniques can be abused for 
personal or corporate gain. Improperly used, statistics can become an effective 
weapon with which to persuade or manipulate others into beliefs or behaviors that 
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we'd like them to adopt. Note too that, even when they are not intentionally mis- 
used, the results of statistical research and analyses can depend a lot on when and 
how they were conducted, as Statistics in Action 1.1 shows. 

Unlike many other pursuits, such as defusing torpedoes, climbing mountains, 
or wrestling alligators, improper actions in business statistics can sometimes work 
in your favor. (As embezzlers know, this can also be true in accounting.) Natu- 
rally, we don't expect that you'll use your knowledge of statistics to manipulate 
unknowing customers and colleagues, but you should be aware of how others 
may be using statistics in an attempt to manipulate you. Remember that one of 
the key goals of this text is to make you an informed consumer of statistical infor- 
mation generated by others. In general, when you are presented with statistical 
data or conclusions that have been generated by others, you should ask yourself 
this key question: Who carried out this study and analyzed the data, and what 
benefits do they stand to gain from the conclusions reached? 




1.13 The text claims that a company or organization who wishes to avoid accepting biased results, what single 
might actually benefit when one of its employees uses question should be foremost in your mind as you begin 
statistics incorrectly. How can this be? reading the article? 

1.14 The headline of an article in your daily newspaper 
begins "Research Study Reveals. ..." As a statistics student 



1 .7 SUMMARY 



Business statistics can be defined as the collection, summarization, analysis, and 
reporting of numerical findings relevant to a business decision or situation. As busi- 
nesspersons and citizens, we are involved with statistics either as practitioners or as 
consumers of statistical claims and findings offered by others. Very early statistical 
efforts primarily involved counting people or possessions for taxation purposes. More 
recently, statistical methods have been applied in all facets of business as a tool for 
analysis and reporting, for reaching conclusions based on observed data, and as an aid 
to decision making. 

Statistics can be divided into two branches: descriptive and inferential. Descriptive 
statistics focuses on summarizing and describing data that have been collected. Infer- 
ential statistics goes beyond mere description and, based on sample data, seeks to 
reach conclusions or make predictions regarding the population from which the 
sample was drawn. The population is the entire set of all people or objects of interest, 
with the sample being a subset of this group. A sample is said to be representative if 
its members tend to have the same characteristics as the larger population. A census 
involves measuring all people or objects in the population. 

The sample statistic is a characteristic of the sample that is measured; it is often a 
mean, median, mode, proportion, or a measure of variability such as the range or 
standard deviation. The population parameter is the population characteristic that the 
sample statistic attempts to estimate. 

Variables can be either qualitative or quantitative. Qualitative variables indicate 
whether a person or object possesses a given attribute, while quantitative variables 





High Stakes on the Interstate: Car Phones and Accidents 



Do car phones contribute to auto accidents? Preliminary 
research says they may. In one study, the researchers ran- 
domly selected 100 New York motorists who had been in an 
accident and 1 00 who had not. Those who had been in an 
accident were30% more likelytohaveacell phone. In another 
study, published in The New England Journal of Medicine, re- 
searchers found that cell phone use while driving quadrupled 
the chance of having an accident, a risk increase comparable 
to driving with one's blood alcohol level at the legal limit. 

The Cellular Telecommunications Industry Association 
has a natural stake in this issue. There are currently more 
than 180 million cell phone subscribers, tens of thousands 
are signing up daily, and a high percentage of subscribers 
use their phones while driving. The association tends to 



dismiss accident studies such as the ones above as limited, 
flawed, and having research shortcomings. 

One thing is certain: more research is on the way. It will 
be performed by objective researchers as well as by individu- 
als and organizations with a vested interest in the results. Fu- 
ture studies, their methodologies, the allegiances of their 
sponsors, and the interpretation of their results will play an 
important role in the safety of our highways and the eco- 
nomic vitality of our cellular phone industry. 



Sources: "Survey: Car Phone Users Run Higher Risk of Crashes," Indiana 
Gazette, March 19, 1996, p. 10; "Ban Car Phones?" USA Today, April 27, 
2000, p. 16A; "Get Off the Cell Phone," Tribune-Review, January 29, 2000, 
p. A6; and "Cell Phone Use Booms, Despite Uneven Service," USA Today, 
March 14, 2005, p. 2B. 



express how much of an attribute is possessed. Discrete quantitative variables can 
take on only certain values along an interval, with the possible values having gaps 
between them, while continuous quantitative variables can take on a value at any 
point along an interval. 

When a variable is measured, a numerical value is assigned to it, and the result 
will be in one of four levels, or scales, of measurement — nominal, ordinal, inter- 
val, or ratio. The scale to which the measurements belong will be important in 
determining appropriate methods for data description and analysis. 

By helping to reduce the uncertainty posed by largely uncontrollable factors, 
such as competitors, government, technology, the social and economic environ- 
ment, and often unpredictable consumers and voters, statistics plays a vital role in 
business decision making. Although statistics is a valuable tool in business, its tech- 
niques can be abused or misused for personal or corporate gain. This makes it 
especially important for businesspersons to be informed consumers of statistical 
claims and findings. 




1.15 A research firm observes that men are twice as likely 
as women to watch the Super Bowl on television. Does 
this information represent descriptive statistics or inferen- 
tial statistics? Why? 

1.16 For each of the following, indicate whether the 
appropriate variable would be qualitative or quantitative. 
If you identify the variable as quantitative, indicate 
whether it would be discrete or continuous. 

a. Whether you own a Panasonic television set 

b. Your status as either a full-time or a part-time student 

c. The number of people who attended your school's 
graduation last year 




d. The price of your most recent haircut 

e. Sam's travel time from his dorm to the student union 

f. The number of students on campus who belong to a 
social fraternity or sorority 

1.17 What kinds of statistical data play a role in an auto 
insurance firm's decision on the annual premium you'll 
pay for your policy? 

1.18 For each of the following, indicate the scale of mea- 
surement that best describes the information. 

a. In January 2003, Dell Corporation had approximately 
39,100 employees, source: Dell Corporation, 2003 Year in 
Review, p. 21. 
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b. USA Today reports that the previous day's highest tem- 
perature in the United States was 115 degrees in Death 
Valley, California, source: USA Today, June 2, 2003, p. 12A. 

c. An individual respondent answers "yes" when asked if 
TV contributes to violence in the United States. 

d. In a comparison test of family sedans, a magazine 
rates the Toyota Camry higher than the VW Passat. 

1.19 Most undergraduate business students will not go 
on to become actual practitioners of statistical research 
and analysis. Considering this fact, why should such indi- 
viduals bother to become familiar with business statistics? 

1.20 Bill scored 1200 on the Scholastic Aptitude Test and 
entered college as a physics major. As a freshman, he 
changed to business because he thought it was more 
interesting. Because he made the dean's list last semester, 
his parents gave him $30 to buy a new Casio calculator. 
For this situation, identify at least one piece of 
information in the 

a. nominal scale of measurement. 

b. ordinal scale of measurement. 

c. interval scale of measurement. 

d. ratio scale of measurement. 



1.21 Roger Amster teaches an English course in which 40 
students are enrolled. After yesterday's class, Roger ques- 
tioned the 5 students who always sit in the back of the 
classroom. Three of the 5 said "yes" when asked if they 
would like A Tale of Two Cities as the next class reading 
assignment. 

a. Identify the population and the sample in this situation. 

b. Is this likely to be a representative sample? If not, 
why not? 

1 .22 In studying the performance of the company's stock 
investments over the past year, the research manager of a 
mutual fund company finds that only 43% of the stocks 
returned more than the rate that had been expected at the 
beginning of the year. 

a. Could this information be viewed as representing the 
nominal scale of measurement? If so, explain your rea- 
soning. If not, why not? 

b. Could this information be viewed as representing the 
ratio scale of measurement? If so, explain your reason- 
ing. If not, why not? 



"USA Snapshots" Set the Standard 

When it comes to creative visual displays to summarize data, hardly anything on 
the planet comes close to USA Today and its "USA Snapshots" that appear in the 
lower-left portion of the front page of each of the four sections of the newspaper. 
Whether it's "A look at statistics that shape the nation" (section A), "your finances" 
(section B), "the sports world" (section C), or "our lives" (section D), the visual is apt 
to be both informative and entertaining. 

For example, when the imaginative folks who create "USA Snapshots" get their 
hands on some numbers, we can expect that practically any related object that 
happens to be round may end up becoming a pie chart, or that any relevant 
entity that's rectangular may find itself relegated to 
duty as a bar chart. An example of 
this creativity can be seen later in \/r^ua''zing the 
the chapter, in Figure 2.3. 

If you're one of the many millions 
who read USA Today, chances are 
you'll notice a lot of other lively, 
resourceful approaches to the visual 
description of information. Comple- 
menting their extensive daily fare of 
news, editorials, and many other items 
that we all expect a good daily newspa- 
per to present, USA Today and the "USA 
Snapshot" editors set the standard when 
it comes to reminding us that statistics 
can be as interesting as they are 
relevant. 
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learning 
objectives 

After reading this 
chapter, you should 
be able to: 



Construct a frequency distribution and a histogram. 
Construct relative and cumulative frequency distributions. 
Construct a stem-and-leaf diagram to represent data. 
Visually represent data by using graphs and charts. 
Construct a dotplot and a scatter diagram. 
Construct contingency tables. 



2.1 INTRODUCTION 



When data have been collected, they are of little use until they have been organized 
and represented in a form that helps us understand the information contained. 
In this chapter, we'll discuss how raw data are converted to frequency distribu- 
tions and visual displays that provide us with a "big picture" of the information 
collected. 

By so organizing the data, we can better identify trends, patterns, and other 
characteristics that would not be apparent during a simple shuffle through a pile 
of questionnaires or other data collection forms. Such summarization also helps 
us compare data that have been collected at different points in time, by different 
researchers, or from different sources. It can be very difficult to reach conclusions 
unless we simplify the mass of numbers contained in the original data. 

As we discussed in Chapter 1, variables are either quantitative or qualitative. 
In turn, the appropriate methods for representing the data will depend on whether 
the variable is quantitative or qualitative. The frequency distribution, histogram, 
stem-and-leaf display, dotplot, and scatter diagram techniques of this chapter are 
applicable to quantitative data, while the contingency table is used primarily for 
counts involving qualitative data. 



2.2 THE FREQUENCY DISTRIBUTION 

AND THE HISTOGRAM 



Raw data have not been manipulated or treated in any way beyond their original 
collection. As such, they will not be arranged or organized in any meaningful 
manner. When the data are quantitative, two of the ways we can address this 
problem are the frequency distribution and the histogram. The frequency distri- 
bution is a table that divides the data values into classes and shows the 
number of observed values that fall into each class. By converting data to a fre- 
quency distribution, we gain a perspective that helps us see the forest instead 
of the individual trees. A more visual representation, the histogram describes a 
frequency distribution by using a series of adjacent rectangles, each of which 
has a length that is proportional to the frequency of the observations within 
the range of values it represents. In either case, we have summarized the raw 
data in a condensed form that can be readily understood and easily inter- 
preted. 



Chapter 2: Visual Description of Data 
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The Frequency Distribution 

We'll discuss the frequency distribution in the context of a research study that 
involves both safety and fuel-efficiency implications. Data are the speeds (miles 
per hour) of 105 vehicles observed along a section of highway where both accidents 
and fuel-inefficient speeds have been a problem. 



Example 



Raw Data and Frequency Distribution 

Part A of Table 2.1 lists the raw data consisting of measured speeds (mph) of 105 
vehicles along a section of highway. There was a wide variety of speeds, and these 
data values are contained in data file CX02SPEE. If we want to learn more from 
this information by visually summarizing it, one of the ways is to construct a 
frequency distribution like the one shown in part B of the table. 



A. Raw Data 



54.2 


58.7 


71.2 


69.7 


51.7 


75.6 


53.7 


57.0 


82.5 


76.8 


62.1 


67.4 


64.6 


70.5 


48.4 


69.0 


65.2 


64.1 


65.8 


56.9 


82.0 


62.0 


54.3 


73.2 


74.5 


73.6 


76.2 


55.1 


66.9 


62.4 


70.7 


68.3 


62.8 


83.5 


54.8 


68.3 


69.4 


59.2 


60.9 


60.4 


60.2 


75.4 


55.4 


56.3 


77.1 


61.2 


50.8 


67.1 


70.6 


60.7 


56.0 


80.7 


80.1 


56.2 


70.1 


63.7 


70.9 


54.6 


61.6 


63.2 


72.2 


84.0 


76.8 


61.6 


61.1 


80.8 


58.3 


52.8 


74.3 


71.4 


63.2 


57.8 


61.9 


75.8 


80.8 


57.3 


62.6 


75.1 


67.6 


78.0 


52.2 


57.6 


61.1 


66.9 


88.5 


66.7 


61.2 


73.9 


79.1 


70.2 


65.2 


61.3 


69.5 


72.8 


57.5 


71.4 


64.7 


78.4 


67.6 


76.3 


78.6 


66.8 


71.1 


58.9 


61.1 













B. Frequency Distribution (Number of Motorists in Each Category) 



Speed (mph) 



Number of Motorists 



45-under 50 
50-under 55 
55-under 60 
60-under 65 
65-under 70 
70-under 75 
75-under 80 
80-under 85 
85-under 90 



1 

9 
15 
24 
17 
17 
13 
8 
1 



TABLE 2.1 



Raw data and frequency 
distribution for observed 
speeds of 105 vehicles 



Key Terms 

In generating the frequency distribution in part B of Table 2.1, several judgmental 
decisions were involved, but there is no single "correct" frequency distribution for 
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a given set of data. There are a number of guidelines for constructing a frequency 
distribution. Before discussing these rules of thumb and their application, we'll 
first define a few key terms upon which they rely: 

Class Each category of the frequency distribution. 

Frequency The number of data values falling within each class. 

Class limits The boundaries for each class. These determine which data values are 
assigned to that class. 

Class interval The width of each class. This is the difference between the lower 
limit of the class and the lower limit of the next higher class. When a frequency 
distribution is to have equally wide classes, the approximate width of each class is 

Largest value _ Smallest value 
Approximate in raw data in raw data 
class width Number of classes desired 

Class mark The midpoint of each class. This is midway between the upper and 
lower class limits. 



Guidelines for the Frequency Distribution 

In constructing a frequency distribution for a given set of data, the following 
guidelines should be observed: 

1. The set of classes must be mutually exclusive (i.e., a given data value can fall 
into only one class). There should be no overlap between classes, and limits 
such as the following would be inappropriate: 

Not allowed, since a value of 60 could fit into either class: ^ 

60-65 

Not allowed, since there's an overlap between the classes: ^ U ^ ^ 

53-under 58 

2. The set of classes must be exhaustive (i.e., include all possible data values). 
No data values should fall outside the range covered by the frequency 
distribution. 

3. If possible, the classes should have equal widths. Unequal class widths make 
it difficult to interpret both frequency distributions and their graphical 
presentations. 

4. Selecting the number of classes to use is a subjective process. If we have too 
few classes, important characteristics of the data may be buried within the 
small number of categories. If there are too many classes, many categories will 
contain either zero or a small number of values. In general, about 5 to 15 
classes will be suitable. 

5. "Whenever possible, class widths should be round numbers (e.g., 5, 10, 25, 50, 
100). For the highway speed data, selecting a width of 2.3 mph for each class 
would enhance neither the visual attractiveness nor the information value of 
the frequency distribution. 

6. If possible, avoid using open-end classes. These are classes with either no 
lower limit or no upper limit — e.g., 85 mph or more. Such classes may not 
always be avoidable, however, since some data may include just a few values 
that are either very high or very low compared to the others. 
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The frequency distribution in part B of Table 2.1 was the result of applying 
the preceding guidelines. Illustrating the key terms introduced earlier, we will 
refer to the "50-under 55" class of the distribution: 

• Class limits 50-under 55. All values are at least 50, but less than 55. 

• Frequency 9. The number of motorists with a speed in this category. 

• Class interval 5. The difference between the lower class limit and that of the 
next higher class, or 55 minus 50. 

• Class mark 52.5. The midpoint of the interval; this can be calculated as the 
lower limit plus half the width of the interval, or 50 + (0.5)(5.0) = 52.5. 

Relative and Cumulative Frequency Distributions 

Relative Frequency Distribution. Another useful approach to data expression is 
the relative frequency distribution, which describes the proportion or percentage 
of data values that fall within each category. The relative frequency distribution 
for the speed data is shown in Table 2.2; for example, of the 105 motorists, 15 of 
them (14.3%) were in the 55-under 60 class. 

Relative frequencies can be useful in comparing two groups of unequal size, 
since the actual frequencies would tend to be greater for each class within the 
larger group than for a class in the smaller one. For example, if a frequency dis- 
tribution of incomes for 100 physicians is compared with a frequency distribution 
for 500 business executives, more executives than physicians would be likely to 
fall into a given class. Relative frequency distributions would convert the groups 
to the same size: 100 percentage points each. Relative frequencies will play an 
important role in our discussion of probabilities in Chapter 5. 

Cumulative Frequency Distribution. Another approach to the frequency distri- 
bution is to list the number of observations that are within or below each of the 
classes. This is known as a cumulative frequency distribution. When cumulative 
frequencies are divided by the total number of observations, the result is a cumu- 
lative relative frequency distribution. The "Cumulative Relative Frequency (%)" 
column in Table 2.2 shows the cumulative relative frequencies for the speed data in 
Table 2.1. Examining this column, we can readily see that 62.85% of the motorists 
had a speed less than 70 mph. 

Cumulative percentages can also operate in the other direction (i.e., "greater 
than or within"). Based on Table 2.2, we can determine that 90.48% of the 105 
motorists had a speed of at least 55 mph. 



Cumulative 
Relative 
Frequency (%) 




45- 
50- 
55- 
60- 
65- 
70- 
75- 
80- 
85- 



Speed (mph) 



■under 50 
■under 55 
■under 60 
■under 65 
■under 70 
■under 75 
■under 80 
■under 85 
■under 90 



Number of 
Motorists 



Relative 
Frequency (%) 



0.95 
8.57 
14.29 
22.86 
16.19 
16.19 
12.38 
7.62 
0.95 



Cumulative 
Frequency 



10 
25 
49 
66 
83 
96 
104 
105 



0.95 
9.52 
23.81 
46.66 
62.85 
79.05 
91.43 
99.05 
100.00 



Frequencies, relative 
frequencies, cumulative fre- 
quencies, and cumulative 
relative frequencies for the 
speed data of Table 2.1 



9 
15 
24 
17 
17 
13 

8 
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The Histogram 

The histogram describes a frequency distribution by using a series of adjacent 
rectangles, each of which has a length proportional to either the frequency or the 
relative frequency of the class it represents. The histogram in part (a) of Figure 2.1 
is based on the speed-measurement data summarized in Table 2.1. The lower class 
limits (e.g., 45 mph, 50 mph, 55 mph, and so on) have been used in constructing 
the horizontal axis of the histogram. 

The tallest rectangle in part (a) of Figure 2.1 is associated with the 60-under 
65 class of Table 2.1, identifying this as the class having the greatest number of 
observations. The relative heights of the rectangles visually demonstrate how the 
frequencies tend to drop off as we proceed from the 60-under 65 class to the 
65-under 70 class and higher. 



The Frequency Polygon 

Closely related to the histogram, the frequency polygon consists of line segments 
connecting the points formed by the intersections of the class marks with the class 
frequencies. Relative frequencies or percentages may also be used in constructing 
the figure. Empty classes are included at each end so the curve will intersect the hor- 
izontal axis. For the speed-measurement data in Table 2.1, these are the 40-under 
45 and 90-under 95 classes. (Note: Had this been a distribution for which the first 
nonempty class was "0 but under 5," the empty class at the left would have been 
"—5 but under 0.") The frequency polygon for the speed-measurement data is 
shown in part (b) of Figure 2.1. 

Compared to the histogram, the frequency polygon is more realistic in that 
the number of observations increases or decreases more gradually across the 



FIGURE 2.1 




Speed (mph) Speed (mph) 

(a) Histogram (b) Frequency polygon 




40 45 50 55 60 65 70 75 80 85 90 95 

Speed (mph) 

(c) Percentage ogive (less than) 



Histogram, frequency polygon, and percentage ogive for the speed-measurement data summarized 
in Table 2.1 
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various classes. The two endpoints make the diagram more complete by allowing 
the frequencies to taper off to zero at both ends. 

Related to the frequency polygon is the ogive, a graphical display providing 
cumulative values for frequencies, relative frequencies, or percentages. These 
values can be either "greater than" or "less than." The ogive diagram in part (c) of 
Figure 2.1 shows the percentage of observations that are less than the lower limit 
of each class. 

We can use the computer to generate a histogram as well as the underlying 
frequency distribution on which the histogram is based. Computer Solutions 2.1 
describes the procedures and shows the results when we apply Excel and Minitab 
in constructing a histogram for the speed data in Table 2.1. 



computer solutions 2.1 



The Histogram 
EXCEL 



_4_ 
_5_ 
_6_ 

_7_ 
_8_ 
_9_ 
W_ 
V\_ 
12_ 
J3. 
T4 
15 



30 



£ 20 

0) 
3 
W 

O 10 



Histogram 



M 



N 



45 



50 55 60 65 70 75 

mph 



80 85 90 More 



Open the data file CX02SPEE.XLS. The name of the variable (Speed) is in cell A1, and the 105 speeds are in the cells 
immediately below. 

Type Bin into cell C1 . Enter the bin cutoffs (45 to 90, in multiples of 5) into C2:C1 1 . (Alternatively, you can skip this 

step if you want Excel to generate its default frequency distribution.) 

Click Tools. Click Data Analysis. Within Analysis Tools, select Histogram. Click OK. 

Enter the data range (A1:A106) into the Input Range box. If you entered the bin cutoffs as described in step 2, enter 
the bin range (C1:C11) into the Bin Range box. Click to place a check mark in the Labels box. (This is because each 
variable has its name in the first cell of its block.) Select Output Range and enter where the output is to begin — this 
will be cell E1. 

Click to place a check mark into the Chart Output box. Click OK. 

Within the chart, click on the word Bin. Click again, and type in mph. Double-click on any one of the bars in the chart. 
Select Options and set the Gap Width to 0. Click OK. You can further improve the appearance by clicking on the 
chart and changing fonts, item locations, such as the key in the lower right, or the background color of the display. In 
the printout shown here, we have also enlarged the display and moved it slightly to the left. 



(continued) 
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MINITAB 

1 . Open the data file CX02SPEE.MTW. The name of the variable (Speed) is in column C 1 , and the 1 05 speeds are in the 
cells immediately below. 




2. Click Graph. Click Histogram. Select Simple and click OK. Within the Histogram menu, indicate the column to be 
graphed by entering C1 into the Graph variables box. 

3. Click the Labels button, then click the Data Labels tab. Select Use y-value labels. (This will show the counts at the 
top of the bars.) Click OK. Click OK. 

4. On the graph that appears, double-click on any one of the numbers on the horizontal axis. Click the Binning tab. 
In the Interval Type submenu, select Cutpoint. In the Interval Definition submenu, select Midpoint/Cutpoint 
positions and enter 45:90/5 into the box. (This provides intervals from 45 to 90, with the width of each interval 
being 5.) Click OK. 



Excel and Minitab differ slightly in how they describe the classes in a frequency 
distribution. If you use the defaults in these programs, the frequency distributions 
■™ may differ slightly whenever a data point happens to have exactly the same value 
as one of the upper limits because 

1. Excel includes the upper limit but not the lower limit; the Excel bin value of 
"60" in Computer Solutions 2.1 represents values that are "more than 55, but 
not more than 60." A particular speed (x) will be in Excel's interval if 
55 < x < 60. 

2. Minitab includes the lower limit, but not the upper limit, so a category of 
55-60 is "at least 55, but less than 60." Thus, a particular speed (x) will be in 
Minitab's 55-60 interval if 55 < x < 60. 

This was not a problem with the speed data and Computer Solutions 2.1. How- 
ever, if it had been, and we wanted the Excel frequency distribution to be 
the same as Minitab's, we could have simply used Excel bin values that were very 
slightly below those of Minitab's upper limits — for example, a bin value of 44.99 
instead of 45.00, 49.99 instead of 50, 54.99 instead of 55, and so on. 
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2.1 What is a frequency distribution? What benefits 
does it offer in the summarization and reporting of data 
values? 

2.2 Generally, how do we go about deciding how many 
classes to use in a frequency distribution? 

2.3 The National Safety Council reports the following 
age breakdown for licensed drivers in the United States. 

source: Bureau of the Census, Statistical Abstract of the United States 
2006, p. 721. 



Age (Years) 



Licensed Drivers (Millions) 



under 20 
20-under25 
25-under 35 
35-under 45 
45-under 55 
55-under 65 
65 or over 



9.50 
16.50 
34.02 
40.88 
40.74 
26.17 
28.90 



Identify the following for the 35-under 45 class: 
(a) frequency, (b) upper and lower limits, (c) width, 
and (d) midpoint. 

2.4 Using the frequency distribution in Exercise 2.3, 
identify the following for the 25-under 35 class: 

(a) frequency, (b) upper and lower limits, (c) width, 
and (d) midpoint. 

2.5 The National Center for Health Statistics reports the 
following age breakdown of deaths in the United States 
during 2002. source: The New York Times Almanac 2006, p. 384. 

Age (Years) Number of Deaths (Thousands) 



under 1 5 
1 5-under 25 
25-under 35 
35-under 45 
45-under 55 
55-under 65 
65-under 75 
75 or over 



40.01 
33.08 
41.38 
91.01 
172.21 
253.73 
424.1 1 
1391.99 



Identify the following for the 45-under 55 class: 
(a) frequency, (b) upper and lower limits, (c) width, 
and (d) midpoint. 

2.6 Using the frequency distribution in Exercise 2.5, 
identify the following for the 15-under 25 class: 
(a) frequency, (b) upper and lower limits, (c) width, 
and (d) midpoint. 



2.7 What is meant by the statement that the set of classes 
in a frequency distribution must be mutually exclusive 
and exhaustive? 



2.8 For commercial banks in each state, the U.S. Federal 
Deposit Insurance Corporation has listed their total 
deposits (billions of dollars) as follows, source: Bureau of the 
Census, Statistical Abstract of the United States 2006, p. 764. 

Deposits Deposits Deposits 


AL 


62.6 


LA 


55.2 


OH 


200.2 


AK 


6.0 


ME 


16.7 


OK 


46.3 


AZ 


61.8 


MD 


82.1 


OR 


39.2 


AR 


38.7 


MA 


172.7 


PA 


210.7 


CA 


671.1 


Ml 


136.1 


Rl 


19.9 


CO 


64.5 


MN 


94.4 


SC 


48.1 


CT 


73.8 


MS 


33.5 


SD 


53.3 


DE 


105.8 


MO 


87.1 


TN 


90.2 


FL 


301.0 


MT 


1 1.9 


TX 


310.3 


GA 


132.0 


NE 


32.9 


UT 


102.0 


HI 


23.1 


NV 


40.5 


VT 


9.0 


ID 


13.8 


NH 


29.4 


VA 


147.8 


IL 


281.9 


NJ 


21 1.3 


WA 


87.4 


IN 


81.1 


NM 


18.2 


WV 


22.7 


IA 


51.2 


NY 


637.6 


Wl 


96.1 


KS 


46.5 


NC 


163.9 


WY 


7.9 


KY 


56.9 


ND 


1 1.4 







Construct a frequency distribution and a histogram for 
these data. 

2.9 The accompanying data describe the hourly wage 
rates (dollars per hour) for 30 employees of an electronics 



firm: 
















22.66 


24.39 


17.31 


21.02 


21.61 


20.97 


18.58 


16.61 


19.74 


21.57 


20.56 


22.16 


20.16 


18.97 


22.64 


19.62 


22.05 


22.03 


17.09 


24.60 


23.82 


17.80 


16.28 


19.34 


22.22 


19.49 


22.27 


18.20 


19.29 


20.43 







Construct a frequency distribution and a histogram for 
these data. 

2.10 The following performance scores have been 
recorded for 25 job applicants who have taken a pre- 
employment aptitude test administered by the company 
to which they applied: 

66.6 75.4 66.7 59.2 78.5 80.8 79.9 87.0 94.1 
70.2 92.8 86.9 92.8 66.8 65.3 100.8 76.2 87.8 
71.0 92.9 97.3 82.5 78.5 72.0 76.2 

Construct a frequency distribution and a histogram for 
these data. 
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2.11 During his career in the NHL, hockey great Wayne 
Gretzky had the following season-total goals for each of his 
20 seasons, source: The World Almanac and Book of Facts 2000, p. 954. 

51 55 92 71 87 73 52 62 40 54 
40 41 31 16 38 11 23 25 23 9 

Construct a frequency distribution and a histogram for 
these data. 

2.12 According to the U.S. Department of Agriculture, 
the distribution of U.S. farms according to value of 
annual sales is as follows, source: Bureau of the Census, Statisti- 
cal Abstract of the United States 2006, p. 546. 



Farms 

Annual Sales (Thousands) 



under $10,000 1262 

$10,000-under $25,000 256 

$25,000-under $50,000 1 58 

$50,000-under $ 1 00,000 1 40 

$100,000-under $250,000 159 

$250,000-under $500,000 82 

$500,000-under $ 1 ,000,000 42 

$1,000,000 or more 29 



Convert this information to a 

a. Relative frequency distribution. 

b. Cumulative frequency distribution showing "less than 
or within" frequencies. 

2.13 Convert the distribution in Exercise 2.3 to a 

a. Relative frequency distribution. 

b. Cumulative frequency distribution showing 
"less than or within" frequencies. 

2.14 Convert the distribution in Exercise 2.8 to a 

a. Relative frequency distribution. 

b. Cumulative relative frequency distribution showing 
"greater than or within" relative frequencies. 



2.15 Using the frequency distribution obtained in Exer- 
cise 2.8, convert the information to a "less-than" ogive. 

2.16 For the frequency distribution constructed in Exer- 
cise 2.11, convert the information to a "less-than" ogive. 

/ data set/ Note: Exercises 2.17-2.19 require a 
computer and statistical software. 

2.17 The current values of the stock portfolios for 80 
clients of an investment counselor are as listed in data file 
XR02017. Use your computer statistical software to gener- 
ate a frequency distribution and histogram describing this 
information. Do any of the portfolio values seem to be 
especially large or small compared to the others? 

2.18 One of the ways Keynote Systems, Inc., measures 
Internet shopping site performance is by visiting a site 
and measuring how long it takes to come up on the user's 
PC. In one such study, they found the average time for an 
Internet shopping site to come up was 21.80 seconds. 
Assume that website administrators for sears.com try 
their own site on 80 random occasions and come up with 
the times (in seconds) contained in data file XR02018. 
Generate a frequency distribution and histogram describ- 
ing this information. Comment on whether the site 
seemed to be especially fast or slow coming up on any of 
the visits, source: "How Key Web Sites Handle Holiday Shopping 
Rush," USA Today, November 24, 1999, p. 3B. 

2.19 A company executive has read with interest the find- 
ing that the average U.S. office worker receives 36 e-mails 
per day. Assume that an executive, wishing to replicate 
this study within her own corporation, directs information 
technology personnel to find out the number of e-mails 
each of a sample of 100 office workers received yesterday, 
with the results as provided in the data file XR02019. 
Generate a frequency distribution and histogram describ- 
ing this information and comment on the extent to which 
some workers appeared to be receiving an especially high 
or low number of e-mails, source: Anne R. Carey and Genevieve 
Lynn, "Message Overload?", USA Today, September 13, 1999, p. 1B. 



2.3 THE STEM-AND-LEAF DISPLAY 

AND THE DOTPLOT 



In this section, we examine two additional methods of showing how the data 
values are distributed: the stem-and-leaf display and the dotplot. Each of the 
techniques can be carried out either manually or with the computer and statistical 
software. 

The Stem-and-Leaf Display 

The stem-and-leaf display, a variant of the frequency distribution, uses a subset of 
the original digits as class descriptors. The technique is best explained through 
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a few examples. The raw data are the numbers of Congressional bills vetoed 
during the administrations of seven U.S. presidents, from Johnson to Clinton. 1 



Johnson Nixon Ford Carter Reagan Bush Clinton 

Vetoes 30 43 66 31 78 44 38 

In stem -an d-leaf terms, we could describe these data as follows: 

Stem (10's Digit) Leaf (Vs Digit) 

3|018 (represents 30, 31, and 38) 

4|34 (represents 43 and 44) 

5| (no data values in the 50s) 

6|6 (represents 66) 

7|8 (represents 78) 



The figure to the left of the divider ( I ) is the stem, and the digits to the right 
are referred to as leaves. By using the digits in the data values, we have identified 
five different categories (30s, 40s, 50s, 60s, and 70s) and can see that there are 
three data values in the 30s, two in the 40s, one in the 60s, and one in the 70s. 

Like the frequency distribution, the stem-and-leaf display allows us to quickly 
see how the data are arranged. For example, none of these presidents vetoed more 
than 44 bills, with the exceptions of Presidents Ford and Reagan, who vetoed 66 
and 78, respectively. Compared with the frequency distribution, the stem-and-leaf 
display provides more detail, since it can describe the individual data values as 
well as show how many are in each group, or stem. Computer Solutions 2.2 
shows the procedure and results when Excel and Minitab are used to generate a 
stem-and-leaf display based on the speed data of Table 2.1. 

When the stem-and-leaf display is computer generated, the result may vary 
slightly from the previous example of the presidential vetoes. For example, 
Minitab may do the following: 

1. Break each stem into two or more lines, each of which will have leaves in a 
given range. In Computer Solutions 2.2, speeds in the 50s are broken into two 
lines. The first line includes those with a leaf that is 0 to 4 (i.e., speeds from 
50.x to 54.x), the second covers those for which the leaf is from 5 to 9 
(i.e., speeds from 55.x to 59.x). 

2. Include stem-and-leaf figures for outliers. An outlier is a data value very distant 
from most of the others. When outliers are present, they can be shown separately 
in a "HI" or a "LO" portion to avoid stretching the display and ending up with 
a large number of stems that have no leaves. 

The stem-and-leaf display shows just two figures for each data value. For 
example, the data value 1475 could be represented as 

1|4 (1000's digit and 100's digit) 

or 

14|7 (100's digit and 10's digit) 

or 

147|5 (10's digit and l's digit) 

In a given stem-and-leaf display, only one of these alternatives could be 
used because (1) a given data value can be represented only once in a display, 

Source: Bureau of the Census, Statistical Abstract of the United States 2006, p. 258. 
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computer solutions 2.2 






«.-. 
1 





The Stem-and-Leaf Display 



EXCEL 





A 


B 


C 


D 


E 


F 


1 


Stem & Leaf Display 










2 














3 


Stems 


Leaves 










4 


4 


->8 










5 


5 


->01 2234444556666777778889 






6 


6 


->00001 11111111 2222233344455566667777889999 


7 


7 


->0000001 1 1 122333445555666678889 




8 


8 


->000022348 









1 . Open the data file CX02SPEE.XLS. Click on A1 and drag to A106 to select the label and data values in cells A1:A106. 
Click Tools. Click Data Analysis Plus. 

2. In the Data Analysis Plus menu, click Stem and Leaf Display. Click OK. Ensure that the Input Range box contains 
the range you specified in step 1 . In the Increment menu box, select 10 (because the 10's digit is the stem for the 
display). Click Labels. Click OK. 



MINITAB 

Stem-and-leaf of Speed N = 105 
Leaf Unit = 1.0 



1 


4 


8 


10 


5 


012234444 


25 


5 


556666777778889 


49 


6 


000011111111122222333444 


(17) 


6 


55566667777889999 


39 


7 


00000011112233344 


22 


7 


5555666678889 


9 


8 


00002234 


1 


8 


8 



1 . Open the data file CX02SPEE.MTW. The variable (Speed) is in column C 1 . 

2. Click Graph. Click Stem-And-Leaf. Enter C1 into the Variables box. Click OK. 



and (2) all of the values must be expressed in terms of the same stem digit and the 
same leaf digit. If we were to deviate from either of these rules, the resulting dis- 
play would be meaningless. 

When data values are in decimal form, such as 3.4, 2.5, and 4.1, the stem-and- 
leaf method can still be used. In expressing these numbers, the stem would be 
the l's digit, and each leaf would correspond to the first number to the right of 
the decimal point. For example, the number 3.4 would be converted to a stem of 
3 and a leaf of 4. The location of the decimal point would have to be considered 
during interpretation of the display. 
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computer solutions 2.3 



The Dotplot 
EXCEL 

Excel does not currently provide the dotplot display. 

MINITAB 





Open the data file CX02SPEE.MTW. The variable (Speed) is in column C1 . 

Click Graph. Click Dotplot. Select One Y Simple. Click OK. Enter C1 into the Variables box. Click OK. 



The Dotplot 

The dotplot displays each data value as a dot and allows us to readily see the 
shape of the distribution as well as the high and low values. Computer Solutions 
2.3 shows the procedure and the result when we apply Minitab's dotplot feature 
to the speed data shown previously in Table 2.1. 




2.20 Construct a stem-and-leaf display for the following 2.21 Construct a stem-and-leaf display for the following 
data: data: 
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2.22 In the following stem-and-leaf display for a set of 
two-digit integers, the stem is the 10's digit, and each leaf 
is the l's digit. What is the original set of data? 

2|002278 
3|01 1 359 
4|1344 
5|47 

2.23 In the following stem-and-leaf display for a set of 
three-digit integers, the stem is the 100's digit, and each 
leaf is the 10's digit. Is it possible to determine the exact 
values in the original data from this display? If so, list the 
data values. If not, provide a set of data that could have 
led to this display. 

6|1 33579 
7|00257 
8|66899 
9|0003568 

/ data Set / Note: Exercises 2.24-2.28 require a 
computer and statistical software capable of constructing 
stem-and-leaf and dotplot displays. 
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2.24 Construct stem-and-leaf and dotplot displays based 
on the investment portfolio values in the data file 
(XR02017) for Exercise 2.17. 

2.25 Construct stem-and-leaf and dotplot displays based 
on the website access times in the data file (XR02018) for 
Exercise 2.18. 

2.26 Construct stem-and-leaf and dotplot displays based 
on the e-mail receipts in the data file (XR02019) for 
Exercise 2.19. 

2.27 City tax officials have just finished appraising 
60 homes near the downtown area. The appraisals (in 
thousands of dollars) are contained in data file XR02027. 
Construct stem-and-leaf and dotplot displays describing 
the appraisal data. 

2.28 The manufacturer of a water-resistant watch 
has tested 80 of the watches by submerging each one 
until its protective seal leaked. The depths (in meters) 
just prior to failure are contained in data file XR02028. 
Construct stem-and-leaf and dotplot displays for these 
data. 



2.4 OTHER METHODS FOR VISUAL REPRESENTATION 

OF THE DATA 



In this section, we will examine several other methods for the graphical represen- 
tation of data, then discuss some of the ways in which graphs and charts can be 
used (by either the unwary or the unscrupulous) to deceive the reader or viewer. 
We will also provide several Computer Solutions to guide you in using Excel and 
Minitab to generate some of the more common graphical presentations. These are 
just some of the more popular approaches. There are many other possibilities. 

The Bar Chart 

Like the histogram, the bar chart represents frequencies according to the relative 
lengths of a set of rectangles, but it differs in two respects from the histogram: ( 1 ) the 
histogram is used in representing quantitative data, while the bar chart represents 
qualitative data; and (2) adjacent rectangles in the histogram share a common side, 
while those in the bar chart have a gap between them. Computer Solutions 2.4 
shows the Excel and Minitab procedures for generating a bar chart describing the 
number of U.S. nonrecreational airplane pilot certificates in each of four categories. 2 
The relevant data file is CX02AIR. 

The Line Graph 

The line graph is capable of simultaneously showing values of two quantitative 
variables (y, or vertical axis, and x, or horizontal axis); it consists of linear segments 

2 Source: General Aviation Manufacturers Association, General Aviation Statistical Databook 2005 
Edition, p. 24. 
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connecting points observed or measured for each variable. When x represents time, 
the result is a time series view of the y variable. Even more information can be 
presented if two or more y variables are graphed together. Computer Solutions 2.5 
shows the Excel and Minitab procedures for generating a line chart showing 
annual net income for McDonald's Corporation from 1996 through 2005. 3 The 
pertinent data file is CX02MCD. 



m 



computer solutions 2.4 



The Bar Chart 
EXCEL 
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B 


C 


D 


E F G H I 
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Nonrecreational Airplane Pilot Certificates 
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3 


Certificate Type 


Number (thousands) 








4 


Student 




87.213 




Transport 




5 


Private 




228.619 




o 
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Student 




11 














12 
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13 
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1 . Open the data file CX02AIR.XLS. The labels, Certificate Type and Number (thousands), have already been entered 
into A3 and C3, respectively. The name for the first category has already been entered into A4, and its corresponding 
numerical value (thousands of pilots) has been entered into C4. We have continued downward, entering category 
names into column A and corresponding numerical values into column C, until all category names and their values 
have been entered into columns A and C. 

2. Click on A4 and drag to C7 to select cells A4:C7. Click the ChartWizard toolbar button. 

3. In the Chart type box, click Bar. In the Chart sub-type box, click the first choice, identified as Clustered Bar. Click 
Next. 

4. Ensure that the Data range box contains the range you specified in step 2. In the Series in section, select Columns. 
Click Next. 

5. Under the Titles tab of the Chart Options menu, enter U.S. Airplane Pilot Certificates into the Chart title box. 
Enter Certificate Type into the Category axis box. Enter Number (thousands) into the Value axis box. Click 
Next. 

6. In the Place chart section of the next menu, select As object in and retain the Sheetl that already appears in the 
box, if this is satisfactory as the location for the chart. Click Finish. 

7. To further improve the appearance, click on the chart and drag the borders to expand it vertically and horizontally. 
If you don't want the little " Legend " box that originally appears, just right-click within it, then click Clear. 



(continued) 



^Source: McDonald's Corporation, 2005 Annual Report, p. 1. 
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MINITAB 

1 . Open the data file CX02AIR.MTW. The labels, Certificate Type and Number (thousands), have already been entered 
at the top of columns C1 and C2, respectively. The names of the categories have already been entered into C1, and 
the numerical values (thousands of pilots) have been entered into C2. 

2. Click Graph. Click Bar Chart. In the Bars represent box, select Values from a table. Select One column of values. 
Simple. Click OK. Within the Graph variables menu, enter C2. Enter C1 into the Categorical variable box. Click 
Scale. Select the Axes and Ticks tab and select Transpose value and category scales. Click OK. Click OK. 



Chart of Number (thousands) vs Certificate Type 



Student - 



01 

§; Private - 



3 



J; Commercial - 



Transport - 
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Number (thousands) 
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The Line Chart 
EXCEL 



I 



B 



2 

_3_ 
J_ 
_5_ 
_6_ 

7 

_8_ 
_9_ 

_m 

XL 

13 



McDonald's Net Income 



Year 



1996 



1997 



1998 



1999 



2000 



2001 



2002 



2003 
2004 
2005 



Net Income 



1.573 



1.642 



1.550 



1.948 



1.977 
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0.893 



1.471 
2.279 
2.602 
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1 . Open the data file CX02MCD.XLS. The labels and data are already entered, as shown in the printout. Click on A4 and 
drag to B13 to select cells A4:B13. (Note: Within the two columns, the variable to be represented by the vertical axis 
should always be in the column at the right.) 

2. Click the ChartWizard toolbar button. In the Chart type box, click XY (Scatter). In the Chart sub-type box, click 
the fourth choice, identified as Scatter with data points connected by lines. Click Next. 

3. Ensure that the Data range box contains the range you specified in step 1 . In the Series in section, select Columns. 
Click Next. 

4. Under the Titles tab of the Chart Options menu, enter McDonald's Corporation into the Chart title box. Enter 
Year into the Value (X) axis box. Enter Net Income (Sbillions) into the Value (Y) axis box. Click Next. 

5. In the Place chart section of the next menu, select As object in and retain the Sheetl that already appears in the 
box, if this is satisfactory as the location for the chart. Click Finish. 

6. Further appearance improvements can be made by clicking on the chart and dragging the borders to expand it vertically 
and horizontally. If you don't want the little "Legend" box that originally appears, just right-click within it, then click Clear. 



MINITAB 

1 . Open the data file CX02MCD.MTW. The labels, Year and Net Income, have already been entered at the top of columns 
C1 and C2, respectively. The years and the numerical values have been entered into C1 and C2, respectively. 

2. Click Graph. Click Scatterplot. Select With Connect Line. Click OK. Enter C2 into the first line of the Y variables 
box. Enter C1 into the first line of the X variables box. Click OK. 

3. Double-click on the chart title and the vertical-axis labels and revise as shown. 



McDonald's Corporation 

2.75 H 




1996 1998 2000 2002 2004 2006 

Year 



The Pie Chart 

The pie chart is a circular display divided into sections based on either the num- 
ber of observations within or the relative values of the segments. If the pie chart 
is not computer generated, it can be constructed by using the principle that a cir- 
cle contains 360 degrees. The angle used for each piece of the pie can be calcu- 
lated as follows: 

Number of degrees Relative value 

— X 360 

for the category of the category 
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For example, if 25% of the observations fall into a group, they would be repre- 
sented by a section of the circle that includes (0.25 X 360), or 90 degrees. Com- 
puter Solutions 2.6 shows the Excel and Minitab procedures for generating a 
pie chart to show the relative importance of three major business segments in 
contributing to Black & Decker Corporation's overall profit. 4 For additional 
emphasis, we can even "explode" one or more of the pie slices. The underlying 
data file is CX02BLD. 



The Pictogram 

Using symbols instead of a bar, the pictogram can describe frequencies or other 
values of interest. Figure 2.2 is an example of this method; it was used by 
Panamco to describe soft drink sales in Central America over a 3-year period. In 
the diagram, each truck represents about 12.5 million cases of soft drink prod- 
ucts. When setting up a pictogram, the choice of symbols is up to you. This is an 
important consideration because the right (or wrong) symbols can lend nonverbal 
or emotional content to the display. For example, a drawing of a sad child with 
her arm in a cast (each symbol representing 10,000 abused children) could help to 
emphasize the emotional and social costs of child abuse. 



computer solutions 2.6 




The Pie Chart 
EXCEL 
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1 . Open the data file CX02BLD.XLS. The segment names, Power Tools, Hardware & Home, and Fasteners, have already 
been entered as shown in the display, as have the sales for each segment. 

2. Click on A4 and drag to B6 to select cells A4:B6. Click the ChartWizard toolbar button. 

3. In the Chart type box, click Pie. In the Chart sub-type box, click the first choice, identified as Pie. Displays the 
contribution of each value to a total. Click Next. 

4. Ensure that the Data range box contains the range you specified in step 2. In the Series in section, select Columns. 
Click Next. 

5. Under the Titles tab of the Chart Options menu, enter Black & Decker Corporation into the Chart title box. Under 
the Data labels tab, select Category name and Percentage. Click to place a check mark next to Show leader lines. 

Under the Legend tab, click to remove the check mark from Show legend. Click Next. 



4 Source: Black & Decker Corporation, 1999 Annual Report, p. 1. 
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6. In the Place chart section of the next menu, select As object in and retain the Sheetl that already appears in the 
box, if this is satisfactory as the location for the chart. Click Finish. 

7. Further appearance improvements can be made by clicking just outside the pie and expanding it slightly, by changing 
the fonts in the labels, or by pulling out a slice of the pie, as shown in the display. 



MINITAB 

1 . Open the data file CX02BLD.MTW. The labels, Segment and Sales, have already been entered at the top of columns 
C1 and C2, respectively. The names of the segments have already been entered into C1 and the numerical values (sales, 
millions of dollars) have been entered into C2. 

2. Click Graph. Click Pie Chart. Select Chart values from a table. Enter C1 into the Categorical variable box. Enter 
C2 into the Summary variables box. Click OK. 

3. Double-click on the chart title and revise as shown. 



Black & Decker Corporation Sales by Segment 




Segment 
Q Power Tools 
n Hardware & Home 
I I Faileners 



Total Soft Drink Sales 

in millions of unit cases 



33.3 



21.6 



1996 



1997 



62.7 




m 



1998 



FIGURE 2.2 



In the pictogram, the 
symbols represent 
frequencies or other 
values of interest. This 
chart shows how soft 
drink sales (millions of 
cases) in Central America 
increased from 1996 
through 1998. 



Source: Panamco, Annual Report 1998, p. 24. 
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FIGURE 2.3 



One of the many creative 
ways USA Today and its 
"USA Snapshots" present 
informative statistics in an 
interesting way 



USA SNAPSHOTS® 



Workers take happiness over money 

What Americans value most about work: 



Job 

satisfaction 
65% 



Being paid 
well 
-34% 




Having an 
important title 

'% 



Source: 2001 G 



<] Yoo and Quin Tian, USA TODAY 



Source: "Workers Take Happiness Over Money," USA Today. Copyright 2002. Reprinted with permission. 



The Sketch 



Varying in size depending on the frequency or other numerical value displayed, 
the sketch is a drawing or pictorial representation of some symbol relevant to the 
data. This approach is demonstrated in part (c) of Figure 2.4. 



Other Visuals 

The preceding approaches are but a few of the many possibilities for the visual de- 
scription of information. As noted in the chapter-opening vignette, USA Today 
readers are treated daily to a wide variety of interesting and informative displays, 
including the "Statistics That Shape Our ..." visuals in the lower-left corner of the 
front page of each section. One of these displays is shown in Figure 2.3. 



The Abuse of Visual Displays 

Remember that visuals can be designed to be either emotionally charged or pur- 
posely misleading to the unwary viewer. This capacity to mislead is shared by a 
great many statistical tests and descriptions, as well as visual displays. We will 
consider just a few of the many possible examples where graphical methods could 
be viewed as misleading. 

During 2004, 29.0 million DVD videos were shipped, compared to 17.5 mil- 
lion in 2003. 5 Using these data, we can construct the charts in Figure 2.4. In part 
(a), the vertical axis is given a very high value as its upper limit, and the data are 
"compressed" so that the increase in shipments isn't very impressive. In part (b) of 
Figure 2.4, taking a slice from the vertical axis causes the increase in shipments to 
be more pronounced. Another strategy for achieving the same effect is to begin the 
vertical axis with a value other than zero. 



5 Data source: Data (but not the visual displays) are from The World Almanac and Book of Facts 
2006, p. 276. 
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2003 2004 
(c) Comparing sketches that have proportional dimensions 



FIGURE 2.4 



The information presented 
may be distorted in several 
ways. Part (a) shows the 
effect of compressing the 
data by using a high end- 
point for the vertical axis. In 
part (b), the change is exag- 
gerated by taking a slice 
from the vertical axis. In 
part (c), although the 
sketch representing 2004 is 
65.7% higher and 65.7% 
wider than the 2003 
sketch, the increase is dis- 
torted because the area of 
the 2004 sketch is 274.6% 
as large as that of the 2003 
sketch. 



In part (c) of Figure 2.4, we see how shipments in the 2 years could be com- 
pared using a sketch of a DVD video case. Although the sketch on the right is 
65.7% higher than the one on the left, it looks more than 65.7% larger. This is 
because area for each sketch is height times width, and both the height and the 
width are 65.7% greater for the one at the right. Because both height and width 
are increased, the sketch for 2004 shipments has an area 274.6% (based on 
1.657 X 1.657) as large as its 2003 counterpart. 

Figure 2.5 (page 36) is another example of visual distortion by using a 
nonzero vertical axis. Over the years shown, sales increased by only 1.4%, and 
this dismal performance is evident from part (a) of the figure. However, in part 
(b), the starting point for the vertical axis is 1080 cases instead of 0, and the ver- 
tical axis is stretched considerably. As a result, it looks as though the company has 
done very well over the years shown. 



flp* Exercises 




Note: These graphs and charts can be done by hand, but, 
if possible, use the computer and your statistical software. 

2.29 What is the difference between a histogram and a bar 
chart? For what type of data would each be appropriate? 



2.30 The following U.S. market shares for automobile 
and light-truck sales for 2001 and 2002 have been 
reported, source: Crain Communications, Inc., Automotive News 2003 
Market Data Book, p. 26. 
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FIGURE 2.5 



If the vertical scale is 
stretched and has a starting 
point that is greater than 
zero, the sales for this firm 
can look a lot better than 
the 1 .4% growth that 
occurred over the 
8-year period. 
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Country or Region 
of Manufacture 


2001 


2002 


North America 


82.2% 


80.7% 


Japan 


9.6 


10.5 


Europe 


4.6 


5.0 


Korea 


3.6 


3.8 



a. Express the data for the 2001 model year in the form 
of a bar chart. 

b. Express the data for the 2002 model year in the form 
of a bar chart. 

2.31 The median income for a four-person family has 
been reported as shown here for 1993-2003. 

source: Time Almanac 2006, p. 627. 



2.32 For McDonald's Corporation, net income per share of 
common stock was as follows for the period 1995-2005. 

source: McDonald's Corporation, 2005 Annual Report, p. 1. 



1995 


$0.97 


2001 


$1.25 


1996 


1.08 


2002 


0.70 


1997 


1.15 


2003 


1.15 


1998 


1.10 


2004 


1.79 


1999 


1.39 


2005 


2.04 


2000 


1.46 







Construct a line graph describing these data over time. 

2.33 For the period 1995-2005, McDonald's Corpora- 
tion paid the following dividends (rounded to nearest 
penny) per share of common stock, source: McDonald's 



1993 


$45,161 


1999 


$59,981 


Corporation, 2005 Annual Report, p. 1. 
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Using these data and the data in Exercise 2.32, draw a 
line graph that describes both net income and dividend 
per share of common stock over time. (Both lines will be 
plotted on the same graph.) 

2.34 During 2001, the Campbell Soup Foundation 
provided the following amounts in grants: Camden, N.J., 
$1,336,700; plant communities, $341,500; Dollars for 
Doers, $179,600; other projects, $64,100. source: The 
Campbell Soup Foundation, 2002 Annual Report, p. 16. 

a. Construct a pie chart to summarize these 
contributions. 

b. Construct a bar chart to summarize these 
contributions. 

c. Why is it appropriate to construct a bar chart for 
these data instead of a histogram? 

2.35 It has been estimated that 95.5% of U.S. households 
own a telephone and that 68.5% have cable television. 
source: Bureau of the Census, Statistical Abstract of the United States 
2006, p. 737. 
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a. Why can't this information be summarized in one pie 
chart? 

b. Construct two separate pie charts to summarize tele- 
phone ownership and cable television service. 

2.36 Federal outlays for national defense totaled $348 
billion in 2002 compared with just $134 billion in 1980. 

source: Bureau of the Census, Statistical Abstract of the United States 
2002, p. 326. 

a. Using appropriate symbols that would reflect 
favorably on such an increase, construct a pictogram 
to compare 2002 with 1980. 

b. Using appropriate symbols that would reflect 
unfavorably on such an increase, construct a 
pictogram to compare 2002 with 1980. 

2.37 For the data in Exercise 2.36, use the sketch 
technique (and an appropriate symbol of your choice) to 
compare the 2002 and 1980 expenditures. 



THE SCATTER DIAGRAM 2.5 



There are times when we would like to find out whether there is a relationship 
between two quantitative variables — for example, whether sales is related to 
advertising, whether starting salary is related to undergraduate grade point average, 
or whether the price of a stock is related to the company's profit per share. To exam- 
ine whether a relationship exists, we can begin with a graphical device known as the 
scatter diagram, or scatterplot. 

Think of the scatter diagram as a sort of two-dimensional dotplot. Each point 
in the diagram represents a pair of known or observed values of two variables, 
generally referred to as y and x, with y represented along the vertical axis and 
x represented along the horizontal axis. The two variables are referred to as the 
dependent (y) and independent (x) variables, since a typical purpose for this type 
of analysis is to estimate or predict what y will be for a given value of x. 

Once we have drawn a scatter diagram, we can "fit" a line to it in such a way 
that the line is a reasonable approximation to the points in the diagram. In view- 
ing the "best-fit" line and the nature of the scatter diagram, we can tell more 
about whether the variables are related and, if so, in what way. 

1. A direct (positive) linear relationship between the variables, as shown in part 
(a) of Figure 2.6 (page 38). The best-fit line is linear and has a positive slope, 
with both y and x increasing together. 

2. An inverse (negative) linear relationship between the variables, as shown in 
part (b) of Figure 2.6. The best-fit line is linear and has a negative slope, with 
y decreasing as x increases. 

3. A curvilinear relationship between the variables, as shown in part (c) of 
Figure 2.6. The best-fit line is a curve. As with a linear relationship, a curvi- 
linear relationship can be either direct (positive) or inverse (negative). 
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FIGURE 2.6 



When two variables 
are related, as in parts (a) 
through (c), the relationship 
can be either direct 
(positive) or inverse 
(negative), and either linear 
or curvilinear. In part (d), 
there is no relationship at 
all between the variables. 



(a) Direct, linear relationship 



(b) Inverse, linear relationship 



• • • • • 



(c) Direct, curvilinear relationship (d) No relationship 



4. No relationship between the variables, as shown in part (d) of Figure 2.6. The 
best-fit line is horizontal, with a slope of zero and, when we view the scatter dia- 
gram, knowing the value of x is of no help whatsoever in predicting the value of y. 

In this chapter, we will consider only linear relationships between variables, and 
there will be two possibilities for fitting a straight line (i.e., a linear equation) to the 
data. The first possibility is the "eyeball" method, using personal judgment and a 
ruler applied to the scatter diagram. The second, more accurate, approach is to use 
the computer and your statistical software to fit a straight line that is mathematically 
optimum. This technique, known as the "least-squares" method, will be discussed 
in much more detail in Chapter 15, Simple Linear Regression and Correlation. 







example 








Best-Fit Equation 

In 1999, opening-day Major League Baseball team payrolls ranged from $88,237 
million for the New York Yankees to just $17,617 million for the Montreal Expos. 6 
During the 162 -game season, the Yankees won 98 games; the Expos, just 68. 7 The 
Yankees went on to sweep the Atlanta Braves in the World Series. The payroll and 
wins data for the Yankees, Expos, and the other major league teams are contained in 



^Source: Hal Bodley, "Earnings, Salary Gaps Most Crucial Issue," USA Today, April 2, 1999, p. 13C. 
7 Source: The Time Almanac 2000, p. 1012. 
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the data file CX02BASE. A partial listing of the data for the 30 teams is shown here: 



Team 


x = Payroll 


y = Wins 


New York Yankees 


$88,237 


98 


Boston Red Sox 


62.708 


94 


Toronto Blue Jays 


37.825 


84 


Baltimore Orioles 


81.942 


78 


San Diego Padres 


49.782 


74 


Colorado Rockies 


61.685 


72 



SOLUTION 

All 30 teams are represented in the scatter diagram of Figure 2.7. The two vari- 
ables do appear to be related — teams with a higher payroll did tend to win more 
games during the season. It makes sense that better players will win more games, 
and that better players must be paid more money, thus it comes as no surprise that 
the higher-paid teams won more games. However, the scatter diagram has pro- 
vided us with a visual picture that reinforces our intuitive feelings about wins and 
dollars. 

We can use Excel or Minitab to easily generate both a scatter diagram for the 
data and the linear equation that best fits the 30 data points. The procedures and 
results are shown in Computer Solutions 2.7, and the resulting equation can be 
written as 

Wins = 64.368 + 0.329 X Payroll 

The following are among the important applications for this kind of analysis 
and best-fit equation: 

1. Given a value for x, we can use the equation to estimate a value for y. For 
example, if a team had a payroll of $50 million, we estimate that this team 
would have had 64.368 + 0.329(50), or 80.818 wins during this season. This 
is obtained simply by substituting x = $50 million into the equation and 
calculating an estimated value for y = wins. 
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FIGURE 2.7 



Scatter diagram of Wins 
versus Payroll (millions of 
dollars) for the 30 major 
league baseball teams dur- 
ing the 1999 season. Teams 
with a higher payroll 
tended to win more games. 



Data sources: Hal Bodley, "Earnings, Salary Gaps Most Crucial Issue," USA Today, April 2, 1999, p. 13C, and The Time 
Almanac 2000, p. 1012. 
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2. We can interpret the slope of the equation — in this case, 0.329. Keeping in mind 
that the slope of the equation is (change in y)/(change in x), we can say that, on 
average, raising the payroll by an extra $1.0 million would tend to produce an 
extra 0.329 wins. Accordingly, an extra $10 million would tend to produce an 
extra 3.29 wins. 

3. We can interpret the slope in terms of the type of linear relationship between 
the variables. For the baseball data, the slope is positive (+0.329), reflecting a 
direct (positive) relationship in which payroll and wins increase and decrease 
together. 

4. We can identify data points where the actual value of y is quite different from 
the value of y predicted by the equation. In this situation, we might be inter- 
ested in teams that overperformed (won more games than their payroll would 
have predicted) or underperformed (won fewer games than their payroll would 
have predicted). For example, the Cincinnati Reds played better than they were 
paid. Given their payroll of $35,399 million, the equation would have predicted 
them to win just 64.368 + 0.329(35.399) = 76.014 games, but they actually 
won a lot more — 96 games. 



computer solutions 2.7 



The Scatter Diagram 
EXCEL 




B 



G 



_9_ 

_n_ 

12 
JI3 

11 
15 



Team 



Payroll 



New York Yankees 



i.237 



Boston Red Sox 



62.708 



Toronto Blue Jays 



37.825 



Baltimore Orioles 



81.942 



Tampa Bay Devil Rays 



36.365 



Cleveland Indians 



70.734 



Chicago White Sox 



26.305 



Detroit Tigers 



35.907 



Kansas City Royals 



27.140 



Minnesota Twins 



21.363 



Texas Rangers 



96.923 



Oakland Athletics 



25.446 



Seattle Mariners 



52.821 



Anaheim Angels 



54.866 



Wins 



98 



94 



84 



78 



69 



97 



75 



69 



64 



63 



95 



87 



79 



70 



Wins vs. Payroll 

110 T 

100 

90 

in 

| 80 

70 



60 
50 



y = 0.3287X + 64.368 




♦ ♦ 



20 40 60 80 
Payroll ($millions) 



100 



120 



Open the data file CX02BASE.XLS. The labels and data are already entered, as shown in the printout. Click on B1 and 
drag to C31 to select cells B1:C31. (Note: As with the line chart, of the two columns, the variable to be represented 
by the vertical axis should always be in the column to the right.) 

Click the ChartWizard toolbar button. In the Chart type box, click XY (Scatter). In the Chart sub-type box, click the 
first choice, identified as Scatter. Compares pairs of values. Click Next. 

Ensure that the Data range box contains the range you specified in step 1 . In the Series in section, select Columns. 
Click Next. 
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4. Under the Titles tab of the Chart Options menu, enter Wins vs. Payroll into the Chart title box. Enter Payroll 
(Smillions) into the Value (X) axis box. Enter Wins into the Value (Y) axis box. Click Next. 

5. In the Place chart section of the next menu, select As object in and retain the Sheetl that already appears in the 
box, if this is satisfactory as the location for the chart. Click Finish. 

6. This optional step adds the best-fit straight line. Right-click on any one of the points in the scatter diagram. When 
the menu appears, Click Add Trendline. Under the Type tab, click Linear. Under the Options tab, click to place a 
check mark in the Display equation on chart box. Click OK. Click on the equation and drag it toward the top of 
the display. 

7. Further appearance improvements can be made by clicking on the chart and dragging the borders to expand it verti- 
cally and horizontally. If you don't want the little "Legend" box that originally appears, just right-click within it, then 
click Clear. 



MINI TAB 

The following steps produce the accompanying scatter diagram, complete with the best-fit equation. Along with the equa- 
tion, the display includes information we will be covering later in the text. 

1 . Open the data file CX02BASE.MTW. The labels, Payroll and Wins, have already been entered at the top of columns C2 
and C3, respectively. The payroll values and the win totals have been entered into C2 and C3, respectively. 

2. Click Stat. Select Regression. Click Fitted Line Plot. Enter C3 into the Response (Y) box. Enter C2 into the 
Predictor (X) box. Click to select the Linear model. 

3. Click Options. Enter Wins vs. Payroll into the Title box. Click OK. Click OK. 

4. Double-click on the label for the horizontal axis. Edit the text by adding (Smillions). 




The nature of this chapter has allowed us to only briefly introduce the scatter 
diagram, the best-fit linear equation, and interpretation of the results. Later in the 
text, in Chapter 15, Simple Linear Regression and Correlation, we will cover this 
topic in much greater detail. 
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2.38 What is a scatterplot, and for what kind of data is it 
a useful descriptive device? 

2.39 Differentiate between a positive linear relationship 
and a negative linear relationship between variables. 

2.40 When there is no relationship between two 
variables, what will tend to be the slope of the best-fit 
straight line through the scatter diagram? 

2.41 PC World reports the following text and graphics 
speeds (pages per minute, or ppm) for its top-rated inkjet 
printers, source: PC World, July 2003, p. 137. 



x = ppm. 
Plain Text 


y = ppm, 
Graphics 


x = ppm. 
Plain Text 


y = ppm, 
Graphics 


4.7 


0.7 


4.8 


0.6 


7.3 


1.3 


4.1 


1.1 


5.9 


0.9 


4.8 


1.0 


7.7 


7.2 


6.4 


1.1 


5.4 


0.9 


6.2 


1.6 



a. Draw a scatter diagram representing these data. 

b. Using the "eyeball" method, fit a straight line to the 
data. 

c. Does there appear to be any relationship between the 
variables? If so, is the relationship direct or inverse? 

2.42 For six local offices of a large tax preparation 
firm, the following data describe x = service revenues 
and y = expenses for supplies, freight, and postage 
during the previous tax preparation season: 

x = Service y = Supplies, Freight, 

Revenues Postage 
(Thousands) (Thousands) 



$351.4 $18.4 

291.3 15.8 

325.0 19.3 
422.7 22.5 

238.1 16.0 
514.5 24.6 

a. Draw a scatter diagram representing these data. 

b. Using the "eyeball" method, fit a straight line to 
the data. 

c. Does there appear to be any relationship between the 
variables? If so, is the relationship direct or inverse? 

2.43 The following data represent x = the total number of 
businesses and y = the number of business failures in 
mountain states over a 1-year period, source: The World 
Almanac and Book of Facts 1996, p. 391 . 



State 


x = Number 
of Businesses 


y = Number 
of Business 
Failures 


Arizona 


146,672 


1090 


Colorado 


156,891 


2091 


Idaho 


46,787 


312 


Montana 


43,961 


191 


Nevada 


38,593 


255 


New Mexico 


55,334 


319 


Utah 


58,156 


372 


Wyoming 


26,383 


178 



a. Draw a scatter diagram representing these data. 

b. Usingthe "eyeball" method, fit a straight line to the data. 

c. Does there appear to be any relationship between the 
variables? If so, is the relationship direct or inverse? 

/ data Set / Note: Exercises 2.44 and 2.45 require a 
computer and statistical software. 

2.44 For a sample of 100 employees, a personnel direc- 
tor has collected data on x = the number of years an 
employee has been with the firm and y = the number of 
shares of company stock the employee owns. The data 
are in file XR02044. 

a. Generate a scatter diagram that includes the best-fit 
linear equation for these data. 

b. Does there appear to be any relationship between the 
variables? If so, is the relationship direct or inverse? 

c. Interpret the slope of the equation generated in part (a). 

d. Viewing the scatter diagram and equation in part (a), do 
you think there are any employees who appear to own 
an unusually high or low amount of stock compared to 
the ownership that the equation would have predicted? 

2.45 A rental management executive has collected data 
on square footage and monthly rental fee for 80 
apartments in her city. The data are in file XR02045. 
Considering square footage as the independent variable 
and monthly rental fee as the dependent variable: 

a. Generate a scatter diagram that includes the best-fit 
linear equation for these data. 

b. Does there appear to be any relationship between the 
variables? If so, is the relationship direct or inverse? 

c. Interpret the slope of the equation generated in part (a). 

d. Viewing the scatter diagram and equation in part (a), 
do you think there are any apartments that seem to 
have an unusually high or an unusually low rental fee 
compared to what would be predicted for an 
apartment of their size? Are there any variables that 
are not included in the data that might help explain 
such a differential? If so, what might they be? 
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TABULATION, CONTINGENCY TABLES, 2.6 
AND THE EXCEL PivotTable WIZARD 



When some of the variables represent categories, we can apply a popular and 
useful summarization method called tabulation, where we simply count how 
many people or items are in each category or combination of categories. If each 
person or item is also described by a quantitative variable (e.g., annual income), 
we can take the concept one step further and examine how members of one 
category (e.g., union employees) differ from members of another category 
(e.g., nonunion employees) in terms of average annual income. 



Example 



Tabulation Methods 

For 50 persons observed using a local automated teller machine (ATM), researchers 
have described the customers according to age category and gender category, and 
have used a stopwatch to measure how long the customer required to complete 
his or her transactions at the machine. In this situation, age category and gender 
are considered to be nominal-scale (category) variables, and time (measured in 
seconds) is a quantitative variable. The raw data are shown in part A of Table 2.3 
(page 44) and are in data file CX02ATM. 

SOLUTIONS 

Simple Tabulation 

In simple tabulation, also known as marginal or one-way tabulation, we merely 
count how many people or items are in each category. The simple tabulation 
in part B of Table 2.3 shows us that the 50 ATM users consisted of 28 males and 
22 females. An alternative would have been to express the counts in terms of 
percentages — e.g., 56% of the subjects were males and 44% were females. 

Cross-Tabulation (Contingency Table) 

The cross-tabulation, also known as the crosstab or the contingency table, shows 
how many people or items are in combinations of categories. The cross-tabulation 
in part C of Table 2.3 describes how many of the 50 ATM users were in each 
age-gender group; for example, 7 of the individuals were males in the youngest 
age group and 4 were females in the oldest age group. Because there are two 
category variables involved, part C of Table 2.3 could also be referred to as a 
two-way crosstab. Cross-tabulations help us to identify and examine possible 
relationships between the variables. For example, the crosstab in part C of Table 
2.3 indicates that female ATM users outnumbered men in the youngest age 
category, but the reverse was true in the middle and older age categories. Given 
these results, bank management might want to interview customers to learn 
whether women in the middle and older age categories might have security con- 
cerns about the ATM location, lighting, or layout. 

In a useful application of the concept of cross-tabulation, we can generate a 
tabular display that describes how a selected quantitative variable tends to differ 
from one category to another or from one combination of categories to another. 
For example, part D of Table 2.3 shows the average time (seconds) required for 
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each of our six categories of ATM users. The average time for all 50 persons was 
39.76 seconds, with males in the middle age group having the fastest average time 
(34.53 seconds) and females in the older age group having the slowest average 
time (49.95 seconds). From a gender perspective, the average time for males 
(38.36 seconds) was slightly faster than that for females (41.53 seconds). 



TABLE 2.3 



Data representing age cate- 
gory, gender, and machine- 
usage time for 50 persons 
observed using an 
automated teller machine 
(ATM). Age category and 
gender are nominal-scale 
(category) variables, and 
time is a quantitative vari- 
able measured in seconds. 
The coding for the age and 
gender categories follows: 
AgeCat = 1 for age < 30, 
2 for age 30-60, 3 for 
age > 60; Gender = 1 for 
male, 2 for female. The 
data are in file CX02ATM. 



A. Raw Data 



AgeCat Gender 


Seconds 


AgeCat 


Gender 


Seconds 


AgeCat 


Gender Seconds 


\ z 


JU. I 


z 


z 


Af, A 


z 


1 


33.2 


1 1 

I I 


JJ .u 


z 


1 
I 


J 1 .u 


z 


2 


29.1 


2 2 


43.2 


2 


1 


44.8 


2 


1 


37.7 


1 2 


34.9 


2 


1 


30.3 


2 


2 


54.9 


3 1 


37.5 


2 


1 


33.6 


1 


2 


29.4 


2 1 


37.8 


3 


2 


56.3 


3 


1 


37.1 


3 1 


49.4 


2 


2 


37.8 


3 


2 


42.9 


1 1 


50.5 


1 


2 


34 4 


1 


2 


44.3 


3 1 


48.1 


2 


1 


32.9 


3 


2 


45.0 


1 2 


27.6 


1 


1 


26.3 


2 


2 


47.6 


3 2 


55.6 


2 


1 


30.8 


1 


1 


58.1 


2 2 


50.8 


1 


1 


24.1 


1 


2 


32.0 


3 1 


43.6 


1 


2 


43.6 


2 


1 


36.9 


1 1 


35.4 


2 


2 


29.9 


1 


1 


27.5 


1 2 


37.7 


2 


1 


26.3 


3 


1 


42.3 


3 1 


44.7 


3 


1 


47.7 


2 


1 


34.1 


2 1 


39.5 


1 


2 


40.1 








B. Simple Tabulation, by Gender 












1 = Male 




2 = 


= Female 




Total 




28 






22 






50 




C. Cross-Tabulation or Contingency Table, by Age Category and Gender 

Gender: 




Age Category: 




1 


= Male 


2 


= Female 


Total 




1 = under 30 yr. 






7 




10 


17 




2 = 30-60 yr. 






13 




8 


21 




3 = over 60 yr. 






8 




4 


12 




Total 






28 




22 


50 




D. Mean Usage Times (in Seconds), by Age Category and Gender 

Gender: 






Age Category: 




1 


= Male 


2 = 


= Female 


Total 




1 = under 30 yr. 






39.27 




37.41 


38.18 




2 = 30-60 yr. 






34.53 




42.46 


37.55 




3 = over 60 yr. 






43.80 




49.95 


45.85 




Total 






38.36 




41.53 


39.76 
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The Cross-Tabulation 
EXCEL 





A 


B 


C 


D 


E 


F 


G 


1 


AgeCat 


Gender 


Seconds 


Count of Gender 


Gender 


2 


1 


2 


50.1 


AgeCat 


1 


2 


Grand Total 


3 


1 


1 


53.0 


1 


7 


10 


17 


4 


2 


2 


43.2 


2 


13 


8 


21 


5 


1 


2 


34.9 


3 


8 


4 


12 


6 


3 


1 


37.5 


Grand Total 


28 


22 


50 



1 . Open the data file CX02ATM.XLS. The labels and data are already entered, as shown in the printout. The variables are 
age category (1 = <30, 2 = 30-60, and 3 = >60), gender category (1 = male, 2 = female), and time (in seconds). 
Click on A1 and drag to C51 to select cells A1:C51. 

2. Click Data. Click PivotTable and PivotChart Report. In the Where is the data? portion of the first menu, click 
Microsoft Office Excel list or database. In the What kind of report? portion, click PivotTable. Click Next. 

3. Ensure that the Data range box contains the range you specified in step 1 . Click Next. 

4. In the Where to place the PivotTable? portion of the next menu, click Existing worksheet and (assuming this 
location is satisfactory) enter D1 into the box. 

5. Click Layout. Click the AgeCat button at the right and drag the button into the ROW rectangle. Click the Gender 
button at the right and drag the button into the COLUMN rectangle. Click the Gender button at the right and drag 
it into the DATA rectangle; the button will automatically be transformed to Sum of Gender. Double-click the Sum of 
Gender button and select Count. Click OK. Click OK. Click Finish. 



MINITAB 

Tabulated statistics: AgeCat, Gender 

Rows: AgeCat Columns: Gender 
1 2 All 

1 7 10 17 

2 13 8 21 

3 8 4 12 
All 28 22 50 

Cell Contents: Count 

1 . Open the data file CX02ATM.MTW. The labels, AgeCat, Gender, and Seconds, have already been entered at the top 
of columns C1, C2, and C3, respectively. The corresponding age categories (1 = <30, 2 = 30-60, and 3 = >60), 
gender categories (1 =male, 2=female), and times (in seconds) have been entered into C1, C2, and C3, respectively. 

2. Click Stat. Select Tables. Click Cross Tabulation and Chi-Square. In the Categorical variables section, enter C1 
into the For rows box and C2 into the For columns box. In the Display portion, click to place a check mark next to 
Counts. Click OK. 



Cross-tabulations are very basic to many of the data analysis techniques in 
the remainder of the text, and this section includes two different Computer Solu- 
tions in which Excel and Minitab are applied. Computer Solutions 2.8 shows 
the procedures and the results when Excel and Minitab are used in generating a 
cross-tabulation, or contingency table, for the data in our ATM example. The 
underlying data are in CX02ATM, and the results correspond to the crosstab in 
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part C of Table 2.3. In this application, the Excel PivotTable Wizard is a valuable 
and versatile workhorse that is the tabular counterpart to the ChartWizard used 
in creating graphs and charts like those in Sections 2.4 and 2.5. 





computer solutions 2.9 


ft* 1 




■ • * 



Cross-Tabulation with Cell Summary Information 
EXCEL 



B 



AgeCat 



Gender 



Seconds 



Average of Seconds 



Gender 



1 



50.1 



AgeCat 



1 



Grand Total 



53.0 



39.27 



37.41 



38.18 



43.2 



34.53 



42.46 



37.55 



34.9 



43.80 



49.95 



45.85 



37.5 



Grand Total 



38.36 



41.53 



39.76 



37.8 



49.4 



50.5 



10 



48.1 



11 



27.6 



12 



55.6 



13 



14 



2 
1 



50.8 
43.6 



15 



35.4 



16 



37.7 



17 



44.7 



18 



39.5 



19 



46.4 



20 



31.0 



21 



1 



44.8 



Average Time, by Age Category and 
Gender 



40 



w 60 

£ 
O 

o 
a 

CO 



20 



m 



□1 
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Age Category 



Open the data file CX02ATM.XLS and perform steps 1 through 4 of the Excel PivotTable procedure in Computer 
Solutions 2.8. 

Click Layout. Click the AgeCat button at the right and drag the button into the ROW rectangle. Click the Gender 
button at the right and drag the button into the COLUMN rectangle. Click the Seconds button at the right and drag 
it into the DATA rectangle; the button will automatically be transformed to Sum of Seconds. Double-click the Sum 
of Seconds button and select Average. Click OK. Click OK. Click Finish. 

As an option, we can also depict the information with a chart. In the rectangular PivotTable menu, double-click the 
ChartWizard symbol. Under Chart type, click Column. Click the first category, identified as Clustered Column. 
Click Next. At the next menu, enter Average Time, by Age Category and Gender into the Title box. Enter Age 
Category into the Category (X) axis box. Enter Time, Seconds into the Value (Y) axis box. Click Finish. 



MINITAB 

1 . Open the data file CX02ATM.MTW. As in Computer Solutions 2.8, the labels, AgeCat, Gender, and Seconds, have al- 
ready been entered at the top of columns C1, C2, and C3, respectively. The corresponding age categories (1 = <30, 
2 = 30-60, and 3 = >60), gender categories (1 = male, 2 = female), and times (in seconds) have been entered into 
C1, C2, and C3, respectively. 

2. Click Stat. Select Tables. Click Descriptive Statistics. In the Categorical variables section, enter C1 into the For 
rows box and C2 into the For columns box. 
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3. Click on Display summaries for Associated Variables and enter C3 into the Associated variables box. Select 
Means. Click OK. Click OK. 

Tabulated statistics: AgeCat, Gender 

Rows: AgeCat Columns: Gender 







1 




2 


All 


1 


39 


27 


37 


41 


38.18 






7 




10 


17 


2 


34 


53 


42 


46 


37.55 






13 




8 


21 


3 


43 


80 


49 


95 


45.85 






8 




4 


12 


All 


38 


36 


41 


53 


39.76 






28 




22 


50 


Cell 


Contents : 


Seconds 





Mean Count 




2.46 When variables are used as the basis for a contingency 
table, what scale of measurement must they represent? 

2.47 Differentiate between simple tabulation and cross- 
tabulation, and give a real or hypothetical example of each. 

Note: Exercises 2.48-2.50 can be done by hand, but 
computer statistical software is preferable, if available. 

2.48 The 30 vehicles operated by a taxi service have been 
listed according to their engine (1 = diesel, 2 = gasoline), 
transmission ( 1 = manual, 2 = automatic), and whether 
they have air conditioning ( 1 = no air conditioning, 2 = air 



conditioning). These characteristics, along with the miles per 
gallon (mpg) achieved by each vehicle during the past month, 
are listed here. The data are also provided in file XR02048. 

a. Construct a simple tabulation in which the counts are 
according to the type of engine. 

b. Construct a cross-tabulation describing the fleet, 
using type of engine and type of transmission as the 
categorization variables. 

c. Construct a display showing the average mpg accord- 
ing to type of engine and type of transmission. Do the 
categorization variables seem to be related to mpg? If 
so, how? 



Engine 


Trans 


AC 


mpg 


Engine 


Trans 


AC 


mpg 


1 


1 




36.4 




1 


2 


38.6 


1 


1 




37.7 




1 


2 


34.8 


1 


1 




38.2 




2 


2 


30.8 


1 


1 




41.8 




2 


2 


26.4 


1 


1 




34.8 




2 


2 


31.6 


1 


2 




35.4 




2 


2 


28.3 


1 


2 




31.3 




2 


2 


29.5 


1 


2 




32.2 


2 


1 


2 


23.0 


2 


1 




26.2 


2 


1 


2 


21.5 


2 


1 




25.5 


2 


1 


2 


19.5 


2 


1 




27.7 


2 


2 


2 


18.3 


2 


2 




22.0 


2 


2 


2 


19.1 


2 


2 




23.3 


2 


2 


2 


20.8 


1 


1 


2 


34.8 


2 


2 


2 


21.8 


1 


1 


2 


34.0 


2 


2 


2 


18.4 



2.49 For the fleet described in Exercise 2.48, 
a. Construct a simple tabulation in which the counts are 
according to the type of engine. 



Construct a cross-tabulation describing the fleet, 
using type of engine and whether the vehicle has air 
conditioning as the categorization variables. 
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c. Construct a display showing the average mpg accord- 
ing to type of engine and whether the vehicle has air 
conditioning. Do the categorization variables seem to 
be related to mpg? If so, how? 

2.50 For the fleet described in Exercise 2.48, 

a. Construct a simple tabulation in which the counts are 
according to the type of transmission. 

b. Construct a cross-tabulation describing the fleet, 
using type of transmission and whether the vehicle 
has air conditioning as the categorization variables. 

c. Construct a display showing the average mpg accord- 
ing to type of transmission and whether the vehicle 
has air conditioning. Do the categorization variables 
seem to be related to mpg? If so, how? 

/ data set/ Note: Exercises 2.51-2.54 require a 
computer and statistical software. 

2.51 A high school guidance counselor has examined the 
60 colleges and universities that are on her "highly recom- 
mended" list for her school's graduating seniors. The data 
she has collected on each college or university include 
highest degree level offered ( 1 = bachelor's, 2 = master's, 
3 = doctorate), primary support for the school ( 1 = public, 
2 = private), type of campus setting (1 = urban, 

2 = suburban, 3 = rural), and annual cost for tuition 
and fees (dollars). The data are in file XR02051. 

a. Construct a simple tabulation in which the counts are 
according to the highest degree level offered. 

b. Construct a cross-tabulation describing the schools, 
using the highest degree level offered and whether the 
school is public or private as the categorization 
variables. 

c. Construct a display showing the average value for 
tuition and fees according to highest degree level 
offered and whether the school is public or private. 
Do the categorization variables seem to be related to 
the level of tuition and fees? If so, how? 

2.52 For the schools described in Exercise 2.51, 

a. Construct a simple tabulation in which the counts 
are according to the type of campus setting. 

b. Construct a cross-tabulation describing the schools, 
using the type of campus setting and whether the school 
is public or private as the categorization variables. 



c. Construct a display showing the average value for 
tuition and fees according to type of campus setting 
and whether the school is public or private. Do the 
categorization variables seem to be related to the 
level of tuition and fees? If so, how? 

2.53 An interdisciplinary research team has examined 
35 U.S. cities, reviewing and grading each according to 
the following performance areas: financial management, 
capital projects, personnel policies, information technol- 
ogy, and managing for results. 3 The grades awarded 
were letter grades such as students might receive in 
school. We have rounded off the " + " and " — " por- 
tions of the letter grades so that they are simply A, B, 
C, D, or F, and the data are coded so that A = 4, B = 3, 
C = 2, D = 1, and F = 0. We have also included one 
more variable for each city: its population according to 
the latest census statistics at the time. b The data for all 
35 cities are in file XR02053. Source: Richard Wolf, "Phoenix 

Is Managing, To Get Everything Right," USA Today, January 31, 2000, 
p. 13A. b souRCE: The New York Times Almanac 2000, pp. 248-250. 

a. Construct a simple tabulation in which the counts are 
according to the grade on financial management. 

b. Construct a cross-tabulation describing the cities, 
using grade on financial management and grade on 
information technology as the categorization 
variables. 

c. Construct a display showing the average population 
size according to grade on financial management and 
grade on information technology. Do the categoriza- 
tion variables seem to be related to the level of popu- 
lation? If so, how? 

2.54 For the cities described in Exercise 2.53, 

a. Construct a simple tabulation in which the counts are 
according to the grade on personnel policies. 

b. Construct a cross-tabulation describing the cities, 
using the grade on personnel policies and the grade 
on managing for results as the categorization 
variables. 

c. Construct a display showing the average population 
according to grade on personnel policies and the grade 
on managing for results. Do the categorization 
variables seem to be related to the level of population? 
If so, how? 



SUMMARY 



• Frequency distributions and histograms 

To more concisely communicate the information contained, raw data can be 
visually represented and expressed in terms of statistical summary measures. When 
data are quantitative, they can be transformed to a frequency distribution or a 
histogram describing the number of observations occurring in each category. 
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The set of classes in the frequency distribution must include all possible values 
and should be selected so that any given value falls into just one category. Select- 
ing the number of classes to use is a subjective process. In general, between 5 and 
15 classes are employed. A frequency distribution may be converted to show 
either relative or cumulative frequencies for the data. 

• Stem-and-leaf displays, dotplots, and other graphical methods 

The stem-and-leaf display, a variant of the frequency distribution, uses a subset of 
the original digits in the raw data as class descriptors (stems) and class members 
(leaves). In the dotplot, values for a variable are shown as dots appearing along a 
single dimension. Frequency polygons, ogives, bar charts, line graphs, pie charts, 
pictograms, and sketches are among the more popular methods of visually 
summarizing data. As with many statistical methods, the possibility exists for the 
purposeful distortion of graphical information. 

• The scatter diagram 

The scatter diagram, or scatterplot, is a diagram in which each point represents a 
pair of known or observed values of two variables. These are typically referred to 
as the dependent variable (y) and the independent variable (x). This type of anal- 
ysis is carried out to fit an equation to the data to estimate or predict the value of y 
for a given value of x. 

• Tabulation and contingency tables 

When some of the variables represent categories, simple tabulation and cross- 
tabulation are used to count how many people or items are in each category or 
combination of categories, respectively. These tabular methods can be extended 
to include the mean or other measures of a selected quantitative variable for 
persons or items within a category or combination of categories. 




Width (Class Interval) of a Frequency Distribution Class 

Lower class limit of Lower class limit of 
Class interval = next higher class ~~ the class 

Approximate Width of Classes When Constructing a Frequency Distribution 

Largest value Smallest value 

in raw data in raw data 

Approximate class width = — - 

Number of classes desired 

Midpoint of a Class in a Frequency Distribution 

Class mark = Lower class limit + 0.5(class interval) 

Size of Each Piece of a Pie Chart 

Number of degrees Relative value of the 
for the category = category x 360 
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2.55 The breakdown of U.S. cities having a population of 
at least 10,000 persons has been reported as follows. 
source: Bureau of the Census, Statistical Abstract of the United States 
2006, p. 32. 

Population Number of Cities 



10,000-under 25,000 
25,000-under 50,000 
50,000-under 100,000 
100,000-under 250,000 
250,000-under 500,000 
500,000-under 1,000,000 
1,000,000 or more 



1476 
664 
407 
182 
37 
23 



a. How many cities have a population of at least 25,000 
but less than 500,000? 

b. How many cities have a population less than 250,000? 

c. How many cities have a population of at least 100,000 
but less than 1,000,000? What percentage of cities are 
in this group? 

d. What is the class mark for the 100,000-under 
250,000 class? 

e. Convert the table to a relative frequency distribution. 

2.56 The National Oceanic and Atmospheric Admini- 
stration reports the following record high temperatures 
(degrees Fahrenheit) in 50 U.S. cities. These data are also 
provided in the file XR02056. source: Bureau of the Census, 
Statistical Abstract of the United States 2006, p. 238. 



City 


Temperature 


Mobile, AL 


105 


Juneau, AK 


90 


Phoenix, AZ 


122 


Little Rock, AR 


112 


Los Angeles, CA 


110 


Denver, CO 


104 


Hartford, CT 


102 


Wilmington, DE 


102 


Miami, FL 


98 


Atlanta, GA 


105 


Honolulu, HI 


95 


Boise, ID 


1 1 1 


Chicago, IL 


104 


Indianapolis, IN 


104 


Des Moines, IA 


108 


Wichita, KS 


113 


Louisville, KY 


106 


New Orleans, LA 


102 


Portland, ME 


103 


Baltimore, MD 


105 


Boston, MA 


102 



a. 
b. 
c. 

d. 



City 


Tpmnpratnrp 
icii iuci aiui c 


Detroit, Ml 


104 


Duluth, MN 


97 


Jackson, MS 


107 


St. Louis, MO 


107 


Great Falls, MT 


106 


Omaha, NE 


114 


Reno, NV 


108 


Concord, NH 


102 


Atlantic City, NJ 


106 


Albuquerque, NM 


107 


Buffalo, NY 


99 


Charlotte, NC 


104 


Bismarck, ND 


111 


Cleveland, OH 


104 


Oklahoma City, OK 


110 


Portland, OR 


107 


Pittsburgh, PA 


103 


Providence, Rl 


104 


Columbia, SC 


107 


Sioux Falls, SD 


110 


Nashville, TN 


107 


Houston, TX 


109 


Salt Lake City, UT 


107 


Burlington, VT 


101 


Norfolk, VA 


104 


Seattle, WA 


100 


Charleston, WV 


104 


Milwaukee, Wl 


103 


Cheyenne, WY 


100 



Construct a stem-and-leaf display for these data. 
Construct a frequency distribution. 
Determine the interval width and the class mark for 
each of the classes in your frequency distribution. 
Based on the frequency distribution obtained in part (b), 
draw a histogram and a relative frequency polygon to 
describe the data. 



2.57 The average daily cost to community hospitals for 
patient stays during 1993 for each of the 50 U.S. states 
was as follows: 



(continued) 



AL 


$775 


GA 


$775 


ME 


$738 


AK 


1136 


HI 


823 


MD 


889 


AZ 


1091 


ID 


659 


MA 


1036 


AR 


678 


IL 


917 


Ml 


902 


CA 


1221 


IN 


898 


MN 


652 


CO 


961 


IA 


612 


MS 


555 


CT 


1058 


KS 


666 


MO 


863 


DE 


1029 


KY 


703 


MT 


482 


FL 


960 


LA 


875 


NE 


626 










(continued) 
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NV 


$900 


OK 


$797 


UT 


$1081 


NH 


976 


OR 


1052 


VT 


676 


NJ 


829 


PA 


861 


VA 


830 


NM 


1046 


Rl 


885 


WA 


1 143 


NY 


784 


SC 


838 


WV 


701 


NC 


763 


SD 


506 


Wl 


744 


ND 


507 


TN 


859 


WY 


537 


OH 


940 


TX 


1010 







These data are also provided in the file XR02057. source: 
Health Insurance Association of America, Source Book of Health Insurance 
Data 1995, p. 103. 

a. Construct a stem-and-leaf display for these data. 

b. Construct a frequency distribution. 

c. Determine the interval width and the class mark for 
each of the classes in your frequency distribution. 

d. Based on the frequency distribution obtained in part (b), 
draw a histogram and a relative frequency polygon. 

2.58 In 2004, unemployment rates in the 50 U.S. states 
were reported as follows. These data are also provided in 
the file XR02058. source: Bureau of the Census, Statistical Abstract 
of the United States 2006, p. 390. 





Percent 




Percent 




Percent 


AL 


5.8 


LA 


6.0 


OH 


6.3 


AK 


7.5 


ME 


4.7 


OK 


4.9 


AZ 


5.1 


MD 


4.2 


OR 


7.6 


AR 


5.9 


MA 


5.1 


PA 


5.6 


CA 


6.2 


Ml 


7.0 


Rl 


5.4 


CO 


5.4 


MN 


4.8 


SC 


6.9 


CT 


4.9 


MS 


6.2 


SD 


3.7 


DE 


3.9 


MO 


5.7 


TN 


5.1 


FL 


4.6 


MT 


4.9 


TX 


6.0 


GA 


4.7 


NE 


3.8 


UT 


5.3 


HI 


3.4 


NV 


4.2 


VT 


3.7 


ID 


5.3 


NH 


3.7 


VA 


3.9 


IL 


6.1 


NJ 


4.8 


WA 


6.2 


IN 


5.3 


NM 


5.6 


WV 


5.3 


IA 


4.6 


NY 


5.8 


Wl 


5.0 


KS 


5.5 


NC 


5.4 


WY 


3.8 


KY 


5.2 


ND 


3.4 







a. Construct a stem-and-leaf display for these data. 

b. Construct a frequency distribution for these data. 

c. Determine the interval width and the class mark for 
each of the classes in your frequency distribution. 

d. Based on the frequency distribution obtained in part 
(b), draw a histogram and a relative frequency poly- 
gon to describe the data. 

2.59 During 2001, sales of new, privately owned homes 
in the total United States and in the western states were 
broken down into price categories as follows: source: 
Bureau of the Census, Statistical Abstract of the United States 2002, 
p. 725. 



Sales 
(Thousands 
of Homes) 





Total United 


Western 


Price of Home 


States 


States 


under $100,000 


75 


6 


$100,000-under $150,000 


248 


46 


$150,000-under $200,000 


221 


57 


$200,000-under $300,000 


221 


78 


$300,000 or over 


142 


53 



Convert these data to relative frequency distributions, 
one for the total United States, the other for the western 
states. Do the results appear to suggest any differences 
between total U.S. prices and those in the western states? 

2.60 The following stem-and-leaf output has been gener- 
ated by Minitab. The data consist of two-digit integers. 

Stem-and-leaf of AGE N=21 

Leaf Unit=1.0 



1 


3 


3 


4 


3 


669 


4 


4 




8 


4 


5667 


(6) 


5 


012233 


7 


5 


68 


5 


6 


0 


4 


6 


779 


1 


7 


0 



a. From this display, is it possible to determine the exact val- 
ues in the original data? If so, list the data values. If not, 
provide a set of data that could have led to this display. 

b. Interpret the numbers in the leftmost column of the 
output. 

2.61 The following stem-and-leaf output has been gener- 
ated by Minitab. The data consist of three-digit integers. 

Stem-and-leaf of SCORE N=21 

Leaf Unit=10 



2 


1 


89 


3 


2 


0 


3 


2 




9 


2 


445555 


9 


2 




(2) 


2 


89 


10 


3 


0011 


6 


3 


23 


4 


3 


455 


1 


3 




1 


3 


8 



a. From this display, is it possible to determine the exact val- 
ues in the original data? If so, list the data values. If not, 
provide a set of data that could have led to this display. 
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b. Interpret the numbers in the leftmost column of the 
output. 

2.62 For the period 1990-1999, the Bristol-Myers Squibb 
Company, Inc. reported the following amounts (in billions 
of dollars) for (1) net sales and (2) advertising and prod- 
uct promotion. The data are also in the file XR02062. 

source: Bristol-Myers Squibb Company, Annual Report 1999, pp. 50-51 . 

Year Net Sales Advertising/Promotion 



1990 


$ 9.741 


$1,189 


1991 


10.571 


1.263 


1992 


1 1.156 


1.291 


1993 


11.413 


1.255 


1994 


1 1.984 


1.367 


1995 


13.767 


1.646 


1996 


15.065 


1.946 


1997 


16.701 


2.241 


1998 


18.284 


2.312 


1999 


20.222 


2.409 



For these data, construct a line graph that shows both net 
sales and expenditures for advertising/product promotion 
over time. Some would suggest that increases in advertis- 
ing should be accompanied by increases in sales. Do your 
line graphs seem to support this? 

2.63 A social scientist would like to use a pictogram in 
comparing the number of violent crimes against persons 
in 2006 with the comparable figure for 1960. Consider- 
ing that she can select from a wide variety of symbols to 
demonstrate these frequencies, suggest several possible 
symbols that would tend to underscore the violence 
and/or human tragedy of such crimes. 

/ data set/ Note: Exercises 2.64-2.72 require a 
computer and statistical software. 

2.64 If possible with your statistical software, construct 
a dotplot for the data of Exercise 2.56 (use data file 
XR02056). Are there any cities for which the record 
high temperature is so high that it would appear to be 
an outlier? If so, which one(s)? 

2.65 If possible with your statistical software, construct 
a dotplot for the data of Exercise 2.57 (use data file 
XR02057). Are there any states for which the average 
daily cost was so high that it would appear to be an 
outlier? If so, which one(s)? 

2.66 If possible with your statistical software, use the dot- 
plot method to compare the 50-state unemployment rates 
in 1980 with those in 2004. Use the same scale for each 
dotplot, then comment on whether unemployment appears 
to have changed in terms of its range (highest minus low- 
est) or in the general level of unemployment (perhaps a 
slight shift to the right or to the left) from 1980 to 2004. 
Data for both years are in the file XR02066. source: Bureau of 
the Census, Statistical Abstract of the United States 2006, p. 390. 



2.67 Among the information in its top-50 list of the na- 
tion's law schools, U.S. News & World Report provided 
two reputation scores (with maximum = 5.0) for each 
school: one from academicians, the other from 
lawyers/judges. The magazine also provided the most 
recent acceptance rate (%), student-faculty ratio 
(students/faculty), and the percentage of graduates who 
were employed immediately upon graduation. The data 
values are listed in file XR02067. source: U.S. NewsS World 
Report, Best Graduate Schools, 2001 edition, p. 49. 

a. Construct a scatter diagram where the variables are 
the two kinds of reputation scores. Do schools given 
higher scores by the academicians seem to also receive 
higher scores from the lawyers/judges? 

b. Fit a linear equation to the scatter diagram. Is the slope 
positive or is it negative? Is the sign of the slope con- 
sistent with your intuitive observation in part (a)? 

2.68 For the law schools discussed in Exercise 2.67, and 
using the data in file XR02067, 

a. Construct a scatter diagram where the dependent vari- 
able is the percentage of graduates who were em- 
ployed immediately upon graduation, and the inde- 
pendent variable is the reputation score provided by 
the lawyers/judges. Do schools given higher scores by 
the lawyers/judges also seem to have a higher percent- 
age of their graduates employed immediately upon 
graduation? 

b. Fit a linear equation to the scatter diagram. Is the slope 
positive or is it negative? Is the sign of the slope con- 
sistent with your intuitive observation in part (a)? 

2.69 The household saving rates (percentage of dispo- 
sable household income that goes into savings) have been 
reported for several countries — including the United 
States, Canada, France, Germany, and Japan — for the 
years 1980 through 1997. The percentages are listed in 
data file XR02069. source: Security Industry Association, 7999 
Securities Industry Fact Book, p. 98. 

a. Construct a scatter diagram using the household sav- 
ing rates in the United States and Canada as the two 
variables. It doesn't really matter which one you con- 
sider to be the independent variable. In the years when 
U.S. saving rates were higher, did those in Canada 
seem to be higher as well? 

b. Fit a linear equation to the scatter diagram. Is the slope 
positive or is it negative? Is the sign of the slope con- 
sistent with your intuitive observation in part (a)? 

2.70 For the nations and household saving rates in Exer- 
cise 2.69, and using the data in file XR02069, 

a. Construct a scatter diagram using the household sav- 
ing rates in the United States and Germany as the two 
variables. It doesn't really matter which one you con- 
sider to be the independent variable. In the years when 
U.S. saving rates were higher, did those in Germany 
seem to be higher as well? 
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b. Fit a linear equation to the scatter diagram. Is the slope 
positive or is it negative? Is the sign of the slope con- 
sistent with your intuitive observation in part (a)? 

2.71 An air-compressor manufacturer purchases mechani- 
cal components from two different suppliers (MechSup = 
1 or 2) and electrical/motor components from three 
different suppliers (ElectSup = 1, 2, or 3), with final 
assembly being carried out by one of two technicians 
(Tech = 1 or 2). Following production, finished compres- 
sor units are tested to measure how much pressure they 
can exert (pounds per square inch, or psi). During the 
past few months, 100 compressors have been tested 
before shipment to customers, and the resulting data are 
listed in file XR02071. 

a. Construct a simple tabulation in which the counts are 
according to which company supplied the mechanical 
components. 

b. Construct a cross-tabulation describing the 100 com- 
pressors tested, using mechanical supplier and electri- 
cal/motor supplier as the categorization variables. 

c. Construct a display showing the average pressure 
(psi) according to mechanical supplier and electri- 
cal/motor supplier. Which combination of mechani- 
cal and electrical/motor suppliers seems to result in 



the highest pressure exertion? The lowest pressure 
exertion? 

d. Based on the crosstab and means in part (c), would it 
seem that any of the five suppliers should be examined 
further with regard to the effect their product might 
have on the final pressure capabilities of the finished 
product? Explain your answer. 

2.72 For the situation and data described in Exercise 
2.71, and using data file XR02071, 

a. Construct a simple tabulation in which the counts are 
according to which company supplied the electrical/ 
motor components. 

b. Construct a cross-tabulation describing the 100 com- 
pressors tested, using electrical/motor supplier and final 
assembly technician as the categorization variables. 

c. Construct a display showing the average pressure (psi) 
exerted according to electrical/motor supplier and 
final assembly technician. Which combination of elec- 
trical/motor supplier and final assembly technician 
seems to result in the highest average pressure exerted? 
The lowest pressure exerted? 

d. Based on the crosstab and means in part (c), would it 
seem that the two technicians might not be equally 
adept at the final assembly task? Explain your answer. 



itegrated 



Hi 




Thorndike Sports Equipment 

(Meet the Thorndikes: See Video Unit One.) 



Luke Thorndike, founder and current president of 
Thorndike Sports Equipment, had guided the business 
through 34 successful years and was now interested in 
bringing his favorite grandson Ted into the company. 

The elder Thorndike, possessed of a sharp but fading 
wit, begins the meeting with, "Great to see you, Ted. You 
always were a high-strung kid. Thought you might like to 
join our tennis racquet division." Ted counters, "Not 
quite, but you're getting warm, Luke." The Thorndikes 
have always been a strange bunch. 

"Seriously, Ted, I'm getting a little up in years, the 
microcomputer I bought myself for Christmas is collecting 
dust, and I think you and your business degree could bring 
some new blood to the company. I'd like you to be my ex- 
ecutive vice president. I've been running this outfit by the 
seat of my pants for a lot of years now, and the world just 
seems to be getting too big and too complicated these days. 
I've got index cards and file folders just about piled up to 
the ceiling in that little room next door, and there's got to 



be a better way of making sense of all this information. 
Maybe I shouldn't have fought the fire department when 
they wanted to condemn the place." 

"Besides all these records piling up, a lot of technical 
developments are affecting our business — things like com- 
posite bicycle wheels, aerodynamic golf balls, and oversize 
tennis racquets. Just yesterday one of our engineers came 
in and said she's come up with a new golf ball that will go 
farther than the conventional design. She seems trustwor- 
thy, but we might need to have some numbers to back up 
our claim if we decide to come out with the product. " 

After further discussion of the business and the posi- 
tion that Mr. Thorndike has proposed, Ted accepts the 
offer. As his first official duty, he sets up a test of the new 
golf ball that's supposed to travel farther than the conven- 
tional design. He decides to mix 25 of the new balls with 25 
of the old type, have a golf pro hit all 50 of them at a driv- 
ing range, then measure how far each goes. The results are 
provided here and are also listed in the data file THORN02. 

(continued) 
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25 Drives with New Ball (Yards) 



267.5 248.3 265.1 243.6 253.6 232.7 249.2 232.3 252.8 

247.2 237.4 223.7 260.4 269.6 256.5 271.4 294.1 256.3 

264.3 224.4 239.9 233.5 278.7 226.9 258.8 



25 Drives with Conventional Ball (Yards) 



241.8 255.1 266.8 251.6 233.2 242.7 218.5 229.0 256.3 
264.2 237.3 253.2 215.8 226.4 201.0 201.6 244.1 213.5 

267.9 240.3 247.9 257.6 234.5 234.7 215.9 



1. Using 10-yard intervals beginning with 200.0-under 
210.0, 210.0-under 220.0, on up to 290.0-under 
300.0, construct a frequency distribution for the 
distances traveled by the new ball. 

2. Using the same intervals as in part 1, construct a fre- 
quency distribution for the distances traveled by the 
conventional ball. 



3. Place the frequency distribution for the new ball next 
to the one for the conventional ball. Does it appear 
that the new ball might be more "lively" than the con- 
ventional ball? 



Springdale Shopping Survey* 

The major shopping areas in the community of Spring- 
dale include Springdale Mall, West Mall, and the down- 
town area on Main Street. A telephone survey has been 
conducted to identify strengths and weaknesses of these 
areas and to find out how they fit into the shopping 



activities of local residents. The 150 respondents were 
also asked to provide information about themselves and 
their shopping habits. The data are provided in the com- 
puter file SHOPPING. The variables in the survey were as 
follows: 



A. How Often Respondent Shops at Each Area (Variables 1-3) 



1. 2. 3. 

Springdale Mall Downtown West Mall 



6 or more times/wk. (1) (1) (1) 

4-5 times/wk. (2) (2) (2) 

2-3 times/wk. (3) (3) (3) 

1 time/wk. (4) (4) (4) 

2-4 times/mo. (5) (5) (5) 

0-1 times/mo. (6) (6) (6) 

B. How Much the Respondent Spends during a Trip to Each Area (Variables 4-6) 



4. 5. 6. 

Springdale Mall Downtown West Mall 



$200 or more (1) (1) (1) 

$150-under $200 (2) (2) (2) 

$100-under$150 (3) (3) (3) 

$ 50-under$100 (4) (4) (4) 

(continued) 



Source: Materials for this case have been provided courtesy of The Archimedes Group, Indiana, PA. Data are based on actual responses obtained to this 
subset of the questions included in the survey; town and mall identities have been disguised. 
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$ 25-under$50 
$ 15-under$25 
less than $15 



(5) 
(6) 
(7) 



(5) 
(6) 
(7) 



C. General Attitude toward Each Shopping Area (Variables 7-9) 



7. 

Springdale Mall 



8. 

Downtown 



(5) 
(6) 
(7) 



9. 

West Mall 



Like very much 
Like 
Neutral 
Dislike 

Dislike very much 



(5) 
(4) 
(3) 
(2) 
(1) 



(5) 
(4) 
(3) 
(2) 
(1) 



(5) 
(4) 
(3) 
(2) 
(1) 



D. Which Shopping Area Best Fits Each Description (Variables 10-17) 







Springdale 




West 


No 






Mall 


Downtown 


Mall 


Opinion 


10. 


Easy to return/exchange goods 


(1) 


(2) 


(3) 


(4) 


11. 


High quality of goods 


(1) 


(2) 


(3) 


(4) 


12. 


Low prices 


(1) 


(2) 


(3) 


(4) 


13. 


Good variety of sizes/styles 


(1) 


(2) 


(3) 


(4) 


14. 


Sales staff helpful/friendly 


(1) 


(2) 


(3) 


(4) 


15. 


Convenient shopping hours 


(1) 


(2) 


(3) 


(4) 


16. 


Clean stores and surroundings 


(1) 


(2) 


(3) 


(4) 


17. 


A lot of bargain sales 


(1) 


(2) 


(3) 


(4) 



E. Importance of Each Item in Respondent's Choice of a Shopping Area (Variables 18-25) 

Not 
Important 



Very 
Important 



18. 


Easy to return/exchange goods ( 


) (2) 


(3) 


(4) 


(5) 


(6) 


(7) 


19. 


High quality of goods ( 


) (2) 


(3) 


(4) 


(5) 


(6) 


(7) 


20. 


Low prices ( 


) (2) 


(3) 


(4) 


(5) 


(6) 


(7) 


21. 


Good variety of sizes/styles ( 


) (2) 


(3) 


(4) 


(5) 


(6) 


(7) 


22. 


Sales staff helpful/friendly ( 


) (2) 


(3) 


(4) 


(5) 


(6) 


(7) 


23. 


Convenient shopping hours (' 


) (2) 


(3) 


(4) 


(5) 


(6) 


(7) 


24. 


Clean stores and surroundings ( 


) (2) 


(3) 


(4) 


(5) 


(6) 


(7) 


25. 


A lot of bargain sales ( 


) (2) 


(3) 


(4) 


(5) 


(6) 


(7) 



F. Information about the Respondent (Variables 26-30) 



26. Gender: (1) = Male (2) = Female 

27. Number of years of school completed: 

(1) = less than 8 years (3) = 1 2-under 1 6 years 

(2) = 8-under 12 years (4) = 16 years or more 

28. Marital status: (1) = Married (2) = Single or other 

29. Number of people in household: persons 

30. Age: years 



Each respondent in this database is described by 
30 variables. As an example of their interpretation, con- 
sider row number 1. This corresponds to respondent 
number 1 and contains the information shown at the top 
of page 56. 



The data from these 150 respondents will be the basis 
for further analyses later in the text. In applying some of 
the techniques from this chapter, the following questions 
could provide insights into the perceptions and behavior 
of Springdale residents regarding the three shopping areas. 
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Variable number 6 = 7. The 
respondent usually spends 
less than $15 when she 
shops at West Mall. 

f 

555677444113111 

t , 

Variable number 1 = 5. 
The respondent shops at Spring- 
dale Mall 2—4 times/month. 



Variable number 26 = 2. 
The respondent is a female. 

1167765667232421 

t 

Variable number 19 = 7. 
Respondent believes "high 
quality of goods" is very impor- 
tant in choosing a shopping area. 



1. Using the data file SHOPPING, construct three different 
frequency distributions, one each for variables 7, 8, 
and 9. How do the three areas compare in terms of 
residents' general attitudes toward each? 

2. Do people tend to spend differently at the three areas? 
Construct and compare frequency distributions for 
variables 4, 5, and 6. 

3. To find out more about specific strengths and weak- 
nesses of the areas, set up a frequency distribution for 
variable 10 (i.e., how many "best-fits" votes did each 



area get for "easy to return/exchange goods"? Repeat 
this for variables 11-17 and interpret the results. Which 
of the malls seems to be the "bargain mall"? Which area 
seems to have the most convenient shopping hours? 
4. Generate a cross-tabulation in which the categoriza- 
tion variables are variable 26 (gender) and variable 28 
(marital status). For each of the four subcategories in 
the cross-tabulation, have your statistical software 
include the average for variable 30 (respondent age). 
Interpret the results. 



Employee Absenteeism: Slick Trick or Truly Sick? 



The average government worker is allowed 12.7 paid sick days per year and uses 6.1 
of them. The average retail worker is allowed just 5.8 paid sick days per year and uses 
3.7 of them. Does retailing attract younger and healthier workers, or is it possible that 
workers who are allowed more paid sick days simply use more paid sick days? 

According to a survey of 574 human resource executives, the answer is the 
latter — allowing more paid sick days leads to more paid sick days being taken. 
Human resource directors suspect that only 28% of paid sick leave is actually used 
because of illness, but they have no way of knowing for sure which absent employ- 
ees are really sick. 

Rosemary Haefner, Senior Career Advisor for CareerBuilder.com, makes an interest- 
ing seasonal observation about the contrived absenteeism situation: "With the cold 
and flu season kicking in, it's a popular time of the year for employees to call in sick. 
However, the number of those who are actually feeling under the weather may not 
necessarily match up with unscheduled absences. Twenty percent of workers say they 
called into work because they just didn't feel like going to the office that day. One in 
four workers report they feel sick days are equivalent to taking extra vacation days and 
treat them as such." 

The situation can also be viewed from the standpoint of 
union/management status, and using the median 
(the sick-days value where just as many work- , t ea t the wean, 

ers are higher as lower) instead of the '^on't eat the Tfh^first decile " 

average, or mean. From this perspective, union ( w0 n't even 
hourly workers use a median of 6.1 sick days 
per year, nonunion hourly workers take 4.9, and 
management uses 4.0. 

In the pages ahead, we'll talk more about 
the mean and the median, as well as several 
other important numerical measures that describe 
data. Try not to be absent. 



Sources: Del Jones, "Firms Take New Look at Sick Days," USA Today, 
October 8, 1996, p. 8B; and Kate Lorenz, "Why Are People Calling in 
Sick? What's Your Excuse?," Netscape.com, May 1, 2006. 
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learning 
objectives 

After reading this 
chapter, you should 
be able to: 



Describe data by using measures of central tendency and dispersion. 
Convert data to standardized values. 

Use the computer to visually represent data with a box-and-whisker plot. 

Determine measures of central tendency and dispersion for grouped data. 

Use the coefficient of correlation to measure association between two quanti- 
tative variables. 



3.1 INTRODUCTION 



In Chapter 2, we saw how raw data are converted into frequency distributions, 
histograms, and visual displays. We will now examine statistical methods for 
describing typical values in the data as well as the extent to which the data are 
spread out. Introduced in Chapter 1, these descriptors are known as measures of 
central tendency and measures of dispersion: 

• Measures of central tendency Numbers describing typical data values. 
Chances are the typical subscriber to the Wall Street Journal earns more than 
the typical subscriber to Mad magazine. By using measures of central ten- 
dency, we can numerically describe the typical income of members of each 
group. The primary measures of central tendency discussed in this chapter are 
the arithmetic mean, weighted mean, median, and mode. 

• Measures of dispersion Numbers that describe the scatter of the data. It's 
very likely that the variety of heights of centers in the National Basketball 
Association (most such players are between 6'8" and 7'4") is not as wide as 
the dispersion that exists within the general population, which includes indi- 
viduals as tall as the NBA centers and persons as short as newborn babies. 
Measures of dispersion allow us to numerically describe the scatter, or spread, 
of measurements. Among the measures of dispersion discussed in this chapter 
are the range, quantiles, mean absolute deviation, variance, and standard 
deviation. 

When we discuss measures of central tendency and dispersion, we'll be using 
two other key terms from Chapter 1: population and sample. Remember that 
(1) we often wish to use information about the sample to make inferences about 
the population from which the sample was drawn, and (2) characteristics of the 
population are referred to as parameters, while characteristics of the sample are 
referred to as statistics. 

Chapter 2 also introduced the scatter diagram and best-fit linear equation as 
a graphical method of examining the possible linear relationship between two 
quantitative variables. In this chapter, we will extend this to two statistical mea- 
sures of association: the coefficients of correlation and determination. As with the 
scatter diagram and best-fit equation, these will be discussed in greater detail in 
Chapter 15, Simple Linear Regression and Correlation. 
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STATISTICAL DESCRIPTION: MEASURES 3.2 
OF CENTRAL TENDENCY 



We now discuss methods for representing the data with a single numerical value. 
This numerical descriptor has the purpose of describing the typical observation 
contained within the data. 



The Arithmetic Mean 

Defined as the sum of the data values divided by the number of observations, 
the arithmetic mean is one of the most common measures of central tendency. 
Also referred to as the arithmetic average or simply the mean, it can be 
expressed as p (the population mean, pronounced "myew") or x (the sample 
mean, "x bar"). 

The population mean (/jl) applies when our data represent all of the items 
within the population. The sample mean (x) is applicable whenever data repre- 
sent a sample taken from the population. 



Population mean: 



N 

p = or simply fi = — 

where p, = population mean 

x i = the ith data value in the population 
2 = the sum of 

N = number of data values in the population 



The leftmost version of the equation for the population mean (p,) includes com- 
plete summation notation. Notation such as the "i =1" and "N" will be included 
in the remainder of the text only when the nature of the summation is not clear 
from the context in which it is being used. 



Sample mean: 

_ 2x 



where x = sample mean 

Xj = the z'th data value in the sample 
2 = the sum of 

n = number of data values in the sample 



In determining either a population mean (/i) or a sample mean (3c), the sum of 
the data values is divided by the number of observations. As an example of the 
calculation of a population mean, consider the following data for shipments of 
peanuts from a hypothetical U.S. exporter to five Canadian cities. 
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City Peanuts (Thousands of Bags) 

Montreal 64.0 

Ottawa 15.0 

Toronto 285.0 

Vancouver 228.0 

Winnipeg 45.0 

Because these were the only Canadian cities identified as receiving U.S. 
peanuts from this supplier during this period, they can be considered a popula- 
tion. We can calculate the arithmetic mean (/x) for these data as follows: 

2x, 64.0 + 15.0 + 285.0 + 228.0 + 45.0 „„„ „ , 
ix = = = 127.4 thousand bags 

On the average, each Canadian destination for this U.S. company's peanuts 
received 127.4 thousand bags of peanuts during the time period involved. 

To help understand what the arithmetic mean represents, picture a play- 
ground seesaw with markings ranging from 0 at one end to 300 at the other end, 
with five people of equal weight sitting at positions matching the shipments 
received by each Canadian city. Assuming a weightless seesaw, the individuals 
would be perfectly balanced when the pivot bar is located exactly at the 127.4 
position, as shown in part (b) of Figure 3.1. 

Figure 3.1 also reveals a potential weakness of the mean as a descriptor of typ- 
icalness. Notice in part (b) that three of the five cities received relatively moderate 
shipments, 15.0, 45.0, and 64.0 thousand bags, while the other two cities received 
228.0 and 285.0 thousand bags of peanuts. Thus each of the latter received more 
than the other three cities combined. This results in three of the five cities having 
shipments that are far below the average, making the arithmetic mean not as typical 
as we might like. A solution to this difficulty is the median, to be discussed shortly. 



FIGURE 3.1 



The arithmetic mean, 
or average, is a mathemati- 
cal counterpart to the cen- 
ter of gravity on a seesaw. 
Although the influence of 
the two values that are 
more than 200 thousand is 
not quite as great as that of 
Arnold, it causes the arith- 
metic mean to be greater 
than three of the five data 
values. 



C'mon Arnold, 
Mom says to 
let us down. 




5 



Center of gravity 
(a) Center of gravity on a seesaw 



45 



64 



285 



1 L L 



50 



100 



150 



200 



250 



300 



127.4 = Mean 



(b) Arithmetic mean of five data values in the text 
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The Weighted Mean 

When some values are more important than others, a weighted mean (sometimes 
referred to as a weighted average) may be calculated. In this case, each data value 
is weighted according to its relative importance. The formula for the weighted 
mean for a population or a sample will be as follows: 



Weighted mean, ^ w (for a population) or x (for a sample): 



M,,, or x „, = Jf' X ' where w ■ = weight assigned to the z'th data value 

W 2,W: , 1 ■ 1 J 1 

' X; = the zth data value 



Continuing with the peanut example, let's assume that shipments to the 
respective cities will be sold at the following profits per thousand bags: $15.00, 
$13.50, $15.50, $12.00, and $14.00. {Note: In this example, we are trying to 
determine the weighted mean for the profit; accordingly, the preceding profit data 
will constitute the x i values in our example and will be weighted by the shipment 
quantities to the five cities.) 

The average profit per thousand bags will not be (15.00 + 13.50 + 15.50 + 
12.00 + 14.00)/5, because the cities did not receive equal quantities of peanuts. A 
weighted mean must be calculated if we want to find the average profit per thou- 
sand bags for all shipments of peanuts from this U.S. exporter to Canada during 
this period. Using our assumed profit figures, this weighted mean is calculated as 

^WjXj 

_ (64)($15.0) + (15)($13.5) + (285)($15.5) + (228)($12.0) + (45)($14.0) 
64 + 15 + 285 + 228 + 45 

= $14.04 per thousand bags 



The Median 

In a set of data, the median is the value that has just as many values above it as 
below it. For example, the numbers of bags of peanuts (in thousands) shipped by 
the U.S. exporter to the five Canadian cities were 



115.0 



45.01 



Two values 
below 64.0 




1228.0 



285.01 



Two values 
above 64.0 



For these data, 64.0 is the value that has just as many values above it as below 
it. Montreal, which received 64.0 thousand bags of peanuts, is in the median 
position among the five Canadian cities. 

The preceding example had an odd number of data values (5), and the median 
was the observation with two values above and two below. For some data, how- 
ever, the number of values will be even. In such cases, the median will be halfway 
between the two values in the middle when the data are arranged in order of size. 



62 



Part 1: Business Statistics: Introduction and Background 



For example, Ryder System, Inc. reported the following data for percentage 
return on average assets over an 8 -year period. 1 

Raw data 2.8 7.0 1.6 0.4 1.9 2.6 3.8 3.8 

Ordered data 1 0.4 1.6 1.9| |2.6 2.8 1 |3.8 3.8 7.0\ % return on avg. assets 













Three values 






Three values 



Three values are equal to or above the 
two in the middle, and three are equal 
to or below them. The median is 
halfway between 2.6 and 2.8, or 2.7. 



For these data, the median, 2.7, is the value halfway between the two values 
in the middle of the ordered arrangement, 2.6 and 2.8. Unlike the mean, the 
median is not influenced by extreme high or low values in the data. For example, 
if the highest data value had been 600% instead of 7.0%, the median would still 
be 2.7%. 

The Mode 

In a set of data, the mode is a value that occurs with the greatest frequency. As 
such, it could be considered the single data value most typical of all the values. 
Consider again the 8 years of return-on-assets data reported by Ryder System, 
Inc.: 

Ordered data 0.4 1.6 1.9 2.6 2.8 13.8 3.81 7.0 % return on avg. assets 



Mode = 3.8 



For these data, the mode is 3.8, since it occurs more frequently than any other 
value. In this case, the mode does not appear to be a very good descriptor of the 
data, as five of the other six values are all smaller than 3.8. 

Depending on the data, there can be more than one mode. For example, if the 
leftmost data value had been 1.6 instead of 0.4, there would have been two 
modes, 1.6 and 3.8; if this were the case, the values 1.6 and 3.8 would each have 
occurred in two different years. When there are two modes, a distribution of 
values is referred to as bimodal. 

Comparison of the Mean, Median, and Mode 

As we have seen, the mean, median, and mode are alternative approaches to 
describing the central tendency of a set of values. In deciding which measure to 
use in a given circumstance, there are a number of considerations: 

• The mean gives equal consideration to even very extreme values in the data, 
while the median tends to focus more closely on those in the middle of the 
data array. Thus, the mean is able to make more complete use of the data. 

Source: Ryder System, Inc., 2005, 2003, and 1998 Annual Reports. 
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However, as pointed out earlier, the mean can be strongly influenced by just 
one or two very low or high values. A demonstration of how a single observa- 
tion can affect the mean and median is shown in Seeing Statistics Applet 1, at 
the end of the chapter. 

• There will be just one value for the mean and one value for the median. How- 
ever, as indicated previously, the data may have more than one mode. 

• The mode tends to be less useful than the mean and median as a measure of 
central tendency. Under certain circumstances, however, the mode can be 
uniquely valuable. For example, when a television retailer decides how many 
of each screen size to stock, it would do him little good to know that the mean 
television set sold has a 32.53-inch screen — after all, there is no such thing as 
a 32.53-inch television. Knowledge that the mode is 30 inches would be much 
more useful. 



The relative values of the mean, median, and mode are very much dependent on 
the shape of the distribution for the data they are describing. As Figure 3.2 shows, 
distributions may be described in terms of symmetry and skewness. In a symmetrical 
distribution, such as that shown in part (a), the left and right sides of the distribu- 
tion are mirror images of each other. The distribution in part (a) has a single 
mode, is bell shaped, and is known as the normal distribution. It will be discussed 
in Chapter 6 — for now, just note that the values of the mean, median, and mode 
are equal. 



Distribution Shape and Measures 
of Central Tendency 





Mean, Median, and Mode 



Mode Mean 
Median 

(b) Positively skewed distribution 



(a) Symmetrical distribution 




Mean | Mode 
Median 



(c) Negatively skewed distribution 



The shape of the distribution determines the relative positions of the mean, median, and mode for a set of data values. 






U.S. Males Post Skewed Incomes 

Data values are skewed positively when the mean exceeds 
the median, and incomes are especially likely to exhibit this 



type of distribution. For example, the incomes of males 
who were 25 or older in 2000 were distributed as follows: 



Age Group 



Income 


25 to <35 


35 to <45 


45 to <55 


55 to <65 


65 or over 


under $10,000 


8.4% 


7.7% 


8.4% 


11.6% 


18.7% 


$10,000-under $25,000 


29.9 


21.0 


17.7 


24.4 


45.2 


$25,000-under $50,000 


40.5 


36.7 


34.6 


30.4 


23.5 


$50,000-under $75,000 


13.8 


19.6 


20.3 


16.7 


6.4 


$75,000 or over 


7.4 


15.0 


19.0 


16.9 


6.3 




100.0 


100.0 


100.0 


100.0 


100.1* 


Median (thousands) 


$30.63 


$37.09 


$41.07 


$34.41 


$19.17 


Mean (thousands) 


$42.15 


$53.48 


$58.46 


$61.05 


$56.50 


*Differs from 100.0 due to rounding. 



For each of these distributions, the mean exceeds the 
median, reflecting that incomes are skewed positively 
(toward larger incomes) for all of these age groups. In addi- 
tion, the gap between the median and the mean tends to 
grow as we proceed from the youngest age group to the 
oldest. The gap is especially pronounced for 65-or-over 



males — many of these people are on fixed incomes, but 
others may have incomes that are quite substantial. 



Source: Bureau of the Census, U.S. Department of Commerce, Statistical 
Abstract of the United States 2002, pp. 439-440. 



Skewness refers to the tendency of the distribution to "tail off" to the right or 
left, as shown in parts (b) and (c) of Figure 3.2. In examining the three distribu- 
tion shapes shown in the figure, note the following relationships among the mean, 
median, and mode: 

• Symmetrical distribution The mean, median, and mode are the same. This 
will be true for any unimodal distribution, such as this one. (When a distribu- 
tion is bimodal, it will of course be impossible for the mean, median, and both 
modes to be equal.) 

• Positively skewed distribution The mean is greater than the median, which 
in turn is greater than the mode. Income distributions tend to be positively 
skewed, since there is a lower limit of zero, but practically no upper limit on 
how much a select few might earn. In such situations, the median will tend to 
be a better measure of central tendency than the mean. Statistics in Action 3.1 
shows the income distribution for male income-earners in the United States. 
For each of the five age groups, the mean is greater than the median, indicat- 
ing that all of the distributions are positively skewed. 

• Negatively skewed distribution The mean is less than the median, which in 
turn is less than the mode. Data that have an upper limit (e.g., due to the max- 
imum seating capacity of a theater or stadium) may exhibit a distribution that 
is negatively skewed. As with the positively skewed distribution, the median is 
less influenced by extreme values and tends to be a better measure of central 
tendency than the mean. 
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Computer Solutions 3.1 describes the procedures and shows the results when 
we use Excel and Minitab to obtain descriptive statistics for the automated teller 
machine usage times for the 50 customers discussed in Chapter 2. The data are in 
file CX02ATM. The printouts are typical of what we would expect from most sta- 
tistical software. Shown below are the printout items that pertain to the measures 
of central tendency discussed in this section and our interpretation of their values. 

• Sample size = 50 Excel refers to this as "count," while Minitab calls it "N". 
Computer printouts often include only capital letters, so "N" can still appear 
instead of the "n" that would be more appropriate for describing the size of a 
sample. (The N* term in the Minitab printout shows how many data values were 
either blank or missing. There are no missing values in this database, so N* is 0.) 

• Mean = 39.76 seconds and Median = 37.80 seconds As the printout shows, 
the average customer required 39.76 seconds to complete his or her trans- 
action, and just as many people had a time longer than 37.80 seconds as had 
a shorter time. Since the mean is greater than the median, the distribution is 
positively skewed. 

• Mode = 37.80 seconds Excel reports the most frequent data value as 37.80 
seconds. Minitab can also display the mode, but it has not been selected for 
this printout. 

• Skewness = 0.23 Excel has quantified the skewness and reported it as 
+0.23. The sign indicates a positively skewed distribution — that is, one in 
which the mean exceeds the median. 

Not shown in either printout is a measure called the trimmed mean, in which 
statistical software ignores an equal number or percentage of data values at both 
the high end and the low end of the data before it is calculated. This descriptor 
can optionally be shown by Minitab, but not by Excel. Minitab's trimmed mean 
ignores the smallest 5% and the largest 5% of the data values. The purpose of the 
trimmed mean is to reduce the effect exerted by extreme values in the data. 



cc 


imputer so 
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Descriptive Statistics 
EXCEL 





A 


B 


C 


D 


E I F 


1 


AgeCat 


Gender 


Seconds 




Seconds 


2 


1 


2 


50.1 








3 


1 


1 


53.0 




Mean 


39.76 


4 


2 


2 


43.2 




Standard Error 


1.26 


5 


1 


2 


34.9 




Median 


37.80 


6 


3 


1 


37.5 




Mode 


37.80 


7 


2 


1 


37.8 




Standard Deviation 


8.92 


8 


3 


1 


49.4 




Sample Variance 


79.49 


9 


1 


1 


50.5 




Kurtosis 


-0.87 


10 


3 


1 


48.1 




Skewness 


0.23 


11 


1 


2 


27.6 




Range 


34 


12 


3 


2 


55.6 




Minimum 


24.1 


13 


2 


2 


50.8 




Maximum 


58.1 


14 


3 


1 


43.6 




Sum 


1987.8 


15 


1 


1 


35.4 




Count 


50 



(continued) 
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1 . Open the data file CX02ATM.XLS. The labels and data values have already been entered as shown in the printout. 

2. Click Tools. Click Data Analysis. In the Analysis Tools menu, click Descriptive Statistics. Click OK. 

3. In the Input section, enter C1:C51 into the Input Range box. In the Grouped By section, click to select Columns. 
Because the input range includes the name of the variable in the first row, click to place a check mark in the Labels in 
First Row box. 

4. In the Output Options section, click to select Output Range, then enter E1 into the Output Range box. Click to 
place a check mark in the Summary Statistics box. Click OK. 

5. Appearance improvements can be made by changing column widths and reducing the number of decimal places 
shown in the column of descriptive statistics. 



MINITAB 

Descriptive Statistics: Seconds 

Variable N N* Mean SE Mean StDev Minimum Ql Median Q3 Maximum 

Seconds 50 0 39.76 1.26 8.92 24.10 32.67 37.80 46.70 58.10 

1 . Open the data file CX02ATM.MTW. The times (seconds) for the 50 ATM users are in column C3. 

2. Click Stat. Select Basic Statistics. Click Display Descriptive Statistics. Enter C3 into the Variables box. To get the 
printout shown here, click OK. [Note: As an option, we could easily obtain descriptive statistics for the males compared 
with those for the females. Just click to place a check mark next to By variable, then enter C2 (the gender variable, 
where 1 = male, 2 = female) into the By variable box.] 



exercises 




3.1 Determine the mean and the median wage rate for 
the data in Exercise 2.9. 

3.2 Using the data in Exercise 2.11, determine the mean 



What was the mean net rate of investment income for this 
period? The median? source: American Council of Life Insurance, 
Life Insurance Fact Book 1999, p. 98. 



and the median number of goals per season that 


Year 


Percentage 


Year 


Percentage 


Mr. Gretzky scored during his 20 seasons in the National 










Hockey League. 


1979 


7.73 


1989 


9.10 


1980 


8.02 


1990 


8.89 


3.3 Erika operates a website devoted to providing infor- 


1981 


8.57 


1991 


8.63 


mation and support for persons who are interested in 


1982 


8.91 


1992 


8.08 


organic gardening. According to the hit counter that 


1983 


8.96 


1993 


7.52 


records daily visitors to her site, the numbers of visits 


1984 


9.45 


1994 


7.14 


during the past 20 days have been as follows: 65, 36, 52, 


1985 


9.63 


1995 


7.41 


70, 37, 55, 63, 59, 68, 56, 63, 63, 43, 46, 73, 41, 47, 75, 


1986 


9.35 


1996 


7.25 


75, and 54. Determine the mean and the median for these 


1987 


9.10 


1997 


7.35 


data. Is there a mode? If so, what is its value? 


1988 


9.03 


1998 


6.95 



3.4 A social scientist for a children's advocacy organiza- 
tion has randomly selected 10 Saturday-morning 
television cartoon shows and carried out a content analy- 
sis in which he counts the number of incidents of verbal 
or physical violence in each. For the 10 cartoons examined, 
the counts were as follows: 27, 12, 16, 22, 15, 30, 14, 
30, 11, and 21. Determine the mean and the median for 
these data. Is there a mode? If so, what is its value? 

3.5 For 1979 through 1998, the net rate of investment 
income for U.S. life insurance companies was as follows. 



3.6 The following is a list of closing prices for 15 stocks 
widely held by Merrill Lynch account holders. What 
was the average closing price for this sample of stocks? 
The median closing price? source: USA Today, August 22, 
2000, p. 4B. 

Company Price Company Price 



AT8TT 

America Online 
Cisco 



$31.69 
56.69 
65.50 



ExxonMobil 

GE 

Intel 



$83.50 
56.88 
72.06 
(continued) 
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Company Price Company Price 



IBM $121.44 Microsoft $70.63 

Johnson Motorola 35.81 

& Johnson 97.00 Oracle 83.19 

Lucent 42.25 Pfizer 43.63 

Merck 71.88 SBC Comm. 40.06 

3.7 A reference book lists the following maximum life 
spans (in years) for animals in captivity: source: The World 
Almanac and Book of Facts 2006, p. 305. 

Beaver 50 Grizzly bear 50 

Bison 40 Horse (domestic) 50 

Camel 50 Kangaroo 24 

Cat (domestic) 28 Moose 27 

Chimpanzee 60 Pig 27 

Cow 30 Polar bear 45 

Dog (domestic) 20 Rabbit 13 

Asian elephant 77 Sea lion 34 

Giraffe 36 Sheep 20 

Goat 18 Tiger 26 

Gorilla 54 Zebra 50 



What is the mean of these maximum life spans? The 
median? 

3.8 According to a utility company, utility plant 
expenditures per employee were approximately $50,845, 
$43,690, $47,098, $56,121, and $49,369 for the years 
2002 through 2006. Employees at the end of each year 
numbered 4738, 4637, 4540, 4397, and 4026, 
respectively. Using the annual number of employees as 
weights, what is the weighted mean for annual utility 
plant investment per employee during this period? 

3.9 A student scored 90 on the midterm exam, 78 on the 
final exam, and 83 on the term project. If these three 
components are weighted at 35%, 45%, and 20%, 
respectively, what is the weighted mean for her course 
performance? 

3.10 An observer stands at an overpass and, for each 
motorcycle that passes below, records the value of x = 
the number of riders on the motorcycle. 

a. What value would you anticipate for the mode of her 
data? Why? 



b. Would you anticipate that the mean would be greater 
than or less than the mode? Why? 

c. Would you anticipate that the mean would be greater 
than or less than the median? Why? 

3.11 In preparation for upcoming contract negotiations, 
labor and management representatives of the Hoffer Con- 
crete Company have been collecting data on the earnings 
of a sample of employees (ranging from the president to 
the lowest paid "go-fer") at similar firms. Assuming that 
such sample data are available, and that the mean and 
median have been identified: 

a. Which of these measures of central tendency would 
be likely to best serve the purposes of the 
management representative? Why? 

b. Which of these measures of central tendency would 
be likely to best serve the purposes of the union rep- 
resentative? Why? 

/ data Set / Note: Exercises 3.12-3.15 require a 
computer and statistical software. 

3.12 For the U.S. News & World Report top-50 U.$. law 
schools in Exercise 2.67, find the mean and median scores 
that the schools received from the academic evaluators. 
Compare these with the mean and median scores awarded 
by the lawyer/judge evaluators. Does either set of evalua- 
tors seem to provide ratings that are positively skewed or 
negatively skewed? The data are in file XR02067. 

3.13 Each of the 100 air compressors described in Exer- 
cise 2.71 was tested to measure the amount of pressure it 
could exert. Find the mean and the median pressure, and 
comment on possible skewness of the data. The data are 
in file XR02071. 

3.14 A personnel administrator has collected data on 100 
employees, including gender, age, and number of days 
absent during the previous year. Compare the mean and 
median number of days absent for females (coded as 
gender = 1) versus males (coded as gender = 2) within 
the company. The data are in file XR03014. 

3.15 For the data in Exercise 3.14, compare the mean 
and median ages for female employees (gender code = 1) 
versus male employees (gender code = 2). The data are in 
file XR03014. 



STATISTICAL DESCRIPTION: MEASURES 3.3 
OF DISPERSION 



Although the mean and other measures are useful in identifying the central ten- 
dency of values in a population or a set of sample data, it's also valuable to describe 
their dispersion, or scatter. As a brief introduction to the concept of dispersion, con- 
sider the three distributions in Figure 3.3 (page 68). Although distributions B and C 
have the same mean (/jl = 12.5 ounces), notice that B has a much wider dispersion 
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FIGURE 3.3 



The dispersion of a distribu- 
tion refers to the amount of 
scatter in the values. Distri- 
butions A, B, and C 
describe outputs for a 
machine that fills cereal 
boxes. Although B and C 
have the same mean, B has 
much greater dispersion 
and will result in many of 
the boxes being filled to 
less than the 1 2 ounces 
stated on the package. 



Frequency 




Mean of B = Mean of C 

[Dispersion] > [Dispersion] 
ofB J L ofC J 



10 



11 



Ounces 




Stated weight 
on package 



14 



and that both B and C have a much larger mean than A (/x. = 9.0 ounces). Consid- 
ering these three distributions as representing the output of a machine that fills 
boxes of breakfast cereal, we can make the following observations: 

• Machine A is set too low. On the average, it fills boxes to just 9 ounces instead 
of the 12 ounces advertised on the package. The firm's legal counsel will soon 
be very busy. 

• Machine B delivers an average of 12.5 ounces, but the dispersion is so wide 
that many of the boxes are being underfilled. The firm's legal counsel will 
soon be moderately busy. 

• Machine C delivers an average of 12.5 ounces, and the dispersion is narrow 
enough that very few of the boxes are being underfilled. The firm's legal counsel 
can relax. 

As this example shows, measures of dispersion can have great practical value 
in business. We'll now discuss several of the more common measures, including 
the range, quantiles, mean absolute deviation, variance, and standard deviation. 

Range 

The simplest measure of dispersion, the range is the difference between the highest 
and lowest values. Part (a) of Figure 3.4 lists new privately owned housing 
starts for each of the 50 U.S. states. When the data are arranged in numerical 
order in part (b) of Figure 3.4, we see a range of 270.2 thousand housing starts 
(271.4 thousand for California minus the 1.2 thousand for Wyoming). 

Although the range is easy to use and understand, it has the weakness of being 
able to recognize only the extreme values in the data. For example, if we were to 
eliminate the three highest states, 94% of the states would still be represented, but 
the range would now be only 71.9 thousand starts (73.1 thousand for Georgia 
minus the 1.2 thousand for Wyoming). Thus the range would be cut to just one- 
fourth of its original value despite the unchanged relative positions of the other 
47 states. 

Ads such as "Earn from $10 to $10,000 a week by sharpening saws at 
home" take advantage of this aspect of the range. The vast majority of such 
entrepreneurs may well earn less than $20 per week. However, it takes only one 
unusually successful person for the range to be so impressive. If there is even one 
very extreme value, the range can be a large number that is misleading to the unwary. 
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FIGURE 3.4 


















(a) Raw data (new housing starts, in thousands) 










State Starts 


State 


Starts 


State 


Starts 


State 


Starts 


State 


Starts 


AL 


17.2 


HI 


7.3 


MA 


39.2 


NM 


1 1.8 


SD 


2.5 


AK 


4.0 


ID 


4.3 


Ml 


37.6 


NY 


61.9 


TN 


38.1 


AZ 


71.8 


IL 


38.7 


MN 


28.6 


NC 


70.7 


TX 


143.1 


AR 


9.9 


IN 


23.0 


MS 


o.o 


ND 


2.6 


UT 


16.5 


CA 271.4 


IA 


5.2 


MO 


27.2 


OH 


33.0 


VT 


4.1 


CO 


32.8 


KS 


13.3 


MT 


2.0 


OK 


10.7 


VA 


64.1 


CT 


24.5 


KY 


13.8 


NE 


5.0 


OR 


1 1.3 


WA 


35.5 


DE 


4.6 


LA 


18.8 


NV 


14.0 


PA 


43.6 


WV 


1.5 


FL 202.6 


ME 


8.1 


NH 


17.8 


Rl 


5.4 


Wl 


20.2 


GA 


73.1 


MD 


42.1 


NJ 


55.0 


SC 


32.8 


WY 


1.2 


(b) Data arranged from smallest to largest 












1.2 




5.2 




13.8 






28.6 




43.6 


1.5 




5.4 




14.0 






32.8 




55.0 


2.0 




7.3 




16.5 






32.8 




61.9 


2.5 




8.1 




17.2 






33.0 




64.1 


2.6 




8.8 




17.8 






35.5 




70.7 


4.0 




9.9 




18.8 






37.6 




71.8 


4.1 




10.7 




20.2 






38.1 




73.1 


4.3 




11.3 




23.0 






38.7 




143.1 


4.6 




11.8 




24.5 






39.2 




202.6 


5.0 




13.3 




27.2 






42.1 




271.4 


(c) Quartiles 


















25% of values 
\ 
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\ 


25% of values 
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i 
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0 




10 


20 


30 


40 


50 


60 


70 


80 


I 

2, = 6.8 Q 2 = 
Interquartile range = g 3 - Q l = 38.8 


\ 

18.3 
-6.8 = 


I 

Q 3 = 38.8 
32.0 Quartile deviation = 


Q3-Q1 _ 38. 
2 


5 - 6.8 
2 


16.0 



Raw data, ordered data, and quartiles for new privately owned housing starts in the 50 U.S. states 
Source: Bureau of the Census, U.S. Department of Commerce, Construction Reports, series C20. 
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The midrange, a variant of the range, is the average of the lowest data value 
and the highest data value. Although it is vulnerable to the effects of extremely 
low or high outliers in the data, the midrange is sometimes used as a very approx- 
imate measure of central tendency. For the data in Figure 3.4, the midrange would 
be (271.4 + 1.2)/2, or 136.3 thousand housing starts. 



Quantiles 

We have already seen how the median divides data into two equal-size groups: 
one with values above the median, the other with values below the median. 
Quantiles also separate the data into equal-size groups in order of numerical 
value. There are several kinds of quantiles, of which quartiles will be the primary 
topic for our discussion. 

PERCENTILES divide the values into 100 parts of equal size, each comprising 1 % 
of the observations. The median describes the 50th percentile. 

DECILES divide the values into 1 0 parts of equal size, each comprising 1 0% of the 
observations. The median is the 5th decile. 

QUARTILES divide the values into four parts of equal size, each comprising 25% 
of the observations. The median describes the second quartile, below which 50% 
of the values fall. (Note: In some computer statistical packages, the first and third 
quartile values may be referred to as hinges.) 

After values are arranged from smallest to largest, quartiles are calculated 
similarly to the median. It may be necessary to interpolate (calculate a position 
between) two values to identify the data position corresponding to the quartile. 

For N values arranged from lowest to highest: 

., ~ , .. (N+l) 

First quartile, Q 1 = Data value at position 



Second quartile (the median), Q 2 = Data value at position 
Third quartile, Q 3 = Data value at position 



4 

2(N + 1) 
4 

3(N + 1) 



4 

(Use N if data represent a population, n for a sample.) 

As an example of how this works, we'll calculate the quartiles for the data 
shown in part (b) of Figure 3.4. Since there are 50 data values, representing 
a population consisting of the 50 U.S. states, N = 50 will be used in the 
calculations. 

The first quartile, Q v is located at the (50 + l)/4, or 12.75 position. Since 
there is no "12.75th" state, we interpolate between the 12th and 13th, going 75% 
of the way from the 12th state to the 13th state: 

Position x 12 x ins x \3 

Data value 5.4 | 6.8 | 7.3 thousand housing starts 



75% of the way from 5.4 to 7.3, Q 1 = 6.8 
(6.825 rounded to one decimal place) 
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In calculating Q 7 , which is also the median, we identify what would be the 
2(50 + l)/4, or 25.5th data value. This is halfway between the 25th state (17.8 
thousand housing starts) and the 26th state (18.8), or Q ? = 18.3. The third 
quartile corresponds to the position calculated as 3(50 + l)/4, or the 38.25th 
data value. Interpolating 25% of the way from the 38th state (38.7 thousand 
housing starts) to the 39th state (39.2), we find Q 3 = 38.8 thousand housing 
starts (38.825 rounded to 38.8). Although the rounding is not strictly necessary, 
it keeps the quartiles to the same number of decimal places as the original data. 

Quartiles offer a special feature that allows us to overcome the inability of the 
range to consider any but the most extreme values. In particular, there are two 
useful descriptors that involve quartiles: 

The interquartile range is the difference between the third quartile and the 
first quartile, or Q 3 — Q v In percentile terms, this is the distance between 
the 75% and 25% values. 

The quartile deviation is one-half the interquartile range. The quartile 
deviation is (Q 3 — Qj)/2. 

Because they reduce the importance of extreme high and low values, the 
interquartile range and quartile deviation can be more meaningful than the range 
as indicators of the dispersion of the data. In addition, they consider more than 
just the two extreme values, thus making more complete use of the data. Part (c) of 
Figure 3.4 shows a sketch of the approximate distribution of the 50-state housing- 
starts data, along with the quartiles, the interquartile range, and the quartile 
deviation. The distribution is positively skewed (i.e., it tapers off toward a few very 
high values at the right), and the mean (34.6) is much greater than the median (18.3). 

Mean Absolute Deviation (MAD) 

With this descriptor, sometimes called the average deviation or the average abso- 
lute deviation, we now consider the extent to which the data values tend to differ 
from the mean. In particular, the mean absolute deviation (MAD) is the average 
of the absolute values of differences from the mean and may be expressed as 
follows: 

Mean absolute deviation (MAD) for a population: 

MAD = — where /jl = population mean 

x i = the ?'th data value 

N = number of data values in the population 
(To calculate MAD for a sample, substitute n for N and x for fi.) 

In the preceding formula, we find the sum of the absolute values of the differ- 
ences between the individual values and the mean, then divide by the number of 
individual values. The two vertical lines ("| |") tell us that we are concerned with 
absolute values, for which the algebraic sign is ignored. To illustrate how the 
mean absolute deviation is calculated, we'll examine the following figures, which 
represent annual research and development (R&D) expenditures by Microsoft 
Corporation. 2 

^Source: Microsoft Corporation, 2005 Annual Report. 
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TABLE 3.1 



Calculation of mean 
absolute deviation for 
annual research and 
development (R&D) 
expenditures for Microsoft 
Corporation. Data are in 
millions of dollars. 



R&D 



Deviation 
from Mean 



Absolute Value of 
Deviation from Mean 



Year 




(x, - ft) 


1*/ " Ml 


2001 


4379 


-1868.2 


1868.2 


2002 


6299 


51.8 


51.8 


2003 


6595 


347.8 


347.8 


20U4 


/ / /y 


I 53 I .5 


1 53 1 .o 


2005 


6184 


-63.2 


63.2 




31,236 


0.0 


3862.8 




= ^Xj 




= ^ 1*/ ~ M 




Mean: 


2x, 31,236 
M= N = 5 


= $6247.2 million 



Mean absolute deviation: MAD = 



2 \x, - p\ 3862.8 



N 



= $772.6 



Year 2001 2002 2003 2004 2005 

R&D 4379 6299 6595 7779 6184 (millions of dollars) 

If we consider these data as representing the population of values for these 
5 years, the mean will be fi. It is calculated as 

4379 + 6299 + 6595 + 7779 + 6184 .„. 
/x = = $6247.2 million 

As shown in Table 3.1, the mean absolute deviation is the sum of the absolute 
deviations divided by the number of data values, or 3862.8/5 = $772.6 million. 
On the average, annual expenditures on research and development during these 
5 years were $772.6 million away from the mean. In the third column of 
Table 3.1, note that the sum of the deviations is zero. This sum will always be 
zero, so it isn't of any use in describing dispersion, and we must consider only the 
absolute deviations in our procedure. 



Variance and Standard Deviation 

Variance 

The variance, a common measure of dispersion, includes all data values and is cal- 
culated by a mathematical formula. For a population, the variance (o- 2 , "sigma 
squared") is the average of squared differences between the N data values and the 
mean, /a. For a sample variance (s 2 ), the sum of the squared differences between 
the n data values and the mean, x, is divided by (n — 1). These calculations can be 
summarized as follows: 

Variance for a population: 

population variance 
population mean 
the fth data value 
number of data values 
in the population 



2 2 - /a) 2 2 

o-^ = — where a A 

N 

x i 
N 
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Variance for a sample: 






2 2 (a^ - %) 2 

s = 1 

n — 1 


where s 2 


= sample variance 

— SallipiC IllCall 




*i 


= the ith data value 




w 


= number of data values 
in the sample 



Using the divisor (n — 1) when calculating the variance for a sample is a 
standard procedure that makes the resulting sample variance a better estimate 
of the variance of the population from which the sample was drawn. Actually, 
for large sample sizes (e.g., n > 30) the subtraction of 1 from n makes very 
little difference. 

Part A of Table 3.2 shows the calculation of the variance for a sample of 
estimates of highway miles per gallon (mpg) for five sports-utility vehicles. 
In the table, notice the role played by residuals. These are the same as the 
deviations from the mean discussed in the preceding section. For each obser- 
vation, the residual is the difference between the individual data value and the 
mean. 



Residual for a data _ , _ . 

, . , = X- — x where x = sample mean 

value m a sample ' 

x i = value of the 
ith observation 

For example, the residual for the third observation (21 mpg) is 21.0 — 20.0 = 
1.0 mpg. The residual is an important statistical concept that will be discussed 
further in later chapters. 





Highway 




Residual 


Residual 2 


Model 


mpg (x f ) 




(Xj - X) 


(X; " X) 2 


Chevrolet Blazer 


23 


529 


3.0 


9.0 


Toyota 4Runner 


21 


441 


1.0 


1.0 


Honda Passport 


21 


441 


1.0 


1.0 


Lincoln Navigator 


17 


289 


-3.0 


9.0 


Chevrolet Suburban 


18 


324 


-2.0 


4.0 




100 


2024 




24.0 




= Ex ( . 


= Exf 




= 2(X, - X) 2 


X 


_ Ex,- 100 
5 ~ 5 


= 20.0 miles per 


gallon 




s 2 


E(x, - x) 2 
n — 1 


24.0 


s = V6X) 


= 2.45 



TABLE 3.2 



Calculation of variance (s 2 ) 
and standard deviation (s) 
for estimated highway fuel 
economy (mpg) for five 
2000 sports-utility vehicles 



Source: U.S. Environmental Protection Agency, 2000 Gas Mileage Guide. 
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Standard Deviation 

The positive square root of the variance of either a population or a sample is a 
quantity known as the standard deviation. The standard deviation is an especially 
important measure of dispersion because it is the basis for determining the pro- 
portion of data values within certain distances on either side of the mean for cer- 
tain types of distributions (we will discuss these in a later chapter). The standard 
deviation may be expressed as 

For a For a 

Population Sample 

Standard Deviation <x = vcr 2 s = Vs 2 

For the data in Table 3.2, the sample standard deviation is 2.45 miles 
per gallon. This is calculated by taking the square root of the sample 
variance, 6.0. 

Variance and Standard Deviation: Additional Notes 

• Unless all members of the population or sample have the same value (e.g., 33, 
33, 33, 33, and so on), the variance and standard deviation cannot be equal 
to zero. 

• If the same number is added to or subtracted from all the values, the variance 
and standard deviation are unchanged. For example, the data 24, 53, and 36 
will have the same variance and standard deviation as the data 124, 153, and 
136. Both the variance and standard deviation depend on the residuals (differ- 
ences) between the observations and the mean, and adding 100 to each data 
value will also increase the mean by 100. 

Computer Solutions 3.2 describes the procedures and shows the results when 
we use Excel and Minitab to obtain descriptive statistics for the housing-starts 
data shown in Figure 3.4. The data are in file CX03HOUS. The printout items per- 
taining to the measures of dispersion in this section and an interpretation of their 
values follow: 

• Standard Deviation (or StDev) = 49.78 The standard deviation is s = 49.78 
thousand housing starts during the year. Excel also lists the variance, although 
this could be readily calculated as the square of the standard deviation. 

• Standard Error (SE Mean) = 7.04 This is the standard error for the mean, a 
term that will not be discussed until Chapter 8. Disregard it for now. 

• Minimum = 1.20 and Maximum = 271.40 These are the values for the min- 
imum and maximum observations in the data. Excel also provides the range, 
which is simply 271.40 — 1.20, or 270.20 thousand housing starts. 

• Ql = 6.83 and Q2 = 38.83 Reported by Minitab, the first and third quar- 
tiles are 6.83 and 38.83: 50% of the states had between 6.83 and 38.83 thou- 
sand housing starts. Excel does not include the first and third quartiles in its 
printout of summary statistics, but you can get at least approximate values by 
using the procedure described in Computer Solutions 3.2. If you are using the 
Data Analysis Plus add-in with Excel, the first and third quartiles are provided 
along with the Boxplot; this is demonstrated in Computer Solutions 3.3 in the 
next section. 

Statistical software typically assumes the data are a sample instead of a population 
and uses (number of observations — 1 ) in calculating the standard deviation. This 
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computer solutions 3.2 



Descriptive Statistics 
EXCEL 





A 


B 


C 


D 


1 


Hstarts 




Hstarts 


2 


17.2 








3 


4.0 




Mean 


34.65 


4 


71.8 




Standard Error 


7.04 


5 


9.9 




Median 


18.30 


6 


271.4 




Mode 


32.80 


7 


32.8 




Standard Deviation 


49.78 


8 


24.5 




Sample Variance 


2478.55 


9 


4.6 




Kurtosis 


12.20 


10 


202.6 




Skewness 


3.27 


11 


73.1 




Range 


270.20 


12 


7.3 




Minimum 


1.20 


13 


4.3 




Maximum 


271 .40 


14 


38.7 




Sum 


1732.30 


15 


23.0 




Count 


50 




Open the data file CX03HOUS.XLS. The steps are the same as for Computer Solutions 3.1, except that we enter A1:A51 
into the Input Range box and C1 into the Output Range box. Excel does not include the first and third quartiles in its 
printout of summary statistics. However, if you have a large number of data values, you can display at least approximate 
first and third quartiles by dividing the sample size by 4, rounding to the nearest integer if necessary, then entering this 
number into the Kth Largest box as well as the Kth Smallest box during step 3. 



MINITAB 

Descriptive Statistics: Hstarts 

Variable N N* Mean SE Mean StDev Minimum Ql Median Q3 Maximum 

Hstarts 50 0 34.65 7.04 49.78 1.20 6.82 18.30 38.83 271.40 

Open the data file CX03HOUS.MTW. The steps are the same as for Computer Solutions 3.1, except the 50 values for the 
housing starts are in column C1. 



is usually not a problem because (1) much of the data in business statistics will 
actually be sample data, and (2) there will not be an appreciable difference when the 
number of observations is large. However, if your data do represent a population, 
and you would like to know the value of a instead of s, you can multiply s from the 
printout to get a, as shown here: 



Standard deviation 



Standard deviation , , 4 N — 

i , , XT , j. ■ = based on (N — 1) X \/ 

based on (N) divisor v V N 
divisor 



exercises f ,^ 




3.16 Provide a real or hypothetical example of a situation 
where the range could be misleading as a measure of 
dispersion. 

3.17 For the sample of computer website hits described 
in Exercise 3.3, determine the range, the mean 
absolute deviation, the standard deviation, and the 
variance. 

3.18 For the sample of Saturday-morning cartoon 
violence counts described in Exercise 3.4, determine the 
range, the mean absolute deviation, the standard 
deviation, and the variance. 

3.19 During July 2005, the seven most-visited shopping 
websites (with number of visitors, in millions) were: 
eBay (61.7), Amazon (42.0), Wal-Mart (22.6), 
Shopping.com (22.4), Target (20.5), Apple Computer 
(17.4), and Overstock.com (17.2). Considering these as 
a population, determine: source: The World Almanac and Book 
of Facts 2006, p. 390. 

a. The mean, median, range, and midrange. 

b. The mean absolute deviation. 

c. The standard deviation and variance. 

3.20 For a sample of 11 employers, the most recent 
hourly wage increases were 18, 30, 25, 5, 7, 2, 20, 12, 
15, 55, and 40 cents per hour. For these sample data, 
determine: 

a. The mean, median, range, and midrange. 

b. The mean absolute deviation. 

c. The standard deviation and variance. 

3.21 According to the U.S. Environmental Protection 
Agency, a sample of 10 subcompact models shows the 
following estimated values for highway fuel economy 
(mpg): 40, 33, 32, 30, 27, 29, 27, 23, 21, and 10. For 
these sample data, determine: 

a. The mean, median, range, and midrange. 

b. The mean absolute deviation. 

c. The standard deviation and variance. 

3.22 For a sample of eight apartment buildings in 
Chicago, the percentage-occupancy rates during a recent 
month were as shown below: 

50.0% 79.1 61.3 43.9 
68.1 70.0 72.5 66.3 
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For these sample data, determine: 

a. The mean, median, range, and midrange. 

b. The mean absolute deviation. 

c. The standard deviation and variance. 

3.23 Determine the first, second, and third quartiles for 
the data in Exercise 3.20; then calculate the interquartile 
range and the quartile deviation. 

3.24 Determine the first, second, and third quartiles for 
the data in Exercise 3.21; then calculate the interquartile 
range and the quartile deviation. 

/ data Set / Note: Exercises 3.25-3.27 require a 
computer and statistical software. 

3.25 During the 1999 holiday shopping season, Keynote 
Systems, Inc. visited a number of e-commerce websites 
to see how long it took for the site to come up on the 
PC of a consumer using a 56K modem. The average 
time for Wal-Mart.com was 23.35 seconds. Under the 
assumption that the company visited Wal-Mart's site on 
50 occasions and obtained the sample times (in seconds) 
in data file XR03025, determine and interpret: source: 
"How Key Web Sites Handle Holiday Shopping Rush," USA Today, 
November 24, 1999, p. 3B. 

a. The mean, median, range, and midrange. 

b. The mean absolute deviation. 

c. The standard deviation and variance. 

3.26 Data file XR03014 includes the number of absences 
from work for a sample of 100 employees. For the vari- 
able representing number of days absent, determine and 
interpret the following: 

a. The mean, median, range, and midrange. 

b. The mean absolute deviation. 

c. The standard deviation and variance. 

3.27 Data file XR02028 shows the depths (in meters) that 
80 water-resistant watches were able to withstand just 
before leakage. For the watches in this sample, determine 
and interpret the following: 

a. The mean, median, range, and midrange. 

b. The mean absolute deviation. 

c. The standard deviation and variance. 
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ADDITIONAL DISPERSION TOPICS 



The Box-and-Whisker Plot 

Also referred to as a box plot, the box-and-whisker plot is a graphical device that 
simultaneously displays several of the measures of central tendency and disper- 
sion discussed previously in the chapter. It highlights the first and third quartiles, 
the median, and the extreme values in the data, allowing us to easily identify these 
descriptors. We can also see whether the distribution is symmetrical or whether it 
is skewed either negatively or positively. 

Computer Solutions 3.3 describes the procedures and shows the results when 
we use Excel and Minitab to generate a box plot for the housing-starts data 




The Box Plot 
EXCEL 





A 


B 


C 


D 


E 


F 


G 


H 


1 


Box Plot 
















2 


Hstarts 
















3 


Smallest = 1.2 














4 


Q1 = 6.825 














5 


Median = 18.3 














6 


Q3 = 38.825 














7 


Largest = 271.4 














8 


IQR = 32 














9 


Outliers: 271.4, 202.6, 143.1, 












10 
11 
12 


I 














BoxPlot 






13 








14 
15 




• • • 


16 




-l—l 1 




17 
18 








19 
20 


- 0.0 50.0 100.0 150.0 200.0 250.0 300.0 




21 


I I I I I 





1. Open the data file CX03HOUS.XLS. Click on A1 and drag to A51 to select the data values in cells A1:A51. Click 
Tools. Click Data Analysis Plus. 

2. In the Data Analysis Plus menu, click Box Plot. Click OK. Ensure that the Input Range box contains the range you 
specified in step 1. Click Labels. Click OK. 

3. Improve the appearance by selecting and rearranging the components of the printout. 



(continued) 



78 



Part 1: Business Statistics: Introduction and Background 



MINITAB 



Boxplot of Hstarts 



50 



100 



150 
Hstarts 



200 



250 



300 



1 . Open data file CX03HOUS.MTW. The 50 values for the housing starts are in column CI. 

2. Click Graph. Click Boxplot. Select One Y Simple and click OK. In the Graph variables box, enter C1 . 

3. Click Scale and select the Axes and Ticks menu. Select Transpose value and category scales. (This causes the box 
plot to be oriented horizontally.) Click OK. Click OK. 



shown in Figure 3.4. The data are in file CX03HOUS. The box plot has the follow- 
ing components: 

• Quartiles The box extends horizontally from the first quartile (25th percentile) 
to the third quartile (75th percentile). Q 1 and Q 3 are shown at the left and right 
sides of the box, and the width of the box is the interquartile range. 

• Median This is shown by the vertical line within the box. It is off-center 
within the box, being located slightly left of center. Because the median is 
closer to Q 1 than to Q 3 , the distribution is skewed positively. 

• Extreme values The whisker on the left side of the box extends to the small- 
est data value, Wyoming's 1.2 thousand housing starts. The whisker on the 
right side extends to cover all but the three most distant data values, which are 
considered outliers. 



Chebyshev's Theorem 

When either a population or a sample has a small standard deviation, individual 
observations will tend to be closer to the mean. Conversely, a larger standard 
deviation will result when individual observations are scattered widely about 
their mean. An early mathematician named Chebyshev set forth a theorem that 
quantifies this phenomenon. It specifies the minimum percentage of observations 
that will fall within a given number of standard deviations from the mean, regard- 
less of the shape of the distribution. 
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CHEBYSHEV'S THEOREM For either a sample or a population, the percentage 
of observations that fall within k (for k > 1) standard deviations of the mean will 
be at least 

1 - -r X 100 



To demonstrate this rule, let's examine how it applies to the housing-starts 
data in Figure 3.4. For these data, values for the population mean and standard 
deviation are p, = 34.6 and cr = 49.3, respectively (in thousands of housing 
starts). Using Chebyshev's theorem, we can find the percentage of values that 
should fall within k = 2 and k = 3 standard deviations of the mean. For example, 
if k = 2, this percentage should be at least [1 — (1/2 2 )](100), or 75%. Expressed 
another way, we should find that at least 75% of the states have housing starts 
that fall in the interval described by 34.6 ± 2(49.3), or from -64.0 to 133.2. The 
actual percentage of data values falling within this interval is 94%, well above the 
minimum of 75% predicted by the theorem. 

The Empirical Rule 

Although Chebyshev's theorem applies to any distribution, regardless of shape, 
there is a rule of thumb that applies only to distributions that are bell shaped and 
symmetrical, like part (a) of Figure 3.2 and part (a) of Figure 3.3. This rule is 
known as the empirical rule: 

THE EMPIRICAL RULE For distributions that are bell shaped and symmetrical: 

• About 68% of the observations will fall within 1 standard deviation of the 
mean. 

• About 95% of the observations will fall within 2 standard deviations of the 
mean. 

• Practically all of the observations will fall within 3 standard deviations of the 
mean. 

Let's consider part (a) of Figure 3.3 (page 68), which happens to be a bell- 
shaped, symmetrical distribution with a mean of 9.0 and a standard deviation of 
0.3. If the filling machine inserts an average of 9.0 ounces of cereal into the boxes, 
with a standard deviation of 0.3 ounces, then approximately 68% of the boxes 
will contain 9.0 ± 1(0.3) ounces of cereal, or between 8.7 and 9.3 ounces. Again 
applying the empirical rule, but using 2 standard deviations instead of 1, approx- 
imately 95% of the boxes will contain 9.0 ± 2(0.3) ounces of cereal, or between 
8.4 and 9.6 ounces. Practically all of the boxes will contain 9.0 ± 3(0.3) ounces 
of cereal, or between 8.1 and 9.9 ounces. 

Standardized Data 

In our discussion of Chebyshev's theorem, we used the standard deviation as a 
unit of measurement, or measuring stick, for expressing distance from the mean 
to a data value. Standardizing the data is based on this concept and involves 



80 



Part 1: Business Statistics: Introduction and Background 



TABLE 3.3 



Standardization of 
estimated highway fuel 
economy (mpg) data for 
a sample of five 2000 
sports-utility vehicles. 
Standardizing the data 
involves expressing each 
value in terms of its 
distance from the mean, 
in standard deviation units. 



Original 
Data 



Model 



Chevrolet Blazer 
Toyota 4Runner 
Honda Passport 
Lincoln Navigator 
Chevrolet Suburban 



Highway 
mpg (x ( ) 



Standardized 
Data 

(x, x) 



23.00 
21.00 
21.00 
17.00 
18.00 



3.00 
1.00 
1.00 
-3.00 
-2.00 



1.22 
0.41 
0.41 
-1.22 
-0.82 

0.00 

= Szj 



Original Data Standardized Data 



x = 20.00 mpg 


z= 0.000 


s x = 2.45 mpg 


s z = 1 .000 



expressing each data value in terms of its distance (in standard deviations) from 
the mean. For each observation in a sample: 

x ■ — x 

Zj = — where z t = standardized value for the z'th observation 

x = sample mean 
X: = the z'th data value 
s = sample standard deviation 

(When data represent a population, /jl replaces x, and a replaces s.) 

What we're really doing is expressing the data in different units. It's like refer- 
ring to a 6-footer as someone who is 2 yards tall. Instead of using feet, we're 
using yards. For example, the mpg estimates for the five sports-utility vehicles 
in Table 3.2 can be converted as shown in Table 3.3. After conversion, some stan- 
dardized values are positive, others are negative. This is because we're using 
the mean as our reference point, and some data values are higher than the mean, 
others lower. 

Standardized values have no units. This is true even though the original 
data, the standard deviation, and the mean may each represent units such as 
dollars, hours, or pounds. Because the data values and the standard deviation are 
in the same units, when one quantity is divided by the other, these units are in 
both the numerator and denominator, and the units cancel each other out. 

A standardized value that is either a large positive number or a large negative 
number is a relatively unusual occurrence. For example, the estimated 23 mpg for 
the Chevrolet Blazer was easily the highest data value for the five vehicles in the 
sample. A standardized distribution will always have a mean of 0 and a standard 
deviation of 1. The standard deviation will be either s or a, depending on whether 
the data represent a sample or a population. Standardizing the original data val- 
ues makes it easier to compare two sets of data in which the original measurement 
units are not the same. Computer Solutions 3.4 describes the procedures and 
shows the results when we use Excel and Minitab to standardize the data for the 
five sports-utility vehicles. 
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iuter solutions 3.4 



Standardizing the Data 
EXCEL 





A 


B 


C 


D 


E 


1 


MPG 


STDMPG 




Mean is: 


20.00 


2 


23 


1.22 




Std. Dev. is: 


2.45 


3 


21 


0.41 








4 


21 


0.41 








5 


17 


-1.22 








6 


18 


-0.82 









Open the data file CX03SUV.XLS. The data are entered in column A, as shown here. Type the labels Mean is: and Std. 
Dev. is: into cells D1 and D2, respectively. Enter the equation =AVERAGE(A2:A6) into E1. Enter the equation 
= STDEV(A2:A6) into E2. (Alternatively, you can find the values for the mean and standard deviation by using a 
procedure like that of Computer Solutions 3.1 to obtain values for all the descriptive statistics.) 
Enter the equation =(A2-20.00)/2.45 into B2. Click on cell B2. Click on the lower right corner of cell B2 and the cur- 
sor will turn to a " + " sign. Drag the " + " sign down to cell B6 to fill in the remaining standardized values. Enter the 
label into cell B1. 



MINITAB 

Data Display 



Row 


MPC 


STDMPC 


1 


23 


1.22474 


2 


21 


0.40825 


3 


21 


0.40825 


4 


17 


-1.22474 


5 


18 


-0.81650 



Descriptive Statistics: 

Variable N N* Mean 
MPC 5 0 20.00 

STDMPC 5 0 -0.000 



PC, STDMPC 

SE Mean StDev Minimum 

1.10 2.45 17.00 

0.447 1.000 -1.225 



Ql Median Q3 
17.50 21.00 22.00 
-1.021 0.408 0.816 



Variable Maximum 

MPC 23.00 
STDMPC 1.225 

1 . Open data file CX03SUV.MTW. The five fuel-economy values are in column C1. 

2. Click Calc. Click Standardize. Enter C1 into the Input column(s) box. Enter C2 into the Store results in box. Click 
to select Subtract mean and divide by std. dev. Click OK. 

3. The standardized values are now in C2, which we can title by entering STDMPG into the label cell above the values. 

4. Besides the original and standardized values, the printout shown here includes descriptive statistics for both. (Note: The 
mean and standard deviation of the standardized values are 0 and 1, respectively.) To get the printout: Click Edit. Click Com- 
mand Line Editor. Enter print C1 C2 into the first line. Enter describe C1 C2 into the second line. Click Submit 
Commands. 
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The Coefficient of Variation 



Expressing the standard deviation as a percentage of the mean, the coefficient of 
variation (CV) indicates the relative amount of dispersion in the data. This 
enables us to easily compare the dispersions of two sets of data that involve 
different measurement units (e.g., dollars versus tons) or differ substantially in 
magnitude. For example, the coefficient of variation can be used in comparing the 
variability in prices of diamonds versus prices of coal or in comparing the vari- 
ability in sales, inventory, or investment of large firms versus small firms. 

In comparing the variability of gold prices and slab zinc prices, for example, 
standard deviations would tell us that the price of gold varies a great deal more 
than the price of zinc. However, the dollar value of an ounce of gold is much 
higher than that of a pound of slab zinc, causing its variability to be exaggerated. 
Dividing each standard deviation by the mean of its data, we are better able to 
see how much each commodity varies relative to its own average price. The co- 
efficient of variation is expressed as a percentage and is calculated as follows: 



For a Population 



For a Sample 



Coefficient of Variation 



exercises 



CV = - X 1 00 



CV=- X 100 

X 



Suppose that, over a sample of 12 months, the mean price of gold had been 
x = $405.3 per ounce, with a standard deviation of s = $24.5 per ounce, while 
the mean and standard deviation for slab zinc during this same period had been 
$0.52 per pound and $0.04 per pound, respectively. Which commodity would 
have had the greater relative dispersion, or coefficient of variation? 

We calculate the coefficient of variation for each commodity: The CV for gold 
was s/x, or $24.5/$405.3 = 0.060, and the CV for slab zinc was $0.04/$0.52 = 
0.077. Expressed as percentages, the coefficients of variation would be 6.0% for gold 
and 7.7% for slab zinc. Although the standard deviation of monthly gold prices was 
more than 612 times the standard deviation of zinc prices, from a relative dispersion 
perspective, the variability of gold prices was actually less than that for slab zinc. 




3.28 A manufacturing firm has collected information on 
the number of defects produced each day for the past 50 
days. The data have been described by the accompanying 
Minitab box-and-whisker plot. 

a. What is the approximate value of the median? The 
first and third quartiles? 

b. What do the asterisks (*) at the right of the display 
indicate? What implications might they have for the 
production supervisor? 

c. Does the distribution appear to be symmetric? If not, 
is it positively skewed or negatively skewed? 

3.29 According to Chebyshev's theorem, what per- 
centage of observations should fall 

a. within 2.5 standard deviations of the mean? 

b. within 3.0 standard deviations of the mean? 

c. within 5.0 standard deviations of the mean? 




35 36 37 



T 

38 39 40 41 

DEFECTS 



r 

42 



43 44 
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3.30 For the data in Exercise 3.20, determine the 
percentage of observations that fall within k = 1.5 
standard deviation units from the mean. Do the results 
support Chebyshev's theorem? 

3.31 Standardize the data in Exercise 3.21, then identify 
the percentage of observations that fall within k = 2.0 
standard deviation units from the mean. Do the results 
support Chebyshev's theorem? 

3.32 The manufacturer of an extended-life lightbulb 
claims the bulb has an average life of 12,000 hours, with 
a standard deviation of 500 hours. If the distribution is 
bell shaped and symmetrical, what is the approximate 
percentage of these bulbs that will last 

a. between 11,000 and 13,000 hours? 

b. over 12,500 hours? 

c. less than 11,000 hours? 

d. between 11,500 and 13,000 hours? 

3.33 For college-bound high school seniors from a 
certain midwestern city, math scores on the Scholastic 
Aptitude Test (SAT) averaged 480, with a standard devia- 
tion of 100. Assume that the distribution of math scores 
is bell shaped and symmetrical. 

a. What is the approximate percentage of scores that 
were between 380 and 580? 

b. What is the approximate percentage of scores that 
were above 680? 

c. Charlie scored 580 on the math portion 

of the SAT. What is the approximate percentage 
of students who scored lower than Charlie? 

d. Betty scored 680 on the math portion of the 
SAT. What is the approximate percentage of 
students who scored higher than Charlie but 
lower than Betty? 

3.34 For data set A, the mean is $1235, with a standard 
deviation of $140. For data set B, the mean is 15.7 
inches, with a standard deviation of 1.87 inches. Which 
of these two data sets has the greater relative 
dispersion? 

3.35 A sample of 20 customers from Barnsboro National 
Bank reveals an average savings account balance of $315, 



with a standard deviation of $87. A sample of 25 
customers from Wellington Savings and Loan reveals an 
average savings account balance of $8350, with a 
standard deviation of $1800. Which of these samples has 
the greater relative dispersion? 

/ data set/ Note: Exercises 3.36-3.38 require a 
computer and statistical software. 

3.36 A Polk Company survey found that households with 
someone who frequently plays golf have a median income 
of $54,826, compared to a median of $37,354 for all 
households. Under the assumption that the golfing house- 
holds surveyed by the company could have been 
represented by the 100 incomes in file XR03036: 

source: "Golfers in the Green," USA Today, June 14, 1999, p. 1C. 

a. Construct and interpret the box-and-whisker plot for 
the data. 

b. Standardize the data, then determine the mean and 
standard deviation of the standardized values. 

3.37 In Exercise 3.14, the personnel director has 
collected data describing 100 employees, with one of the 
variables being the number of absences during the past 
year. Using data file XR03014, describe the variable, 
number of absences. 

a. Construct and interpret the box-and-whisker plot for 
the number of absences. 

b. Standardize the data, then determine the mean and 
standard deviation of the standardized values. 

3.38 Exercise 3.25 described a study in which 
e-commerce researchers visited Wal-Mart.com on 
multiple occasions during the holiday shopping season 
and measured the time (in seconds) it took for the site to 
come up on the computer screen. Data file XR03025 
contains results that we assumed could have been 
obtained during this study. 

a. Construct and interpret the box-and-whisker plot 
for the times. 

b. Standardize the data, then determine the mean and 
standard deviation of the standardized values. 



DESCRIPTIVE STATISTICS FROM GROUPED DATA 3.5 



The frequency distribution, also referred to as grouped data, is a convenient 
summary of raw data, but it loses some of the information originally contained in 
the data. As a result, measures of central tendency and dispersion determined 
from the frequency distribution will be only approximations of the actual values. 
However, since the original data may not always be available to us, these approx- 
imations are sometimes the best we can do. Although other descriptors can be 
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TABLE 3.4 



The formulas in the text are 
used to calculate the 
approximate mean and 
standard deviation based 
only on a frequency 
distribution for the usage 
times of the 50 automated 
teller machine (ATM) 
customers in data file 
CX02ATM. 



Time (seconds) 


(1) 
Midpoint 

( m i) 


(2) 

Frequency 
( f i) 


(3) 
f i m i 


(4) 

mf 


(5) 
f,m? 


20-under 25 


22.5 


1 


22.5 


506.25 


506.25 


25-under 30 


27.5 


7 


192.5 


756.25 


5,293.75 


30-under35 


32.5 


10 


325.0 


1056.25 


10,562.50 


35-under40 


37.5 


9 


337.5 


1406.25 


12,656.25 


40-under 45 


42.5 


9 


382.5 


1806.25 


16,256.25 


45-under 50 


47.5 


6 


285.0 


2256.25 


13,537.50 


50-under 55 


52.5 


5 


262.5 


2756.25 


13,781.25 


55-under 60 


57.5 


3 


172.5 


3306.25 


9,918.75 






50 


1980.0 




82,512.50 






= 2 f f = n 


= 2f,m, 






Approximate mean: 


_ 2 fjirij 
n 


1980.0 
50 ^ 




Approximate variance: 




nx 2 82,512.50 - 50(39.6) 2 


n - 1 




50 - 1 








82,512.50 - 78,408 


4104.5 


83.77 






50 


- 1 


49 



Approximate standard deviation: s = V83.77 = 9.15 



approximated from the frequency distribution, we will concentrate here on the 
mean and the standard deviation. Techniques for their approximation are sum- 
marized in Table 3.4 and explained subsequently. In this section, we'll assume 
that all we know about the usage times for the 50 automated teller machine 
(ATM) customers in data file CX02ATM is the frequency distribution shown in the 
first three columns of Table 3.4. We will approximate the mean and standard de- 
viation, then compare them with the actual values (which happen to be x = 39.76 
seconds and s = 8.92 seconds). 

Arithmetic Mean from Grouped Data 

When approximating the mean from a frequency distribution, we consider each 
observation to be located at the midpoint of its class. As an example, for the 
30-under 35 class, all 10 persons in this category are assumed to have a time of 
exactly 32.5 seconds, the midpoint of the class. Using this and other midpoints, 
we can find the approximate value of the mean. 

Approximate value of the sample mean, from grouped data: 

x = — - — - where x = approximate sample mean 
n fj = frequency for class i 

m j = midpoint for class /' 
n = number of data values in the sample 

(If data are a population, /jl replaces x and N replaces n.) 
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The formula assumes that each midpoint occurs f- times — e.g., that 22.5 sec- 
onds occurs 1 time, 27.5 seconds occurs 7 times, and so on. Applying this formula 
as shown in Table 3.4, we calculate the mean as approximately 39.6 seconds. 
Again, this is an approximation — the actual value, computed from the original 
data, is 39.76 seconds. (Note: If each data value really is at the midpoint of its 
class, the calculated mean, variance, and standard deviation will be exact values, 
not approximations.) 

Variance and Standard Deviation from 
Grouped Data 

Variance 

As in our previous discussion of the variance, its calculation here will depend on 
whether the data constitute a population or a sample. The previously determined 
approximate value for the mean (in this case, estimated as x = 39.6) is also used. 
When computing the variance from grouped data, one of the following formulas 
will apply: 

Approximate value of the variance from grouped data: 

• If the data represent a population: 

^2 _ ^fi m i ~ where /j, = approximate population mean 

N f = frequency for class i 

m i = midpoint for class i 
N = number of data values in the 
population 

• If the data represent a sample: 

s 2 _ ^fi m t nx where x = approximate sample mean 

n — 1 f. = frequency for class i 

m i = midpoint for class i 
n = number of data values in the 
sample 

Calculation of the approximate population variance for the ATM customer 
data is shown in Table 3.4 and results in an estimate of 83.77 for the value of s 2 . 
This differs from the s 2 = 79.49 reported by Excel in Computer Solutions 3.1, but 
remember that our calculation here is an approximation. If the original data are 
not available, such approximation may be our only choice. 

Standard Deviation 

The approximate standard deviation from grouped data is the positive square 
root of the variance. This is true for both a population and a sample. As Table 3.4 
shows, the approximate value of s is the square root of 83.77, or 9.15 seconds. 
Like the variance, this compares favorably with the value that was based on the 
original data. 
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3.39 Using the data in Exercise 3.5, construct a frequency 
distribution. 

a. Determine the approximate values of the mean and 
standard deviation. 

b. Compare the approximate values from part (a) with 
the actual values. 

c. Construct a frequency distribution with twice as 
many classes as before, then repeat parts (a) and (b). 
Have the approximations improved? 

d. If you were to construct a "frequency distribution" 
in which each data value was at the midpoint of its 
own class, what values do you think the "approxima- 
tions" would have? Explain. 

3.40 A sample consisting of 100 employees has been given 
a manual-dexterity test. Given the accompanying frequency 
distribution, determine the approximate mean and standard 
deviation for these data. 

3.41 Eighty packages have been randomly selected from a 
frozen food warehouse, and the age (in weeks) of each pack- 
age is identified. Given the frequency distribution shown, 
determine the approximate mean and standard deviation for 
the ages of the packages in the warehouse inventory. 



Data for Exercise 3.40 



Score 



Number of Persons 



5-under 1 5 
1 5-under 25 
25-under 35 
35-under 45 
45-under 55 
55-under 65 
65-under 75 
75-under85 
85-under95 
95-under 105 

Data for Exercise 3.41 
Age (Weeks) 

0-under 10 
10-under20 
20-under30 
30-under 40 
40-under 50 
50-under 60 



7 
9 
12 
14 
13 
9 
8 
1 1 
10 
7 



Number of Packages 

25 
17 
15 



3 . 6 STATISTICAL MEASURES OF ASSOCIATION 



In Chapter 2, we introduced the scatter diagram as a method for visually repre- 
senting the relationship between two quantitative variables, and we also showed 
how statistical software can be used to fit a linear equation to the points in 
the scatter diagram. This section introduces a way to numerically measure the 
strength of the linear relationship between two variables — the coefficient of 
correlation. 

The coefficient of correlation (r, with -1 <r< +1) is a number that indi- 
cates both the direction and the strength of the linear relationship between the 
dependent variable (y) and the independent variable (x): 

• Direction of the relationship If r is positive, y and x are directly related, as in 
parts (a) and (c) of Figure 3.5. If r is negative, the variables are inversely 
related, as in parts (b) and (d) of Figure 3.5. 

• Strength of the relationship The larger the absolute value of r, the stronger 
the linear relationship between y and x. If r = — 1 or r = +1, the best-fit linear 
equation will actually include all of the data points. This is the case in parts 
(a) and (b) of Figure 3.5. However, in parts (c) and (d), the respective equa- 
tions are less than perfect in fitting the data, resulting in absolute values of r 
that are less than 1. 
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FIGURE 3.5 



The stronger the linear rela- 
tionship between the vari- 
ables, the greater will be 
the absolute value of the 
correlation coefficient, r. 
When r = 0, the slope of 
the best-fit linear equation 
will also be 0. 

x 

(a) Perfect positive correlation (b) Perfect negative correlation 




x I x 

(c) Strong positive correlation (d) Weak negative correlation 



y r=0.00 y 



l x 

(e) No correlation (f) No correlation 




r = u.UU 



• • • • 



In parts (e) and (f) of Figure 3.5, the slope is zero, r is zero, and there is no lin- 
ear relationship whatsoever between y and x. In each case, the value of y doesn't 
depend on the value of x. At the end of the chapter, Seeing Statistics Applet 2 can 
be used to demonstrate how the value of r changes as the pattern of observations 
is changed. 

In Chapter 15, Simple Linear Regression and Correlation, we will examine 
the coefficient of correlation and its method of calculation in greater detail. For 
now, it's important that you simply be aware of this important statistical measure 
of association. In this chapter, we'll rely on statistical software to do the calcula- 
tions for us. 
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Another measure of the strength of the linear relationship is the coefficient of 
determination, r 2 . Its numerical value is the proportion of the variation in y that 
is explained by the best-fit linear equation. For example, in parts (a) and (b) of 
Figure 3.5, r 1 is 1.00 and 100% of the variation in y is explained by the linear 
equation describing y as a function of x. On the other hand, in part (c), the equa- 
tion does not fully explain the variation in y — in this case, r 2 = 0.94 2 = 0.884, 
and 88.4% of the variation in y is explained by the equation. As with the coeffi- 
cient of correlation, it will be covered in more detail in Chapter 15. A final note: 
It doesn't matter which variable is identified as y and which one as x — the values 
for the coefficients of correlation and determination will not change. 

Computer Solutions 3.5 describes the procedures and shows the results when 
we use Excel and Minitab to find the coefficient of correlation between a baseball 
team's payroll and the number of games they won during the season. In Chapter 2, 
Computer Solutions 2.7 showed the Excel and Minitab scatter diagrams and best- 
fit linear equations for the data, which are in file CX02BASE. 



'ikh computer solutions 3.5 










* 



Coefficient of Correlation 
EXCEL 

The accompanying printout includes the scatter diagram and best-fit linear equation from Computer Solutions 2.7. It also 
shows the results for each of two options for determining the coefficient of correlation. 





A 


B 


C 


D 


E 


F 


G 


H 


I 


1 


Team 


Payroll 


Wins 
















2 


New York Yankees 


88.237 


98 




Wins vs. Payroll y = 0.3287X + 64.368 




3 


Boston Red Sox 


62.708 


94 






R 2 = 0.3262 




4 


Toronto Blue Jays 


37.825 


84 




10C 

| 8C 

7C 
6C 


- 






5 


Baltimore Orioles 


81.942 


78 




- 
- 




6 


Tampa Bay Devil Rays 


36.365 


69 




7 


Cleveland Indians 


70.734 


97 




- 




8 


Chicago White Sox 


26.305 


75 








9 


Detroit Tigers 


35.907 


69 




- 


— *^ ♦ — ♦-♦ 

♦ ♦ 


10 


Kansas City Royals 


27.140 


64 




- 




11 


Minnesota Twins 


21.363 


63 






' 1 1 1 1 1 


12 


Texas Rangers 


96.923 


95 
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Payroll (Smillions) 


13 


Oakland Athletics 


25.446 


87 




14 


Seattle Mariners 


52.821 


79 




15 


Anaheim Angels 


54.866 


70 
















16 


Atlanta Braves 


75.718 


103 














17 


New York Mets 


66.205 


97 






Payroll 


Wins 






18 


Philadelphia Phillies 


33.683 


77 




Payroll 


1 








19 


Montreal Expos 


17.617 


68 




Wins 


0.571 


1 







Excel Option 1, to simply obtain the coefficient of correlation: 

1 . Open the data file CX02BASE.XLS. The labels and data are already entered, as shown in the printout. Click on B1 and 
drag to C31 to select cells B1:C31. Click Tools. Click Data Analysis. 

2. In the Analysis Tools box, click Correlation. Click OK. Enter B1:C31 into the Data range box. In the Grouped By 
section, click next to Columns. Click to place a check mark next to Labels in First Row. Enter E17 into the Output 
Range box. Click OK. 



(continued) 
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Excel Option 2, to display r 2 on an existing chart like the one from Computer Solutions 2.7: 

1 . Right-click anywhere on the best-fit line in the chart. Click Format Trendline. 

2. Click Options. Click to place a check mark next to Display R-squared value on chart. Click OK. 



MINITAB 

Minitab Option 1, to simply determine the coefficient of correlation: 
Correlations: Payroll, Wins 

Pearson correlation of Payroll and Wins = 0.571 

1 . Open the data file CX02BASE.MTW. The labels Payroll and Wins have already been entered at the top of columns C2 
and C3, respectively. The payroll values and the win totals have been entered into C2 and C3, respectively. 

2. Click Stat. Select Basic Statistics. Click Correlation. Enter C2 and C3 into the Variables box. Click OK. 

Minitab Option 2, to display r 2 along with a scatter diagram and best-fit linear equation: 

Generate the Minitab scatter diagram and best-fit linear equation as in the Minitab portion of Computer Solutions 2.7 
(not shown here). The chart automatically includes "R-Sq = 32.6%" at the top. 



Both software packages report the coefficient of correlation as r = +0.571. If 
we have also generated a scatter diagram and best-fit linear equation, as in Com- 
puter Solutions 2.7, the r 1 value (0.326, or 32.6%) is displayed either automati- 
cally (Minitab) or as an option (Excel). Interpreting the results, we conclude that 
there is a positive relationship between a team's payroll and the number of games 
it won during the season. Further, the size of the payroll explains 32.6% of the 
variation in the number of wins. From the opposite perspective, (100.0% — 
32.6%) or 67.4% of the variation in wins is not explained by payroll. This makes 
sense, as there are probably a great many other variables — for example, level of 
fan support, quality of coaching, player injuries, strength of schedule — that also 
influence the number of games a team wins. 




3.42 What is the coefficient of determination and what does 
it tell us about the relationship between two variables? 

3.43 For a set of data, r 1 is 0.64 and the variables x and 
y are inversely related. What is the numerical value of r? 

/ data SEt / Note: Exercises 3.44-3.47 require a 
computer and statistical software. 

3.44 In Exercise 3.14, two of the variables examined by the 
personnel director were age and number of absences dur- 
ing the past year. Using the data in file XR03014, generate 
a scatter diagram (with y = number of absences and x = 
age), fit a linear equation to the data, then determine and 
interpret the coefficients of correlation and determination. 



3.45 In the U.S. News & World Report top-50 U.S. law 
schools in Exercise 2.67, the listing for each school included 
two rating scores: one from academic evaluators and 
another from lawyer/judge evaluators. Using the data in file 
XR02067, generate a scatter diagram with axes of your 
choice, fit a linear equation to the data, then determine and 
interpret the coefficients of correlation and determination. 

3.46 The American Association of Retired Persons 
(AARP) has compiled data on death rates (deaths per 
1000 people) from heart disease and cancer for the 

50 states. The states and their respective death rates from 
these two diseases are provided in data file XR03046. 
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Generate a scatter diagram with axes of your choice, fit a 
linear equation to the data, then determine and interpret 
the coefficients of correlation and determination, source: 
Robert Davis, "Regional Health Disparities Reflect Range of Care Options," 
USA Today, February 16, 2000, p. 10D. 

3.47 A report on the average price of a sample of brand- 
name prescription drugs with their generic equivalents 
has found the generics to be considerably cheaper. The 



drugs and the average prices for their brand-name versus 
generic versions are listed in data file XR03047. Generate a 
scatter diagram with axes of your choice, fit a linear 
equation to the data, then determine and interpret the 
coefficients of correlation and determination, source: Julie 
Appleby, "Drug Makers Fight Back As Patents Near Expiration," USA Today, 
November 26, 1999, p. 4B. 



• Measures of central tendency and dispersion 

In addition to summarization by frequency distributions and related visual 
techniques, raw data may be statistically described through the use of measures of 
central tendency and dispersion. As with the frequency distribution, this process 
loses some of the detail in the original data, but offers the benefit of replacing 
voluminous raw data with just a few numerical descriptors. 

Measures of central tendency describe typical values in the data. Four of the 
most common are the arithmetic mean, weighted mean, median, and mode. Disper- 
sion measures describe the scatter of the data, or how widely the values tend to be 
spread out. Common measures of dispersion include the range, quantiles, mean 
absolute deviation, variance, and standard deviation. The midrange is the average 
of the lowest and highest values and is sometimes used as a very rough measure of 
central tendency. 

Chebyshev's theorem applies to any distribution, regardless of shape, and spec- 
ifies a minimum percentage of observations that will fall within a given number of 
standard deviations from the mean. For distributions that are bell shaped and sym- 
metrical, the empirical rule expresses the approximate percentage of observations 
that will fall within a given number of standard deviations from the mean. 

• Conversion of data to standardized values 

Standardization of the data involves expressing each value in terms of its distance 
from the mean, in standard deviation units. The resulting standardized distribu- 
tion will have a mean of 0 and a standard deviation of 1. The coefficient of vari- 
ation expresses the standard deviation as a percentage of the mean and is useful 
for comparing the relative dispersion for different distributions. 

• Box-and-whisker plot and estimation from grouped data 

Measures of central tendency and dispersion are part of the output whenever 
computer statistical packages have been used to analyze the data. In addition, the 
statistical package is likely to offer further description in the form of information 
such as the trimmed mean and the box-and-whisker plot. The approximate value 
of measures of central tendency and dispersion may be calculated from the 
grouped data of the frequency distribution. 

• The coefficient of correlation 

The coefficient of correlation (r) is an important measure of the strength of the 
linear relationship between two quantitative variables. The coefficient of correla- 
tion is between —1 and +1, and the larger the absolute value of r, the stronger the 
linear relationship between the variables. The coefficient of determination (r 2 ) is 
the proportion of the variation in y that is explained by the best-fit linear equa- 
tion, and r 2 is always s 1. 
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Arithmetic Mean 

• Population mean: 

2x," 



= — — - where /x = population mean 

Xj = the z'th data value in the population 
2 = the sum of 

N = number of data values in the population 



Sample 



n 



where x = sample mean 

Xj = the ?'th data value in the sample 
2 = the sum of 

n = number of data values in the sample 



Weighted Mean 

Weighted mean, /jl w (for a population) or je (for a sample 



V w or x u 



where w { = weight assigned to the z'th data value 
x- = the «th data value 



Range 



• Range = Largest data value — Smallest data value 

• Midrange = (Largest data value + Smallest data value)/2 

Quartiles 

In a set of N values arranged from lowest to highest: 

• First quartile, Q 1 = Data value at position - — - — - 

• Second quartile (the median), Q 2 = Data value at position 



2(N + 1) 



Third quartile, Q, = Data value at position 



3(N + 1) 



, 3 . 4 

(Use N if data represent a population, n for a sample.) 

Interquartile Range and Quartile Deviation 

• Interquartile range: third quartile minus first quartile, or Q, — 

• Quartile deviation: one-half the interquartile range, or (Q 3 — Qj)/2 

Mean Absolute Deviation (MAD) 

Mean absolute deviation for a population: 



MAD 



N 



where (jl = population mean 
Xj = the ith data value 

N = number of data values in the population 



(To calculate MAD for a sample, substitute n for N and x for /x.) 
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Variance 

• Variance for a population: 

<j = — where a = population variance 

M = population mean 
x i = the zth data value 

N = number of data values in the population 

• Variance for a sample: 

-\ ^i(Xj x) 2 



1 



Standard Deviation 



where s 2 = sample variance 
x = sample mean 
x = the zth data value 
n = number of data values in the sample 



For a Population For a Sample 
a = \Tir 2 s = V? 



Residual 



Residual for a data _ _ 

value in a sample " X > " X where x = sam P le mean 

x, = value of the zth observation 



Chebyshev's Theorem 

For either a sample or a population, the percentage of observations that fall within k (for 
k >1) standard deviations of the mean will be at least 

1 - A X 100 

\ k L \ 

The Empirical Rule 

For distributions that are bell shaped and symmetrical: 

• About 68% of the observations will fall within 1 standard deviation of 
the mean. 

• About 95% of the observations will fall within 2 standard deviations of the mean. 

• Practically all of the observations will fall within 3 standard deviations of the mean. 

Standardized Data 

x — x 

Zj = — where z, = standardized value for the zth observation 

x = sample mean 

x- = the zth data value 

s = sample standard deviation 
(When data represent a population, /jl replaces x and cr replaces s.) 

Coefficient of Variation (CV) 

For a Population For a Sample 

CV = - X 100 CV = = X 100 

fx, x 
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Conversion of Computer-Output Standard Deviation Based on {N — 1) Divisor 
to a Population Standard Deviation Based on (N) Divisor 

Standard deviation Standard deviation * IN — 1 

based on (N) divisor based on (N — 1) divisor y N 



Mean and Variance from Grouped Data 

• Approximate value of the mean from grouped data: 



N 



where /u, = approximate population mean 
fj = frequency for class i 
m. = midpoint for class i 
N = number of data values in the population 

(If data are a sample, x replaces /x and n replaces N.) 

Approximate value of the variance from grouped data: 
If data represent a population: 



a 2 ^ 



Ifitnj - Nfx 2 
N 



where /x = approximate population mean 
fj = frequency for class i 
m- = midpoint for class i 
N = number of data values in the population 



If data represent a sample: 

?2 



1 



where x = approximate sample mean 
fj = frequency for class i 
nij = midpoint for class % 
n = number of data values in the sample 




Note: For many of the Exercises 3.48-3.64, a computer 
and statistical software will be desirable and useful. 
However, any necessary calculations can also be done 
with the aid of a pocket calculator. For readers using 
statistical software, keep in mind the file naming key — 
for example, the data for Exercise 3.57 will be in data 
file XR03057. 

3.48 The first seven customers of the day at a small donut 
shop have checks of $1.25, $2.36, $2.50, $2.15, $4.55, 
$1.10, and $0.95, respectively. Based on the number of 
customers served each day, the manager of the shop claims 
that the shop needs an average check of $1.75 per person 
to stay profitable. Given her contention, has the shop made 
a profit in serving the first seven customers? 

3.49 A dental supplies distributor ships a customer 50 
boxes of product A, 30 boxes of B, 60 boxes of C, and 
20 boxes of D. The unit shipping costs (dollars per box) 



for the four products are $5, $2, $4, and $10, 
respectively. What is the weighted mean for shipping 
cost per unit? 

3.50 During fiscal year 1986, McDonnell Douglas 
Corporation was second only to Lockheed Missiles & 
Space Company in net value of U.S. Department of 
Defense prime contract awards over $25,000 for 
research, development, test, and evaluation. The work 
awarded to McDonnell Douglas was done in 20 
locations, with a breakdown (millions of dollars) as 
follows: source: U.S. Department of Defense, 500 Contractors Receiv 
ing the Largest Volume of Prime Contract Awards for Research, Develop- 
ment, Test, and Evaluation (1986), p. 13. 



$0.7 


4.1 


0.3 


1 1.2 


0.2 


103.8 


0.3 


434.6 


0.1 


0.7 


1.1 


2.6 


4.9 


0.1 


6.2 


354.7 


0.2 


1.3 


0.7 


0.2 
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a. Determine the mean, median, and mode for these data. 

b. In this case, does the mode appear to be a good mea- 
sure of the typical amount of work that was awarded 
to a McDonnell division during that year? Why or 
why not? 

3.51 According to Honda Motor Co., Inc., the exchange 
rate (yen per U.S. dollar) from 1998 through 2005 was 
123, 128, 112, 111, 125, 122, 113, and 108. Determine 
the mean and median for these data. Is there a mode? 

If so, what is its numerical value? source: Honda Motor 
Co., Ltd., 2005 Annual Report. 

3.52 Natural History magazine has published a listing of 
the maximum speeds (in mph) for a wide variety of 
animals, including those shown in the table. 

source: The World Almanac and Book of Facts 2006, p. 307. 



Cheetah 


70 


Grizzly bear 


30 


Pronghorn 




Domestic cat 


30 


antelope 


61 


Human 


27.89 


Wildebeest 


50 


Elephant 


25 


Lion 


50 


Black mamba 




Coyote 


43 


snake 


20 


Mongolian 




Squirrel 


12 


wild ass 


40 


Spider 


1.17 


Giraffe 


32 


Giant tortoise 


0.17 


Wart hog 


30 


Garden snail 


0.03 


White-tailed deer 


30 







a. Determine the mean and the median for these data. 

b. Is there a mode? If so, what is its numerical value? 

3.53 The frequency distribution for population density 
(persons per square mile) for the 50 U.S. states is as 
follows: source: Bureau of the Census, Statistical Abstract of the 
United States 2002, p. 23. 

Population Density Number of States 



0-under 100 


27 


100-under200 


1 1 


200-under 300 


4 


300-under 400 


1 


400-under 500 


2 


500-under 600 


1 


600-under 700 


0 


700-under 800 


1 


800-under 900 


1 


900-under 1000 


0 


1000-under 1 100 


1 


1100-under 1200 


1 



Does this distribution appear to be symmetrical? If not, is 
it skewed positively or negatively? 

3.54 The cafeteria manager at a manufacturing plant has 
kept track of the number of cups of coffee purchased 
during each of the past 90 working days. 



a. Using the following descriptive statistics, construct 
a box-and-whisker plot for the data. Does the 
distribution appear to be symmetrical? If not, is it 
positively skewed or negatively skewed? 

N Mean Median Tr Mean StDev SE Mean 
CUPS 90 125.64 123.50 125.50 30.80 3.25 

Min Max Ql Q3 

CUPS 54.00 206.00 102.75 147.25 

b. If this distribution of daily coffee totals is bell 
shaped and symmetrical, on approximately what 
percent of the days were 64 or fewer cups of coffee 
purchased? 

3.55 A grocery store owner has found that a sample 

of customers purchased an average of 3.0 pounds of lun- 
cheon meats in the past week, with a sample standard 
deviation of 0.5 lb. 

a. If the store's meat scale is off by 0.1 lb (e.g., a purchase 
listed as 1 lb actually weighs 1.1 lb), what will be the 
values of the sample mean and standard deviation after 
the owner corrects his data for the weighing error? 

b. Using the mean and standard deviation you calculated 
in part (a), and assuming the distribution of purchase 
amounts was bell shaped and symmetrical, what 
luncheon meat purchase amount would have been 
exceeded by only 2.5% of the customers last week? 

3.56 For a sample of 5 different years from the period 
1960 through 1998, it is found that U.S. work stoppages 
involving at least 1000 workers occurred 268 times in 
one of these years, with 424, 235, 145, and 44 work 
Stoppages in the Other four, source: Bureau of the Census, Statis- 
tical Abstract of the United States 2002, p. 452. 

a. Determine the mean, median, range, and midrange. 

b. Calculate the mean absolute deviation. 

c. Calculate the standard deviation and variance. 

3.57 A quality control supervisor has taken a sample of 
16 bolts from the output of a thread-cutting machine and 
tested their tensile strengths. The results, in tons of force 
required for breakage, are as follows: 

2.20 1.95 2.15 2.08 1.85 1.92 
2.23 2.19 1.98 2.07 2.24 2.31 
1.96 2.30 2.27 1.89 

a. Determine the mean, median, range, and midrange. 

b. Calculate the mean absolute deviation. 

c. Calculate the standard deviation and variance. 

3.58 The accompanying box-and-whisker plot represents 
the number of gallons of water used by 80 households 
over a 1-day period. Determine the approximate values 
for the median, the first and third quartiles, and the 
range. Does the distribution appear to be skewed? If so, 
is it positively skewed or negatively skewed? 
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3.59 A testing firm has measured the power consumption 
of 40 dorm-size microwave ovens selected at random from 
the manufacturer's production line. Given the following 
box-and-whisker plot, determine the approximate values 
for the median, the first and third quartiles, and the range. 
Does the distribution appeared to be skewed? If so, is it 
positively skewed or negatively skewed? 



110 



T 



120 
WATTS 



130 



3.60 A law enforcement agency, administering breathalyzer 
tests to a sample of drivers stopped at a New Year's Eve 
roadblock, measured the following blood alcohol levels for 
the 25 drivers who were stopped: 



0.00% 


0.08% 


0.15% 


0.18% 


0.02% 


0.04 


0.00 


0.03 


0.1 1 


0.17 


0.05 


0.21 


0.01 


0.10 


0.19 


0.00 


0.09 


0.05 


0.03 


0.00 


0.03 


0.00 


0.16 


0.04 


0.10 



Calculate the mean and standard deviation for this 
sample. 

Use Chebyshev's theorem to determine the minimum 
percentage of observations that should fall within 



k = 1.50 standard deviation units of the mean. Do 
the sample results support the theorem? 
c. Calculate the coefficient of variation for these data. 

3.61 Use the coefficient of variation to compare the vari- 
ability of the data in Exercise 3.57 with the variability of 
the data in Exercise 3.60. 

3.62 Using the frequency distribution in Exercise 3.53, 
determine the approximate mean and standard deviation 
for the underlying data. 

3.63 The following dotplot describes the lengths, in pages, 
of a sample consisting of 20 reports generated by a consult- 
ing firm during the past 3 months. Based on the dotplot, 
what are the values for the median and the first and third 
quartiles? 



10 



20 



30 



40 



- PAGES 
50 



3.64 For the sample data presented in the dotplot of Exer- 
cise 3.63, construct a frequency distribution in which the 
classes are 10-under 20, 20-under 30, and 30-under 40. If 
this frequency distribution were all that we knew about the 
underlying data, what approximate values would be 
estimated for the mean and standard deviation of these 
data? 

/ data set/ Note: Exercises 3.65-3.69 require a 
computer and statistical software. 

3.65 According to the Energy Information Administration, 
the average U.S. household spends $1338 per year for util- 
ities (including heat, electricity, cooling, and water). The 
averages by region were as follows: Northeast ($1644), 
Midwest ($1396), South ($1328), and West ($1014). 
Assume that the findings for the Northeast could have 
been based on the utility expenditures reported by 250 
households, with the data in file XR03065. source: Anne R. 
Carey and Quin Tian, "Who Pays Most for Utilities?" USA Today, March 29, 
2000, p. IB. 

a. Confirm the mean and determine the median and the 
standard deviation of the utility expenditures. 

b. Generate and interpret the box-and-whisker plot for 
the households from this region. 

c. From the results in parts (a) and (b), do there appear to 
be any "outlier" households? If so, for what reason(s) 
might energy-conservation officials wish to examine 
the habits or characteristics of these households? 

3.66 Bride's magazine reports the cost of the average 
honeymoon as $3657. Assume that their findings could 
have been based on the honeymoon expenditures of a 
sample consisting of 300 couples whose costs are listed in 
data file XR03066. source: Patti Hinzman and Suzy Parker, "Some 
Like It Hot," 7999 USA Snapshot Calendar for July 21 . 
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a. Confirm the mean and determine the median and the 
standard deviation of the honeymoon expenditures 
for this sample. 

b. Generate and interpret the box-and- whisker plot for 
the couples in the sample. 

c. From the results in parts (a) and (b), do there appear 
to be any "outlier" couples who have spent an unusu- 
ally large amount of money on their honeymoon? If 
so, what kinds of companies or organizations might 
have an interest in finding out how to identify such 
couples before they've made their honeymoon plans? 

3.67 During 1999, 1,220,130 college-bound high school 
seniors took the Scholastic Aptitude Test (SAT). The aver- 
age score on the mathematics component was 511, with a 
standard deviation of 114. We are assuming that the SAT- 
math scores in data file XR03067 could have been the math 
scores for a sample of 400 college-bound seniors who took 
the SAT that year. (An important note: The mean and the 
standard deviation for our sample are close, but not exactly 
the same as the mean and standard deviation of the popula- 
tion from which the sample was obtained — this is a key 
concept that is central to Chapter 8, Sampling 
Distributions.) Based on the assumed sample data in file 
XR03067, source: The College Board, 1999 College-Bound 
Seniors, p. 7. 

a. Determine the mean, the median, and the standard 
deviation of the math scores in the sample. 

b. Generate and interpret the box-and- whisker plot for 
the math scores in the sample. 
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c. What math score would a test-taker have to achieve 
to be higher than 75% of the sample members? To 
be higher than 25% of the sample members? 

3.68 Data for a sample of college players eligible to enter 
the National Football League include weight (pounds) 
and time (seconds) required to sprint 40 yards. Using the 
data listed in file XR03068, and with weight as the x 
(independent) variable and time as the y (dependent) 
variable, generate a scatter diagram, fit a linear equation 
to the diagram, then determine and interpret the 
coefficients of correlation and determination. 

source: "Rating the Prospects," USA Today, April 14, 2000, p. 19C. 

3.69 During the 1994-1998 period, the Federal Aviation 
Administration took administrative actions (letters of cor- 
rection or warning notices) and levied fines against 

U.S. airline companies for various safety-related reasons. 
For each airline involved, data file XR03069 lists the num- 
ber of administrative actions taken toward the airline 
along with the total amount of fines (in millions of 
dollars) levied against the airline during this time period. 
Using number of administrative actions as the x (indepen- 
dent) variable and fines as the y (dependent) variable, 
generate a scatter diagram, fit a linear equation to the 
diagram, then determine and interpret the coefficients 
of correlation and determination, source: "Fines and 
Administrative Actions Against U.S. Airlines," USA Today, March 13, 
2000, p. 3B. 



cases 




Thorndike Sports Equipment 

When we last visited the Thorndikes, one of the company's 
engineers had just developed a new golf ball. According to 
the engineer, the new ball incorporates some changes that 
may enable it to travel farther than conventional golf balls. 
The engineer, an avid golfer herself, is very excited about the 
prospect that she may have developed an innovation that 
could revolutionize the game of golf. Old Luke says he will 
be happy only if the new ball revolutionizes his company's 
net profits. 

Some testing has already been done. As reported in the 
Chapter 2 episode of the Thorndikes, Ted set up a test in 
which he mixed 25 of the new balls with 25 of the old type, 
then had a golf pro hit all 50 of them at a driving range. All 
50 distances were measured and are listed in data file 
THORN02. For reader convenience, they are also repeated 



here. Ted has examined the visual descriptions of the data 
carried out in Chapter 2, but he would like to have some sta- 
tistical summaries as well. 

1. For the 25 drives made with the new balls, calculate 
statistics that you think would be appropriate in describ- 
ing the central tendency and the dispersion of the data. 

2. Repeat part 1 for the 25 drives made with the conven- 
tional golf balls. 

3. Ted wants to write a one-paragraph memo to Luke 
describing the results as they are reflected in parts 1 and 
2, but he had to fly to New York this morning. Help Ted 
by writing a first draft for him. Keep in mind that Luke 
appreciates plain English, so try to use your own words 
whenever possible. 



Chapter 3: Statistical Description of Data 
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25 Drives with New Ball (Yards): 



267.5 248.3 265.1 243.6 253.6 232.7 249.2 232.3 252.8 

247.2 237.4 223.7 260.4 269.6 256.5 271.4 294.1 256.3 

264.3 224.4 239.9 233.5 278.7 226.9 258.8 



25 Drives with Conventional Ball (Yards): 



241.8 255.1 266.8 251.6 233.2 242.7 218.5 229.0 256.3 
264.2 237.3 253.2 215.8 226.4 201.0 201.6 244.1 213.5 

267.9 240.3 247.9 257.6 234.5 234.7 215.9 



Springdale Shopping Survey 



This exercise focuses on the Springdale Shopping Survey, 
the setting and data collection instrument for which were 
provided at the end of Chapter 2. The data are in the com- 
puter file SHOPPING. 

We will concentrate on variables 18-25, which reflect 
how important each of eight different attributes is in the re- 
spondent's selection of a shopping area. Each of these vari- 
ables has been measured on a scale of 1 (the attribute is not 
very important in choosing a shopping area) to 7 (the attri- 
bute is very important in choosing a shopping area). The 
attributes being rated for importance are listed below. Exam- 
ining the relative importance customers place on these attri- 
butes can help a manager "fine-tune" his or her shopping area 
to make it a more attractive place to shop. 

18 Easy to return/exchange goods 

19 High quality of goods 

20 Low prices 

21 Good variety of sizes/styles 

22 Sales staff helpful/friendly 

23 Convenient shopping hours 



24 Clean stores and surroundings 

25 A lot of bargain sales 

1. Perform the following operations for variables 18-25: 

a. Obtain descriptive statistics, including the mean 
and median. 

b. Generate a box-and-whisker plot for the variable. 
Does the distribution appear to be skewed? If so, 
is the skewness positive or negative? 

2. Based on the results for question 1, which attributes seem 
to be the most important and the least important in 
respondents' choice of a shopping area? 

3. Use their respective coefficients of variation to compare 
the relative amount of dispersion in variable 29 (number 
of persons in the household) with that of variable 30 
(respondent's age). 

4. Determine the coefficient of correlation between vari- 
able 29 (number of persons in the household) and vari- 
able 30 (age). What percentage of the variation in 
household size is explained by respondent age? 
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Baldwin Computer Sales (A) 




Baldwin Computer Sales is a small company located in 
Oldenburg, Washington. The founder of the company, 
Jonathan Baldwin, began the business by selling computer 
systems through mail-order at discount prices. Baldwin 
was one of the first computer mail-order companies to 
offer a toll-free phone number to their customers for support 
and trouble-shooting. Although the company has grown 
over time, many new competitors have entered the com- 
puter mail-order marketplace so that Baldwin's share of 
this market has actually declined. 

(continued) 
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Five years ago, Bob Gravenstein, a marketing consul- 
tant, was contracted by Baldwin to develop long-term 
marketing plans and strategies for the company. After 
careful study, he recommended that Baldwin branch out to 
reach a new and growing segment of the computer sales 
market: college students. At that time, most universities 
were providing microcomputer labs and facilities for their 
students. However, many students were purchasing their 
own computers and printers so that they could have access 
to a computer at any time in their apartments or dorms. 
After graduation, students take the computers, with which 
they are already familiar, to their new jobs or use them at 
home after work hours. The percentage of college students 
owning a computer was increasing rapidly, and Bob 
Gravenstein recommended that Baldwin Computer Sales 
take advantage of this marketing opportunity. 

The marketing plan developed by Bob Gravenstein 
worked as follows: Five universities were initially selected for 
the program with expansion to other universities planned 
over time. Any student in at least his or her sophomore year 
at one of these universities was eligible to purchase a dis- 
counted JCN-2001 microcomputer system with printer from 
Baldwin Computer Sales under the program. The JCN-2001 
is a private-label, fully compatible system with all of the 
features of brand name models. The student makes a small 
payment each semester at the same time regular tuition pay- 
ments are due. When the student graduates and finds a job, 
the payments are increased so that the computer will be paid 
for within two years after graduation. If the student fails to 
make payments at any time, Baldwin could repossess the 
computer system. 

The prospect of future sales of computer equipment to 
these students was also part of the marketing strategy. The 
JCN-2001 is an entry-level computer system that suffices 
for most academic and professional uses. Eventually, how- 
ever, many students who purchased this system would out- 
grow it and require an upgrade to a more powerful machine 
with more features. Bob Gravenstein argued that after 
their good experience with the JCN-2001, these customers 
would make their future purchases from Baldwin Computer 
Company. 

Today, five years later, Baldwin Computer Sales is still 
operating the student purchase program and has expanded 
it to several more universities. There are currently enough 
data available from the early days of the program for Bald- 
win to determine whether the program has been successful 
and whether it should be continued. To discuss the future 
of the student purchase program, Jonathan Baldwin has 
called a meeting with Ben Davis, who has been in charge of 
the program since it began, and Teresa Grimes, the new 
vice president of marketing. 

Baldwin: "I called you both in here today to discuss the 
student purchase program. As you know, the pro- 
gram has been in place for approximately five years 
and we need to make decisions about where to go 



from here. Ben, weren't you telling me last week that 
we now have enough data to evaluate the program?" 

Davis: "Yes, sir. Any student who began the program as 
a sophomore five years ago should be out of school 
and employed for at least two years." 

Baldwin: "Well, based on your information, would you 
say the program has been a success or a failure?" 

Davis: "That's a tough call, sir. While most of the par- 
ticipants in the program eventually pay their account 
in full, we have had a high rate of defaults, some 
while the students were still in school, but mostly 
after they graduated." 

Baldwin: "How much are we losing on those who 
default?" 

Davis: "Each case is different. As I said, some default 
early and others after they have graduated. There are 
also the costs of repossession and repair to bring the 
product back up to resale quality. In many instances 
we were not able to retrieve the computer systems. 
Our data suggest that our average loss on each student- 
customer who defaults is about $1200. On the other 
hand, our average profit from participants who pay 
their accounts in full is approximately $750. Overall, 
we are close to just breaking even." 

Grimes: "Ben, have you considered 'qualifying' stu- 
dents for the program, much like a loan officer 
would qualify someone for a loan?" 

Davis: "We had initially thought about doing that, but 
we didn't believe that there was much information, if 
any, in the way of a credit history on most college 
students. Applicants are still requested to provide as 
much information as possible on their application, 
including their class, grade point average, work expe- 
rience, scholarships, and how much of their college 
expenses were earned through work. However, we 
weren't sure that this information was particularly 
useful for screening applicants." 

Grimes: "Knowing that we were going to have this dis- 
cussion today, I had one of my assistants who is well- 
versed in statistics look over some of those data last 
week. She has come up with a 'screening test' based 
only on the information you have been collecting 
from student applicants. By being more selective 
about whom we allow to participate in the student 
purchase program, it may be possible to increase our 
profit from the program." 

Davis: "It would be easy enough to check out her 
screening test by trying it out on our early data 
from the program. In those cases, we know 
whether or not the student actually defaulted." 

Baldwin: "Why don't the two of you spend some time 
on this idea and get back to me next week? At that 
time I want a recommendation to either discon- 
tinue the program, continue it as is, or continue it 
using this 'screening test' idea. Make sure you have 
the evidence to back up your recommendation." 
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Influence of a Single Observation on the Median 




The dots in this applet can be viewed as showing the 
speeds of a sample of nine cars being driven through a res- 
idential area. Eight of the values are fixed, but we can 
change the "speed" of one of them to see how this change 
affects the median for the sample data. By clicking on the 
data value for the ninth car (represented by the green dot 
at the far right), we can drag the value upward or down- 
ward and change this car's speed. The median for the data 
is shown at the top of the applet. 
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Applet Exercises 

1.1 Click on the rightmost green dot and drag it so that the 
speed of the ninth car is 40 mph. What is the value for the 
median? 

1.2 Click on the rightmost green dot and drag it so that the 
speed of the ninth car increases from 40 mph to 1 00 mph. 
What effect has this had on the value of the median? 

1.3 By adjusting the speed of the ninth car, identify the 
highest and lowest possible values for the median speed for 
the nine cars in the applet. 

1.4 Suppose the ninth car were a jet-powered supersonic 
vehicle and traveled through the neighborhood at 1 000 mph. 
Although such a speed lies far beyond the scale in the 
applet, what would be the value of the median? What 
would be the approximate value for the mean? Which 
of these two sample descriptors has been more greatly 
affected by this extreme value in the data? 
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Assignment 

Ben Davis and Teresa Grimes must analyze the data from 
the student purchase program and make a recommenda- 
tion to Mr. Baldwin about the future of the program. The 
necessary data are contained in the file BALDWIN on 
the text CD. A description of this data set is given in the 
Data Description section that follows. Using this data set 
and other information given in the case, help Ben Davis 
and Teresa Grimes evaluate the student purchase program 
and the potential usefulness of the screening test developed 
by the assistant to Teresa Grimes. The case questions will 
assist you in your analysis of the data. Use important de- 
tails from your analysis to support your recommendations. 

Data Description 

The BALDWIN file on the text CD contains data on all par- 
ticipants in the student purchase program who by now 
should have either paid in full or defaulted (i.e., those par- 



ticipants who should have graduated and held a job for at 
least two years). A partial listing of the data is shown here. 



Student School Default When Score 



6547 1 1 1 64 

4503 2 0 0 58 

1219 2 0 0 52 

9843 4 1 0 56 

6807 1 0 0 47 

6386 4 0 0 58 



These data are coded as follows: 

STUDENT: Student transaction number for identifi- 
cation purposes 

SCHOOL: University where the student was enrolled 

DEFAULT: 1, in the event of default, 

0, if account was paid in full on time 

(continued) 



Scatter Diagrams and Correlation 



This applet displays a scatter diagram that we can rearrange 
by using the slider at the bottom. By clicking on the slider 
and adjusting its position, we cause the points to gravitate 
to positions that make the linear relationship between the 
variables either stronger or weaker. For any given configura- 
tion, the best-fit straight line is shown, the coefficient of 
correlation is displayed at the top of the diagram, and we 
can reverse the nature of the relationship by clicking on the 
"Switch Sign" button. 




Applet Exercises 

2.1 Click on the slider and adjust it so that the best-fit 
straight line is as close as possible to horizontal. What is the 
value of the coefficient of correlation? 

2.2 Click on the slider and adjust it so that the coefficient of 
correlation is +1.0. Describe the best-fit line and its rela- 
tionship to the pattern of points. 

2.3 Adjust the slider so that the coefficient of correlation is 
as close as possible to +0.60. Describe how the best-fit line 
and its relationship to the pattern of points have changed. 

2.4 Click the "Switch Sign" button. Describe how the best- 
fit line and its relationship to the pattern of points have 
changed. 

2.5 With the slider positioned at the far left, gradually 
move it to the far right. Describe the changes taking place 
in the best-fit line and the pattern of points along the way. 
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Switch Sign 



WHEN: 1, if default occurred (or paid in full) before 
graduation, 

0, if default occurred (or paid in full) after 
graduation 

SCORE: Score on screening test based on student ap- 
plicant information such as his or her class, 
grade point average, work experience, schol- 
arships, and how much of their college ex- 
penses were earned through work 

1. Generate appropriate descriptive statistics of the 
screening test scores for those who did not default on 
their computer loan. Be sure to include the mean and 
the third quartile. Do the same for those who did 
default, then compare the results. Is the mean score on 
the screening test higher for those who did not default? 

2. In the descriptive statistics of the screening test scores 
for the students who did not default, identify the value 
of the third quartile and interpret its meaning. If this 



numerical value had been established as a cutoff for re- 
ceiving a computer loan, what percentage of those who 
repaid would have been denied a loan in the first place? 
Comment on the advisability of setting a cutoff score 
that is this high. 

3. Construct adjacent dotplots (use box plots if the dot- 
plot feature is not available on your computer statisti- 
cal package) and visually compare the screening test 
scores of students who did not default on their com- 
puter loan to the scores of those who defaulted. Does 
the distribution of screening test scores for those who 
did not default appear to be shifted to the right of the 
distribution of scores for those who did default? 

4. Based on your results and responses to the previous 
questions, does the screening test appear to be poten- 
tially useful as one of the factors in helping Baldwin 
predict whether a given applicant will end up default- 
ing on his or her computer loan? 
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U.5. Census: Extensive, Expensive, 
but Undercounting Is a Problem 

Since 1 790, at 1 0-year intervals, the United States has taken a count of the nation's 
populace. Required by the Constitution, this enumeration is the basis for the 
allocation of seats in the U.S. House of Representatives and is used by state and local 
governments for determining election districts. During the 2000 census, the Bureau 
of the Census spent more than $5 billion and employed about 500,000 enumerators, 
which is equivalent to hiring four of every five people who live in Vermont. 

Despite such Herculean efforts, undercounting occurs. It has been estimated that 
approximately 5 million people were not included in the 2000 enumeration and that 
those not counted were especially likely to include blacks, Hispanics, illegal aliens, 
and the poor. As a result of undercounting, such communities 
tend to receive less than proportionate shares 
of political representation and federal funds. 

To compensate for undercounting, the 
Bureau of the Census has proposed some 
changes to help it better estimate how many 
persons are missed during the conventional 
enumeration. These changes would include the 
use of sampling methods like the ones discussed 
in this chapter and also could involve small-scale, 
postcensus surveys across the nation. The proposal 
has led to conflict between states having low and 
high numbers of those who tend to be 
undercounted. Inclusion of the "undercounted" 
would cause states like California and Texas to gain 
legislators and federal funding, while Alabama, 
Pennsylvania, and others would get less. It has been 
reported that Republicans would suffer most from an 
"adjusted" census partially based on sampling meth- 
ods, because those who tend to be undercounted also 
tend to vote Democratic. 

Although charged with the seemingly straightfor- 
ward job of determining how many people live in the 
United States, the Bureau of the Census finds itself 
caught in the middle of a controversy with significant 
political overtones. To the extent that federal 
representation and funding are allocated according to the 
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population distribution, the way in which the official count is determined could greatly influence 
the fortunes or misfortunes of the individual states affected. 

Sources: Cyndee Miller, "Marketers Hail Proposed Census Changes," Marketing News, April 8, 1996, p. 1; Haya El 
Nasser, "Census Mobilization Is Largest in Peacetime," USA Today, April 24, 2000, p. 8A; Genaro C. Armas, "Billions 
Riding on Census Methods," The Indiana Gazette, March 10, 2000, p. 3. 



learning 
objectives 

After reading this 
chapter, you should 
be able to: 



Describe the types of business research studies and their purposes. 

Explain the difference between primary and secondary data. 

Describe how survey research is conducted. 

Discuss the major sources of error in survey research. 

Describe the experimentation and observational approaches to research. 

Differentiate between internal and external secondary data, and identify 
sources for each. 

Understand the relevance of the data warehouse and data mining. 

Explain why a sample is generally more practical than a census. 

Differentiate between the probability and nonprobability approaches to 
sampling, and understand when each approach is appropriate. 

Differentiate between sampling error and nonsampling error. 

Understand that probability samples are necessary to estimate the amount 
of sampling error. 



4.1 INTRODUCTION 



Statistics can be used as (1) a descriptive tool for expressing information relevant 
to business and (2) an inferential device for using sample findings to draw conclu- 
sions about a population. To use statistics for either of these applications, data are 
required, and such data may be gathered from a wide variety of sources. The tech- 
niques presented in subsequent chapters are only as good as the data on which 
they are based. 

In this chapter, we'll examine some of the popular approaches used in collecting 
statistical data. Our discussion of business and survey research will be on a 
strictly introductory level, but it should make you more familiar with the sources 
and procedures used in obtaining the kinds of data seen in this textbook, as well 
as in the business and general media. We will also discuss the importance of sam- 
pling and describe some of the more popular sampling methods used in business 
research today. 



RESEARCH BASICS 



Types of Studies 

Business research studies can be categorized according to the objective involved. 
Accordingly, we can identify four types of studies: exploratory, descriptive, causal, 
and predictive. 
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Exploratory 

Often the initial step, exploratory research helps us become familiar with the 
problem situation, identify important variables, and use these variables to form 
hypotheses that can be tested in subsequent research. The hypothesis will be a 
statement about a variable or the relationship between variables — for example, 
"Production will increase if we switch the line locations of operators A and B." 
The hypothesis may not be true, but it is useful as an assertion that can be examined 
through the collection of sample data for a test period during which the operators 
have traded line positions. 

Exploratory research can also be of a qualitative nature. One such method is 
the focus group interview, in which a moderator leads a small-group discussion 
about a topic and the client watches and listens from behind a one-way mirror. 
The U.S. Environmental Protection Agency has used focus groups to learn what 
people think and know about their drinking water. Among the results: people 
expressed distrust in water companies and knew that some contaminants could be 
in the water, but most people were unable to specify any of the specific contaminants 
that could be there. 1 

Descriptive 

As might be expected, descriptive research has the goal of describing something. 
For example, a survey by the National Association of Home Builders found that 
the average size of single-family homes ballooned from 1500 square feet in 1970 
to nearly 2400 square feet in 2004. 2 

Causal 

In causal research, the objective is to determine whether one variable has an effect 
on another. In examining data for broken utility poles, the Duquesne Light 
Company found that about 30% fewer poles were being damaged after Pennsyl- 
vania's stricter drunken driving laws took effect in 1983. According to a company 
spokesperson, "It may just be a coincidence, but before '83, we averaged 1000 
broken poles a year. After that, it dropped to about 700 a year. Most of the acci- 
dents involving our poles occur between 1 and 4 A.M., about the time when most 
bars are closing and the people are going home." 3 

Regarding causal studies, it should be pointed out that statistical techniques 
alone cannot prove causality. Proof must be established on the basis of quantita- 
tive findings along with logic. In the case of the telephone poles in the preceding 
paragraph, it would seem obvious that causation was not in the reverse direction 
(i.e., reduced pole damage causing the stricter drunk driving laws). However, we 
must consider the possibility that one or more other variables might have con- 
tributed to the reduction in pole damage — for example, the company may have 
begun a switch to underground wiring at the same time that the stricter drunken 
driving legislation was enacted. 



Source: "Consumer Feedback Focus Group Results," U.S. Environmental Protection Agency, 
www.epa.gov, July 25, 2006. 

2 David Struckey and Robert W. Ahrens, "The Swelling Dwelling," USA Today, April 10, 2006, p. 1A. 
3 "New Drunken Driving Laws May Be Best News for Poles since Creosote," Pittsburgh Post-Gazette, 
July 18, 1986, p. 8. 
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Predictive 

Predictive research attempts to forecast some situation or value that will occur in 
the future. A common variable for such studies is the expected level of future sales. 
As might be expected, forecasts are not always accurate. For example, in 1984, the 
Semiconductor Industry Association predicted a 22% increase for 1985, a year in 
which sales actually fell by about 17%. 4 With any forecast, there will tend to be an 
error between the amount of the forecast and the amount that actually occurs. 
However, for a good forecasting model, the amount of this error will be consis- 
tently smaller than if the model were not being used. Forecasting will be one of the 
topics in Chapter 18. 

The Research Process 

A research study begins by defining the problem or question in specific terms that 
can be answered by research. Relevant variables must also be identified and 
defined. The steps in the research process include (1) defining the problem, 
(2) deciding on the type of data required, (3) determining through what means the 
data will be obtained, (4) planning for the collection of data and, if necessary, 
selection of a sample, (5) collecting and analyzing the data, (6) drawing conclu- 
sions and reporting the findings, and (7) following through with decisions that 
take the findings into consideration. 

Primary versus Secondary Data 

Statistical data can be categorized in a number of ways, including primary versus 
secondary. Primary data refer to those generated by a researcher for the specific 
problem or decision at hand. Survey research (Section 4.3), experimentation, and 
observational research (both discussed in Section 4.4) are among the most popular 
methods for collecting primary data. 

Secondary data have been gathered by someone else for some other purpose. 
Secondary data can be internal or external depending on whether the data were 
generated from within your firm or organization or by an outside person or 
group. Some of the many possible sources of secondary data will be described in 
Section 4.5. 




4.1 What is the difference between primary data and 
secondary data? Between internal secondary data and 
external secondary data? 

4.2 A published article reports that 60% of U.S. 
households have cable television. The president of an 
electronics firm clips the item from the paper and files it 
under "Cable TV data." Would this material be 
considered primary data or secondary data? 



4.3 A pharmaceutical firm's annual report states that 
company sales of over-the-counter medicines increased 
by 48.1%. 

a. To the firm itself, would this information be primary 
data or secondary data? 

b. To a competitor, what type of information would this 
represent? 



4 Source: Otis Port, "A Rosy Forecast for Chips Has Skeptics Hooting," Business Week, October 14, 
1985, p. 104. 
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4.4 Bertram Pearlbinder, owner of an apartment building 
near the campus of Hightower University, is considering the 
possibility of installing a large-screen color television in the 
building's recreation room. For each of the following, indi- 
cate whether the associated data are primary or secondary: 
a. Bertram interviews the current residents of the building 

to determine whether they would rather have the color 

television or a pool table. 



b. Bertram, while reading Newsweek, notices an article 
describing the property damage in college dormitories 
and apartments. 

c. One of the tenants tells Bertram that many of the resi- 
dents are "very excited" about the prospect of getting 
the big TV. 



SURVEY RESEARCH 



In survey research, we communicate with a sample of individuals in order to 
generalize on the characteristics of the population from which they were drawn. 
It has been estimated that approximately $20 billion per year is spent on survey 
research worldwide. 5 



Types of Surveys 
The Mail Survey 

In a mail survey, a mailed questionnaire is typically accompanied by a cover letter and 
a postage-paid return envelope for the respondent's convenience. A good cover letter 
will be brief, readable, and will explain who is doing the study, why it's being done, 
and why it's important for the reader to fill out and return the enclosed questionnaire. 



The Personal Interview 

In the personal interview, an interviewer personally secures the respondent's 
cooperation and carries out what could be described as a "purposeful conversa- 
tion" in which the respondent replies to the questions asked of her. The personal 
interview tends to be relatively expensive compared to the other approaches, but 
offers a lot of flexibility in allowing the interviewer to explain questions, to probe 
more deeply into the answers provided, and even to record measurements that are 
not actually asked of the respondent. For example, along with obtaining 
responses, the interviewer may record such things as the respondent's gender, 
approximate age, physical characteristics, and mode of dress. 



The Telephone Interview 

The telephone interview is similar to the personal interview, but uses the telephone 
instead of personal interaction. Telephone interviews are especially useful for 
obtaining information on what the respondent is doing at the time of the call (e.g., 
the television program, if any, being watched). Also, use of the telephone makes it 
possible to complete a study in a relatively short span of time. 



Questionnaire Design 

Also referred to as the data collection instrument, the questionnaire is either filled 
out personally by the respondent or administered and completed by an interviewer. 
The questionnaire may contain any of three types of questions: (1) multiple choice, 
in which there are several alternatives from which to choose; (2) dichotomous, 

5 Source: "Top 25 Research Companies Maintain Position in 2003," www.esomar.org, July 25, 2006. 
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having only two alternatives available (with "don't know" or "no opinion" some- 
times present as a third choice); and (3) open-ended, where the respondent is free 
to formulate his or her own answer and expand on the subject of the question. 

In general, multiple choice and dichotomous questions can be difficult to 
formulate, but data entry and analysis are relatively easily accomplished. The reverse 
tends to be true for open-ended questions, where a respondent may state or write 
several paragraphs in response to even a very short question. Because they give 
the respondent an opportunity to more fully express his feelings or describe his 
behaviors, open-ended questions are especially useful in exploratory research. 

Proper wording of each question is important, but often difficult to achieve. 
A number of problems can arise, including the following: (1) the vocabulary level 
may be inappropriate for the type of person being surveyed; (2) the respondent 
may assume a frame of reference other than the one the researcher intended; 
(3) the question may contain "leading" words or phrases that unduly influence 
the response; and (4) the respondent may hesitate to answer a question that involves 
a sensitive topic. Examples of each situation follow: 

• Inappropriate vocabulary level 

Poor wording "Have you patronized a commercial source of cinematic enter- 
tainment within the past month?" 

Problem Vocabulary level will be too high for many respondents. 
Better wording "Have you gone to a movie within the past month?" 

• Confusing frame of reference 

Poor wording "Are you in better shape than you were a year ago?" 

Problem To what does "shape" refer — physical, financial, emotional? 

Better wording (if the desired frame of reference is physical) "Are you in 
better physical condition than you were a year ago?" 

• "Leading" words/phrases 

Poor wording "To help maintain the quality of our schools, hospitals, and 
public services, do you agree that taxes should be increased next year?" 

Problems "Schools, hospitals, and public services" has an emotional impact. 
Also, "do you agree" suggests that you should agree. 

Better wording "Do you support an increase in taxes next year?" 

• Sensitive topic 

Poor wording "How much money did you make last year?" 

Problem This question requests detailed, personal information that respon- 
dents are hesitant to provide. The question is also very blunt. 

Better wording "Which of the following categories best describes your 
income last year?" 



The preceding represent only a few of the pitfalls that can be encountered 
when designing a questionnaire. In general, it's a good idea to pretest the ques- 
tionnaire by personally administering it to a small number of persons who are 
similar to the eventual sample members. 



[ ] under $20,000 
[ ] $20,000-$39,999 
[ ] $40,000-$59,999 



[ ] $60,000-$79,999 
[ ] $80,000-$99,999 
[ ] $100,000 or more 
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Sampling Considerations in Survey Research 

The sampling techniques presented later in the chapter are readily applicable to 
survey research. As we will discuss further in Section 4.7, we can make statistical 
inferences from the sample to the population only if we employ a probability 
sample — one in which every person in the population has either a known or a 
calculable chance of being included in the sample. 

One of the problems that is unique to telephone surveys is the growing inci- 
dence of unlisted numbers. Overall, about 30% of U.S. households do not have 
their phone number listed in the directory. In some locales, the percentage is much 
higher — for example, over 70% in some parts of California. Households that are 
unlisted may have different characteristics than those that are listed. Studies have 
found unlisted households are more likely to be urban, young, and mobile compared 
to households whose number is in the telephone directory. 6 

Random-digit dialing is a technique that has been developed to overcome the 
problem of unlisted numbers. In this approach, at least the final four digits are 
randomly selected, either by a computer or from a random number table. The 
area code and the three-digit exchange number can also be randomly selected if 
the survey is national in scope. The use of randomly selected numbers results in 
both listed and unlisted households being included in the sample. 

If we are to use information from a few thousand sample members to make 
inferences about a population numbering in the millions, the sample selection 
process must be both precise and rigorous. As a result, a number of companies 
specialize in the generation of samples for survey research clients. Survey 
Sampling, Inc., a leading supplier of statistical samples for the research industry, 
is such a company. Each year, Survey Sampling "marks" millions of telephone 
numbers that have been selected from its Random Digit Sampling Database. Any 
number that is called is automatically eliminated from consideration for a period 
of 6 months. One of the purposes of this practice is to help ensure that respon- 
dents will be "fresher" and more likely to cooperate. 7 

Mailing lists are readily available for the identification of practically any pop- 
ulation a business researcher would ever want to consider. For example, infoUSA, 
Inc. has a database of 11 million businesses across the United States, and these are 
coded according to such criteria as geographical location and standard industrial 
classification (SIC). If you wish to mail a questionnaire to a simple random 
sample from populations like the 24,947 sporting goods retailers, the 24,758 
tanning salons, or the 17,811 siding contractors, you can even obtain the mailing 
labels directly from the list company. 8 

Either an entire list or a sample from it can be generated on the basis of geogra- 
phical location, at levels down to individual states and counties. Of particular interest 
is the ability to select samples of persons from what are known as compiled versus 
response lists. The distinction between these two kinds of samples is described here: 

COMPILED LIST A sample consisting of firms or persons who are alike in some 
way, such as being (1 ) physicians, (2) homeowners, (3) personal computer owners, or 
(4) residents of Westmoreland County, Pennsylvania. A compiled list is passive in that 
you can be placed on one without having done anything. 

(continued) 

^Source: Survey Sampling, Inc., website, May 2003. 
7 Ibid. 

8 Source: American Business Information, Sales Leads & Mailing Lists, August 1999, pp. 30-31. 
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RESPONSE LIST A sample consisting of firms or persons who have engaged in a 
specified behavior or activity, such as (1) subscribing to PC Magazine, 
(2) contributing to Greenpeace, (3) buying from a mail-order catalog, or 
(4) applying for an American Express credit card. 

Like questionnaire design, survey sampling can itself be the topic of an entire 
textbook. This is especially true when the design involves either proportionate or 
disproportionate stratified sampling, two of the more advanced designs discussed 
later in the chapter. 

Errors in Survey Research 

Survey research may lead to several different kinds of errors, and it's recom- 
mended that you be a skeptical "consumer" of statistical findings generated by 
this mode of research. These errors may be described as sampling error, response 
error, and nonresponse error. Sampling error, discussed below, is a random error. 
It can also be described as nondirectional or nonsystematic, because measure- 
ments exhibiting random error are just as likely to be too high as they are to be 
too low. On the other hand, response and nonresponse errors are both of the 
directional, or systematic, type. 

Sampling Error 

Sampling error occurs because a sample has been taken instead of a complete 
census of the population. If we have taken a simple random sample, the meth- 
ods described in Chapter 9 can be used to estimate the likely amount of error 
between the sample statistic and the population parameter. One of the procedures 
discussed in Chapter 9 is the determination of the sample size necessary to have a 
given level of confidence that the sample proportion will not be in error by more 
than a specified amount. 

Response Error 

Some respondents may "distort" the truth (to put it kindly) when answering a 
question. They may exaggerate their income, understate their age, or provide 
answers that they think are "acceptable." Biased questions can even encourage 
such response errors, for example, "Shoplifting is not only illegal, but it makes 
prices higher for everyone. Have you ever shoplifted?" 

Nonresponse Error 

Not everyone in the sample will cooperate in returning the questionnaire or in 
answering an interviewer's questions. This would not be a problem, except that 
those who respond may be different from those who don't. For example, if we're 
using a mail questionnaire to find out the extent to which people are familiar 
with the works of William Shakespeare, those who are less literate or less inter- 
ested in this classic author may also be less likely to complete and return our 
questionnaires. As a result, we could "measure" a much higher level of interest 
than actually exists. 
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4.5 What are the major approaches to carrying out 
survey research? 

4.6 Provide an example of a survey question that would 
tend to exceed the vocabulary level of the typical adult. 

4.7 Comment on the appropriateness of the question, 
"Have you ever broken the law and endangered the lives 
of others by running a red light?" 

4.8 What is random-digit dialing and why is it used? 

4.9 How does a compiled mailing list differ from a 
response mailing list? 

4.10 Explain what is meant by sampling error, response 
error, and nonresponse error in survey research. 

4.11 A research firm finds that only 33% of those who 
reported buying Kellogg's Frosted Flakes had actually 



purchased the product during the period monitored. 
What type of survey research error does this represent? 

4.12 In order to increase the response rate to mail ques- 
tionnaires, researchers sometimes include a dollar bill or 
other monetary incentive to reward the respondent for 
his or her cooperation. Could there be occasions where a 
relatively large reward — e.g., the inclusion of a $20 bill 
with a short questionnaire — might end up exerting a 
biasing influence on the answers the respondent 
provides? If so, give a real or hypothetical example of 
such an occasion. 

4.13 A company has constructed two prototypes of a new 
personal digital assistant (PDA) and would like to find out 
which one of the two looks easier to use. Of the three 
types of surveys discussed in this section, which one 
would the company not wish to employ? 



EXPERIMENTATION AND 4.4 
OBSERVATIONAL RESEARCH 



Experimentation 

In experiments, the purpose is to identify cause-and-effect relationships between 
variables. There are two key variables in an experiment: (1) the independent 
variable, or treatment, and (2) the dependent variable, or measurement. 

We must also consider what are known as extraneous variables — outside 
variables that are not part of the experiment, but can influence the results. 
Persons or objects receiving a treatment are said to be in an experimental group, 
while those not exposed are in the control group. In symbolic terms, we will 
refer to the independent variable as "T" and each measurement of the depen- 
dent variable as "O": 

O l5 1st measurement: T, treatment: 0 2 , 2nd measurement: 

In 1974, 53.0% of 
Norwegian males 
smoked daily. 



In 1975, Norway 
banned tobacco 
advertising. 



In 1984,41.6% of 
Norwegian males 
smoked daily. 



In evaluating experiments, two kinds of validity must be considered: internal 
and external. We will examine each in terms of the Norwegian ban on tobacco 
advertising. 9 

Internal Validity 

Internal validity refers to whether T really made the difference in the measure- 
ments obtained. In the preceding experiment, was it really the advertising ban 



'Source: Jack Burton, "Norway Up in Smoke," Advertising Age, April 14, 1986, p. 82. 
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that made the difference? For example, extraneous variables included a high 
cigarette tax ($1 to $2 a pack), public service announcements, and extensive 
educational efforts on the hazards of smoking. 

External Validity 

Even if T did make the difference, external validity asks whether the results can 
be generalized to other people or settings. For example, would similar results have 
been obtained in the United States or Great Britain, or if the tobacco ad ban had 
been enacted 10 or 20 years earlier instead of in 1975? 

Health groups have long studied such phenomena as the effect of cigarette 
smoking on longevity. However, it is difficult to remove the influence of other 
variables on this relationship. For example, the heavy smoker is more apt to be a 
heavy drinker or a compulsive workaholic, while the nonsmoker is more likely to 
be a nondrinker or light drinker and to be more attentive to other factors (such as 
exercise and weight control) that tend to increase life span. 

It can be useful to design an experiment in which people or objects are 
randomly placed into either the experimental or control group, then measure- 
ments are made on the dependent variable. Consider the following example in 
which 200 persons are randomly assigned, 100 to each group: 

• Experimental group (n 1 = 100): 



Members sample soft 
drink in bottle with a 
new label design. 




65% say 
"tastes great." 


— >- 







• Control group (n 2 = 100): 



Members sample the same 
soft drink, but in bottle 
with the old label design. 




39% say 
"tastes great." 







These results suggest that the new label is effective in causing testers to evaluate 
the soft drink as tasting good. Although this is just one of many possible experimen- 
tal designs, it shows how an experimental group and a control group can be used in 
carrying out an experiment. To facilitate internal validity, it's best if persons or objects 
are randomly assigned to the experimental and control groups. The significance of the 
difference between the 65% for the experimental group and the 39% for the control 
group can be obtained using the hypothesis testing methods presented in Chapter 11. 

Observational Research 

Observation relies on watching or listening, then counting or measuring. For 
example, observational research allows us to passively watch highway travelers 
and monitor variables such as speed and adherence to other traffic laws, or the 
extent to which drivers from one age group, gender group, or vehicle group behave 
differently from those of other groups. One of the more interesting applications of 
observational research involves actually contriving a situation that is to be 
observed — for example, hiring a "mystery shopper" to present retail store person- 
nel with a given problem, product return, or complex request, then have the hired 
shopper submit a report on how he or she was treated, and the extent to which her 
contrived situation was handled according to corporate policy. 
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Experimental research 

Cause and effect: The national speed limit had 
been credited with saving 2000-4000 lives. 
$2 billion worth of imported oil, and $65 million 
of medical and welfare payments each year. 



■ 



Survey research 

Public support: Surveys found the public 
supported the limit by a margin of 3 to 1 (2 to 1 
for high-mileage drivers). 




Observational research 



Compliance: Motorists observed driving on 
highways posted at 55 traveled at an average 
speed of 59.1 mph. 





FIGURE 4.1 



News media often report 
research results in the con- 
text of an article or story. 
This figure dramatizes the 
unusual case of a single 
article that included the 
findings of three different 
types of investigations cen- 
tered on the same topic: 
the 55-mph national speed 
limit that was in effect 
between 1974 and 1987. 
Note the contradiction 
between the support indi- 
cated by surveys and the 
lack of compliance 
measured in observational 
studies. 



Data source: Geoff Sundstrom, "Speed Limit Retention Is Favored," Automotive News, December 3, 1984, p. 6. 



The results of surveys, experiments, and observational studies are often reported 
in news media. Figure 4.1 shows data from a single article containing elements of all 
three research methods. The research dealt with the former 55-mph national speed 
limit and involved a 2-year study by the Transportation Research Board of the 
National Research Council. The study seems to have included some rather contra- 
dictory findings. 



4.14 In studying the effectiveness of monetary rewards on 
seat belt usage, a firm has been giving out lottery tickets to 
employees wearing their belts when entering the company 
parking area. At the end of each week, the holder of the 
winning ticket gets a free dinner at a local restaurant. Before 
the experiment, only 36% of employees wore seat belts. 
After 1 month of the lottery, the usage rate is up to 57%. 

a. Identify the dependent and independent variables in 
this experiment. 

b. Is a control group involved in this experiment? If 
so, who? 

4.1 5 A direct-mail firm tries two different versions of a 
mailing, one of which includes a 50-cent monetary incentive 
for the respondent. Responses to the mailing with the incen- 
tive were 15% higher than for a similar mailing without it. 
a. Identify the dependent and independent variables in 

this experiment. 



b. Is a control group involved in this experiment? If 
so, who? 

4.16 In a health study, the dietary habits of 420,000 men 
were examined over a 3-year period. For those who ate 
no fried foods, the death rate was 1208 per 100,000 
man-years. For men who consumed fried foods more 
than 15 times a week, the corresponding rate was only 
702. Considering that the relative ages of members of 
these two groups were not mentioned, critique the inter- 
nal validity of the experiment, source: "Science Finds Fried 
Foods Not Harmful," Moneysworth magazine, November 24, 1975, p. 1 . 

4.17 When traveling, some motorists assume that a 
restaurant has good food and reasonable prices if there 
are a lot of trucks parked outside. What other observed 
characteristics of a restaurant might a traveler use in 
deciding whether to stop and eat? 
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4.18 In doing "observational studies" while driving 
through suburban neighborhoods during the summer 
months, home burglars look for potential victims who are 
away on vacation. What kinds of observation indicators 



would tend to tell the burglars that a family is likely away 
on vacation? What are some of the strategies that vacation- 
ing homeowners might use in attempting to fool potential 
burglars into thinking they are not away on vacation? 



E 




Secondary data are collected by someone other than the researcher, for purposes 
other than the problem or decision at hand; such data can be either internal or exter- 
nal. Internal secondary data are those that have been generated by your own firm or 
organization. External secondary data were gathered by someone outside the firm or 
organization, often for another purpose. The external sources described here are 
among the more popular, but they by no means exhaust the possibilities. As a prac- 
tical matter, if you're in search of relevant secondary data, don't hesitate to consult 
the information desk of your university or corporate library. Don't even think about 
feeling embarrassed — this is one case where being humble for a few minutes can save 
you many hours of frustration. 



Internal secondary data have traditionally existed in the form of accounting or 
financial information. Among the many other possibilities are annual reports, as 
well as more specific reports that may be available from departments such as 
production, quality control, engineering, marketing, sales, and long-range plan- 
ning. In short, anything in written form that has ever been generated within the 
company falls within the realm of internal secondary data. In addition, some 
information may not even exist in written form, but may be found in the collec- 
tive memories and observations that executives, managers, and engineers have 
recorded over the years. 

External Sources 
Government Agencies 

Federal, state, and local governments are extremely prolific generators and 
purveyors of secondary data, with the federal government, through the U.S. 
Department of Commerce and its Bureau of the Census, being an especially 
important source. The following are among the many U.S. Department of Commerce 
publications that are highly relevant to business research: 

• Statistical Abstract of the United States Published annually, this volume is 
an especially extensive collection of social, economic, and political data from 
a wide variety of sources, both governmental and private. 

• Census of Population Conducted every 10 years, this is the research for 
which the Bureau of the Census is probably best known. It provides detailed 
information on the U.S. population, including age, education, employment, 
and a broad range of other demographic characteristics. 

• Census of Housing Also published every 10 years, this report presents 
voluminous data on such variables as home ownership or rental, type of struc- 
ture, number of residents, market value, heating source, major appliances, 
and much more. 



Internal Sources 
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• Economic Censuses Conducted in years ending in 2 and 7, these studies 
provide data on retail and wholesale trade, the construction and transporta- 
tion industries, and the manufacturing and minerals industries. Data are 
aggregated by national, regional, and local levels. 

• Survey of Current Business A monthly publication containing information 
on a number of statistical topics, including population, trade, and transporta- 
tion. It includes over 2500 updated statistical series. 

Other key publications of the U.S. Department of Commerce include Historical 
Statistics of the United States from Colonial Times to 1970, County Business 
Patterns, State and Metropolitan Area Data Book, and the U.S. Industrial Outlook. 
The Department of Labor provides employment and wage information in the 
Monthly Labor Revieiv, while the Federal Reserve System publishes the Federal 
Reserve Bulletin, which includes statistics on banking, industrial production, and 
international trade. In addition to the staggering quantity of information avail- 
able from the federal government, published data are also provided by govern- 
mental agencies at the state, county, and local levels. 

Other Published Sources 

This category includes a variety of periodicals, handbooks, and other publications. 
The following list is a sample of the information available: 

• Business Periodicals Index Published monthly, this is a bibliography of arti- 
cles in selected business publications, all indexed by subject area. 

• Reader's Guide to Periodical Literature Similar to the Business Periodicals 
Index, but indexes popular, rather than business, publications. 

• Rand McNally Commercial Atlas and Marketing Guide Includes statistical 
data for each county in the United States and for about 100,000 cities and towns. 

Other published sources include (1) indexes, such as the New York Times 
Index and the Wall Street Journal Index, and (2) popular business publications, 
including American Demographics, Business Week, Fortune, Sales and Marketing 
Management, Dun's Business Month, Business Horizons, and Industry Week. 

Commercial Suppliers 

Information may also be purchased from outside suppliers who specialize in 
business research. One of the best-known commercial suppliers is the A. C. 
Nielsen Company, which provides a wide variety of data services for client firms. 
Although some data are supplied to a number of subscribing firms, studies may 
also be tailored to the specific needs of a single research customer. 

Trade Associations 

Trade associations collect and disseminate data that are related to their industry; 
these groups may be identified in the Encyclopedia of Associations. This publication 
is also an excellent source of private organizations having special interests that 
may be related to your informational needs. Most such groups generate regular 
publications that are available upon request or special order. 

Data Warehousing and Data Mining 

Advancements in computing power, mass data storage, and data analysis tech- 
niques have led to a relatively recent and exciting data source — the data ware- 
house. This is a large collection of data from inside and outside the company, put 
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together for the express purpose of using sophisticated analytical techniques to 
uncover patterns and discover interrelationships within the data. The procedure by 
which these analytical techniques are applied is known as data mining — it is anal- 
ogous to physically sifting through great quantities of ore in the quest for gold. 

The data warehouse typically contains massive information from a variety of 
sources that pertains to customers, vendors, products, or any other organizational 
or environmental subjects of interest to the company's decision makers. It is esti- 
mated that over 90% of large corporations worldwide either have or are building 
data warehouses, and that the market for data warehousing hardware and soft- 
ware grew from practically nil in 1994 to more than $15 billion by 2000. 10 

The tools for data mining comprise many of the techniques covered in this 
text, such as descriptive statistics, cross-tabulations, regression, and correlation, 
but also include many methods that are far beyond the scope of our text, such as 
those briefly described here. 

Factor analysis A set of techniques for studying interrelationships among 
variables. A very large set of variables can be reduced to a smaller set of new 
variables (factors) that are more basic in meaning but contain most of the 
information in the original set. 

Cluster analysis When each of a great many individuals or objects is repre- 
sented by a set of measurements or variables, cluster analysis categorizes the 
individuals or objects into groups where the members tend to be similar. 
In marketing, one application of cluster analysis is to categorize consumers into 
groups, a procedure associated with what is referred to as market segmentation. 

Discriminant analysis This technique identifies the variables that best sepa- 
rate members of two or more groups. It can also be used to predict group 
membership on the basis of variables that have been measured or observed. 

Multidimensional scaling Based on a ranking that reflects the extent to which 
various pairs of objects are similar or dissimilar, multidimensional scaling is a 
procedure that maps the location of objects in a perceptual space according to 
dimensions that represent important attributes of the objects. In this approach 
to multidimensional scaling, one of the biggest challenges is the interpretation 
of exactly what these dimensions mean. 

Data warehouses and data mining can be useful to many kinds of businesses. 
For example, the descriptive and predictive models developed through data min- 
ing can help telecommunications companies answer questions like these: (1) What 
kinds of customers tend to be profitable or unprofitable to the company? (2) How 
can we predict whether customers will buy additional products or services, like 
call waiting? (3) Why do customers tend to call more at certain times? (4) Which 
products or services are the most profitable? 11 Likewise, companies in the insur- 
ance industry can benefit from models that help them identify which insurance 
claims are most likely to be fraudulent, and investment firms can profit from 
models that help them better differentiate between initial public offerings (IPOs) 
that should be pursued and those that are better avoided. 

SAS, SPSS, and IBM are among those offering data mining software and assis- 
tance to a variety of firms and industries. For example, IBM's Intelligent Miner™ 

10 Source: Davis, Duane, Business Research for Decision Making (Pacific Grove, CA: Duxbury Press, 
2000), p. 355. 

"Source: Groth, Robert, Data Mining (Upper Saddle River, NJ: Prentice Hall, 2000), p. 209. 
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software was selected by Bass Export, the United Kingdom's largest beer exporter. As 
stated by Bass executive Mike Fisher, "Each week, we deliver orders to 23,000 
customers. It's vitally important that we understand the buying habits of each of 
them, the profitability of each individual account, and the profitability of each 
brand we deliver there." 12 Reuters, a leading provider of financial news and infor- 
mation, has used the SPSS-developed Clementine software to develop models for 
reducing error rates in its compilation and reporting of the massive amount of 
data it receives from worldwide sources. 13 

Internet Data Sources 

Thousands of database services — some are commercial, but many are free — offer 
bibliographical and statistical data for personal computer (PC) users. The specific set 
of available data will depend on your Internet provider, your Web-browsing soft- 
ware, and on the search engines at your disposal for specifying World Wide Web sites 
for further perusal. The PC and access to the vast electronic "ether" make it very con- 
venient to locate great quantities of information relevant to a particular business sit- 
uation or decision. The author's seventh-grade teacher had a favorite saying, "The 
more you know, the less you know you know, because you know there's so much to 
know." This certainly applies to our familiarity with the Internet and its capabilities, 
and this section will barely scratch the surface of what's out there. 

Corporate News and Annual Reports On-Line 

If you want to read a company's latest annual report, in many cases all you need 
to do is visit the firm's website and look for the "information for investors" or 
similar section. There's a good chance that your browser's home page includes a 
sponsored section where stock quotes (usually with a 20-minute or more delay) 
are available for companies on the major stock exchanges; if so, current news 
about a company or its industry is available just by pursuing the news items that 
accompany the quote. 

Secondary Data On-Line 

Many of the sources of published secondary data discussed previously offer informa- 
tion on-line as well as in hard-copy form. For example, the U.S. Bureau of the 
Census website ( http :// www, census .gov ) can be the starting point for data from sources 
such as USA Statistics in Brief or the Statistical Abstract of the United States. Its 
"State & County QuickFacts" section is particularly interesting and can provide 
detailed economic and demographic data about the county in which you live. As 
with any useful site that you visit, be sure to refer to the "Related Sites" section. In 
this case, the Bureau provides links to international statistical agencies, state data 
centers, more than 100 federal statistical agencies, and to its parent agency, the U.S. 
Department of Commerce. 

Finding Just About Anything — Those 
Incredible Search Engines 

Tonight Show host Johnny Carson used to have a segment that he called "Stump 
the band," where audience members would state a fictitious song title and the 
band would try to play music that seemed to fit the title. Regardless of whether 

12 Source: Groth, Robert, Data Mining (Upper Saddle River, NJ: Prentice Hall, 2000), p. 11. 
13 Ibid. 
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Get That Cat off the Poll! 

When a public opinion poll is conducted professionally, 
respondents are carefully and scientifically selected to 
ensure that they are representative. However, as with 
many other human endeavors, public opinion polls are 
sometimes carried out using methods that are not quite so 
professional. One unfortunate polling firm was the target 
of special scrutiny by a federal grand jury in New Haven, 
Connecticut, for techniques it allegedly employed during 
some of its studies between 2002 and 2004. According to 



prosecutors, the polling firm's owners and managers had 
instructed fieldworkers to fabricate survey results, to com- 
plete surveys even if they had to make up the answers 
themselves, and to meet their response quotas "even if 
they had to talk to dogs and cats." 



Source: Noreen Gillespie, "Polling Firm Accused of Falsifying Results," The 
Indiana Gazette, March 10, 2005, p. 7. 



the band was "stumped," the audience member always received a free dinner or 
other prize. 

If there were ever a contest called "Stump the Internet search engine," it 
would have few winners. Regardless of how esoteric the information or topic we 
are seeking, we can use a search engine like Google to obtain a wealth of guid- 
ance in leading us toward our goal. A prerequisite to using the search engines 
effectively is the strategic selection of the key words to be used in the search. 
It can also be helpful to use the "search within a search" or "power search" features 
to enter more specific key words to be found within a search that has already 
been conducted. 

Tapping Others' Expertise — Searching 
Discussion Groups 

Discussion (or news) groups can be viewed as the Internet counterpart to the 
"brick and mortar" organizations listed in the Encyclopedia of Associations. For 
practically any human interest or pursuit, you'll find some forum with participants 
who are enthusiastic about the topic and possess high levels of knowledge and 
experience. Select the "Groups" option at google.com, enter key words that 
describe your problem or interest, and you'll get a list of postings that contain one 
or more of the key words. Note: There are some privacy issues here — for example, 
if you've posted to a discussion group, a present or potential employer can search 
the groups for your name to find out more about you, your interests, and the foot- 
prints you've made during your electronic travels. 

Evaluating Secondary Data 

Secondary data should be selected, evaluated, and used with caution, since the 
information may be irrelevant to your needs, out of date, not expressed in units 
or levels of aggregation that are compatible with your informational require- 
ments, or may not have been collected or reported objectively. When considering 
the use of secondary information, be sure to keep in mind both who collected the 
information and whether the individual or group may have had some stake in the 
results of their own research. Honesty and reputation are also very important. 
Statistics in Action 4.1 describes a situation where a company may have been 
more interested in obtaining respondents than in obtaining facts. 



Chapter 4: Data Collection and Sampling Methods 




4.19 What are secondary data? Differentiate between 
internal and external secondary data. 

4.20 Besides accounting and financial information, what 
are some of the other sources of internal secondary data? 

4.21 What kinds of external secondary data are readily 
available on-line? 

4.22 Briefly, what kinds of information does the U.S. 
Department of Commerce provide in its Census of Popu- 
lation? In its Census of Housing? 

4.23 What is the Encyclopedia of Associations, and 
how can this publication be useful to the business 
researcher? 

4.24 What is data mining, and how can it be useful to a 
business or other organization? What are some of the 
analytical tools it employs? 

Note: Exercises 4.25-4.31 require a computer and access 
to the Internet. Web addresses were accurate at this 
writing. If a site address is no longer applicable, use a 
search engine (e.g., google.com) to identify the company 
or organization's new home page. 

4.25 Visit the U.S. Bureau of the Census ( http://www. 
census.gov ). and find the following descriptive statistics 
for the county where your college or university is located: 
a. Median household income 



b. Number of Social Security recipients 

c. Per capita retail sales 

4.26 Repeat Exercise 4.25, but obtain the requested 
descriptive statistics for Dare County, North Carolina. 

4.27 Referring to each company's on-line annual report, 
find the number of employees and net income in the most 
recent fiscal year for Coca-Cola, McDonald's, Dell 
Computer, and Compaq. 

4.28 Visit google.com and search discussion groups for 
postings that pertain to (a) a professional sports team 
and (b) a hobby or recreational activity of your choice. 

4.29 Using your browser's "people finder" capability, find 
out how many telephone listings are in your state for 
people with the same last name as yours. How many 
listings are there in your state for people named Smith? 

4.30 Using one or more search engines and key words of 
your choice, find (a) a website that reports the average life 
span of Saint Bernard dogs and (b) a website that advises 
business professionals on strategies for writing an effective 
memo. 

4.31 Using a search engine and the key words "spy 
equipment," identify some of the high-tech gadgets that 
companies or individuals can use in carrying out 
industrial espionage on other companies or individuals. 



THE BASICS OF SAMPLING 



Selected Terms 

The following terms — some of which we have discussed previously in other 
contexts — are of special relevance to our discussion of samples, the importance of 
sampling, and the methods by which samples are selected: 



POPULATION The set of all possible elements that could theoretically be 
observed or measured; this is sometimes referred to as the universe. 
SAMPLE A selected portion from the elements within the population, with these 
elements actually being measured or observed. 

CENSUS The actual measurement or observation of all possible elements from the 
population; this can be viewed as a "sample" that includes the entire population. 

PARAMETER A characteristic of the population, such as the population mean 
(fi), standard deviation (a), or population proportion (it). 

STATISTIC A characteristic of the sample, such as the sample mean (x), standard 
deviation (s), or sample proportion (p). In practice, this is used as an estimate of the 
(typically unknown) value of the corresponding population parameter. 
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Why a Sample Instead of a Census? 

A Census Can Be Extremely Expensive 
and Time-Consuming 

A complete census of the population is not a problem whenever the population 
happens to consist of a small number of elements that are easily identified. For 
example, there are only 20 metal-ore mining firms that have 500 or more 
employees. 14 In this setting, a study involving a complete census would present 
little difficulty. On the other hand, infoUSA, Inc., lists 61,763 pizza parlors in the 
United States. 15 Contacting every member of such a large population would 
require great expenditures of time and money, and a sampling from the list can 
provide satisfactory results more quickly and at much lower cost. 

A Census Can Be Destructive 

The Gallo Wine Company, like every other winery, employs wine tasters to ensure 
the consistency of product quality. Naturally, it would be counterproductive if the 
tasters consumed all of the wine, since none would be left to sell to thirsty cus- 
tomers. Likewise, a firm wishing to ensure that its steel cable meets tensile- 
strength requirements could not test the breaking strength of its entire output. 
As in the Gallo situation, the product "sampled" would be lost during the sam- 
pling process, so a complete census is out of the question. 

In Practice, a Sample Can Be More 
Accurate Than a Census 

Because of the tremendous amounts of time and money required for a complete 
census of a large population, a sample can actually be more accurate than a cen- 
sus in conducting a real-world study. For example, if we hired 10 people to work 
16 hours a day telephoning each of the more than 100 million television house- 
holds in the United States, it would take years to ask all of them whether they 
watched the CNN program on political campaigns that aired last night. By the 
time the last batch of calls was made, nobody would be able to remember whether 
they had watched a given program that was aired 2 or 3 years earlier. 

Sampling can also contribute to accuracy by reducing nonsampling error. This 
occurs because we can now afford to spend more time and attention on the mea- 
suring instrument and related procedures. For example, if a survey question is 
biased ("Most studies find that capital punishment does not deter crime. Do you 
agree that capital punishment should be abolished?"), asking that same question 
of everyone in the entire population will only ensure that you get the same biased 
answers from everyone instead of from just a sample. With some of the money 
saved by conducting a sample instead of a census, we can fine-tune the measure- 
ment procedure through such strategies as pretesting the questionnaire, hiring 
more-experienced interviewers, or following up on those who did not initially 
respond. This accuracy advantage of a sample over a census is especially valuable 
to survey research. As was observed in the vignette at the beginning of this chap- 
ter, even the U.S. Bureau of the Census has proposed that its census efforts be sup- 
plemented with survey information based on samples. 

14 Source: Bureau of the Census, U.S. Department of Commerce, Statistical Abstract of the United 
States 2002, p. 541. 

15 Source: Sales Leads & Mailing Lists, p. 26. 
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Although a census is theoretically more accurate than a sample, in practice we 
are limited by funding constraints such that a fixed amount of money must be 
divided between the sampling (or census) process and the measurement process. 
If we allocate more funds to the sampling or census process, we have less to spend 
on the assurance of accuracy in the measurement process. Here is a brief review 
of the concepts of sampling and nonsampling error: 

SAMPLING ERROR This is the error that occurs because a sample has been 
taken instead of a census. For example, the sample mean may differ from the true 
population mean simply by chance, but is just as likely to be too high as too low. 
When a probability sample (discussed in the next section) is used, the likely 
amount of sampling error can be statistically estimated using techniques 
explained in Chapter 9 and can be reduced by using a larger sample size. For a 
visual demonstration of sampling error, see Seeing Statistics Applet 3 at the end 
of the chapter. 

NONSAMPLING ERROR This type of error is also referred to as bias, because 
it is a directional error. For example, survey questions may be worded in such a 
way as to invite responses that are biased in one direction or another. Likewise, 
a weighing scale may consistently report weights that are too high, or a worn 
micrometer may tend to report rod diameters larger than those that actually 
exist. Nonsampling errors cannot be reduced by simply increasing the size of 
a sample. To reduce nonsampling error, it is necessary to take some action 
that will eliminate the underlying cause of the error. For example, we may 
need to adjust a machine or replace parts that have become worn through 
repeated use. 




4.32 What is the difference between a sample and a 
census? Why can it be advantageous to select a sample 
instead of carrying out a census? 

4.33 Differentiate between sampling error and nonsam- 
pling error. 

4.34 Provide a real or hypothetical example of a situation 
in which the sampling process is destructive. 

4.35 Differentiate between the terms parameter and 
statistic. Which one will be the result of taking a sample? 



4.36 To impress the interviewer, many respondents in a 
survey of 400 persons exaggerated their annual incomes 
by anywhere from 2% to 50%. Does such exaggeration 
represent sampling error or nonsampling error? Explain. 

4.37 In survey research, what are some of the methods by 
which we can reduce nonsampling error? 



SAMPLING METHODS 



Sampling methods can be categorized as probability or nonprobability. The distinction 
is that with probability sampling, each person or element in the population has a 
known (or calculable) chance of being included in the sample. Such samples also 
allow us to estimate the maximum amount of sampling error between our sample 
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statistic and the true value of the population parameter being estimated. Nonproba- 
bility sampling is primarily used in exploratory research studies where there is no 
intention of making statistical inferences from the sample to the population. Our 
emphasis in this and the chapters to follow will be on samples of the probability type. 

Probability Sampling 

In probability sampling, each person or element in the population has some 
(nonzero) known or calculable chance of being included in the sample. However, 
every person or element may not have an equal chance for inclusion. 

The Simple Random Sample 

In the simple random sample, every person or element in the population has an 
equal chance of being included in the sample. This type of sample is the equiva- 
lent of "drawing names from a hat" but is usually generated using procedures 
that are more workable than the familiar "hat" analogy. 16 

A practical alternative to placing names in a hat or box is to identify each person 
or element in the population with a number, then use a random number table to 
select those who will make up the sample. A portion of the random number table 
in Appendix A is reproduced here as Table 4.1 and will be used in the example 
that follows. 





'1 


example 





Simple Random Sampling 

A firm has 750 production employees and wishes to select a simple random sample 
of 40 workers to participate in a quality-assurance training program. 



SOLUTION 

Using the random digits listed in Table 4.1, the following steps can be taken to 
determine the 40 individuals to be included in the sample: 

1. Identify the employees with numbers ranging from 001 to 750. 

2. Arbitrarily select a starting point in the random number table. This can be 
done by closing your eyes and pointing to a position on the table or by some 
other relatively random method of your choice. As our starting point for the 
example, we will begin with digits 11-13 in row 2 of Table 4.1. Three-digit 
numbers are necessary because the largest employee identification number is 
three digits. 

3. Work your way through the table and identify the first 40 unique (to avoid 
duplicates, since we're sampling without replacement) numbers that are 
between 001 and 750. 



16 For practicality, we are using the operational definition of the simple random sample. The theoretical 
definition says that, for a given sample size, each possible sample constituency has the same chance of 
being selected as any other sample constituency. For more details, the interested reader may wish to 
refer to a classic work on sampling: Leslie Kish, Survey Sampling, New York: John Wiley & Sons, 
1965, pp. 36-39. 
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Columns 



Row 


1- 


10 


11 


-20 




1 


I 
i 

37220 


i 

84861 


i 

59998 


i 

7731 1 




2 


31618 


06840 


45167 


13747 


Employee #451 is in sample. 


3 


53778 


71813 


50306 


4741 5 


Employee #503 is in sample. 


4 


82652 


58162 


90352 


10751 


903 > 750. 903 is ignored. 


5 


40372 


53109 


76995 


24681 


769 > 750. 769 is ignored. 


6 


24952 


3071 1 


22535 


51397 


Employee #225 is in sample. 


7 


94953 


96367 


87143 


71467 


871 > 750. 871 is ignored. 


8 


98972 


12203 


90759 


561 57 


907 > 750. 907 is ignored. 


9 


61759 


32900 


05299 


56687 


Employee #052 is in sample. 


10 


14338 


44401 

1 


22630 
1 \ 


06918 

r 


Employee #226 is in sample. 

The process continues, until 
all 40 sample members have 
been selected. 



TABLE 4.1 



A portion of the table of 
random digits from 
Appendix A. In the exam- 
ple, a simple random sam- 
ple of 40 is being selected 
from a population of 750 
employees. The digits in 
row 2, columns 11-13, 
serve as an arbitrary starting 
point. 



The procedure is summarized in Table 4.1 and begins with the selection of 
employee #451, who also happens to be at the randomly selected starting point. 
The next number, 503, also falls within the limit of 750 employees; at this point 
we have placed individuals #451 and #503 into the sample. Because the next 
number (903) exceeds 750, it is ignored and we continue down the column. 



Computer Solutions 4.1 shows an alternative to using the random number 
table. Here we show the procedures and results when Excel and Minitab are used 
to select a simple random sample from an existing list of data values. In this case, 
we are selecting a simple random sample of 10 usage times (seconds) from the 
50 ATM customers whose times were examined in Chapters 2 and 3. The data 
file is CX02ATM. 

Although the simple random sampling approach just described is a straight- 
forward procedure, it does require that we have a listing of the population mem- 
bers. This is not a problem if we are dealing with telephone directories or members 
of the Fortune 500, but it may not be practical in other situations. For example, it 
would be both expensive and awkward to take down the names and telephone 
numbers of all the shoppers at the local mall tomorrow so that we can later 
narrow them down to a simple random sample of 100 persons. 

In some cases, such as when products are coming off an assembly line, we can 
use a random selection process to determine which ones to sample. For example, 
if we would like to subject 10% of the units to a detailed inspection, a random 
number sequence could be followed. As each product comes off the line, we 
advance to the next random digit in a table of random numbers. If the digit asso- 
ciated with a particular product happens to be a "9" (an arbitrary selection), it 
and all other units that are associated with the digit 9 receive the detailed inspec- 
tion. Although it takes a slightly different form, this is a variation of the simple 
random sampling method, and each unit has the same probability (in this case, 
0.10) of being sampled. 



122 



Part 1: Business Statistics: Introduction and Background 



computer solutions 4.1 



Simple Random Sampling 

These procedures select a simple random sample from a list of existing data values. 

EXCEL 






A 


B 


C 


D 


E 


1 


AgeCat 


Gender 


Seconds 




32.0 


2 


1 


2 


50.1 




37.7 


3 


1 


1 


53.0 




44.8 


4 


2 


2 


43.2 




37.8 


5 


1 


2 


34.9 




40.1 


6 


3 




37.5 




42.9 


7 


2 




37.8 




32.9 


8 


3 




49.4 




45.0 


9 


1 




50.5 




44.8 


10 


3 




48.1 




46.4 



Open data file CX02ATM.XLS. The 50 ATM usage times are in column C, and the variable name (Seconds) is in C 1 . 
Click Tools. Click Data Analysis. Click Sampling. Click OK. 

Enter C1:C51 into the Input Range box. Click to place a check mark into the Labels box (because the input range has 
the name of the variable at the top). Select Random and enter 10 into the Number of Samples box. Enter E1 into 
the Output Range box. Click OK. The simple random sample of 10 values from the 50 ATM usage times are listed in 
column E. 



MINITAB 

Data Display 

C5 

58.1 37.8 43.2 44.8 27.5 34.4 33.6 31.0 56.3 37.7 

1 . Open file CX02ATM.MTW. The data are already listed in columns C 1 (AgeCat), C2 (Gender), and C3 (Seconds). 

2. Click Calc. Select Random Data. Click Sample From Columns. Enter 10 into the Number of rows to sample: box. 
Enter C3 into the box below the From columns: label. Enter C5 into the Store samples in box. (Do not select "Sample 
with replacement.") Click OK. The 10 data values randomly selected from the 50 usage times will be in column C5. 



The Systematic Sample 

Although similar in concept to the simple random sample, the systematic sample 
is easier to apply in practice. In this approach, we randomly select a starting 
point between 1 and k, then sample every kth element from the population. For 
example, in selecting a sample of 100 from a population of 6000, we would 
select a random number between 1 and 60, then include that person and every 
60th person until we have reached the desired sample size. The choice of k = 60 
is made on the basis that the population is 60 times as large as the desired 
sample size. 

In the context of a shopping mall, we might simply elect to survey every 10th 
person passing a given point in the mall during the entire day. Such a process is 
illustrated in Figure 4.2, except with k = 3 as our selection rule. As in the simple 
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Sample 
member 



Sample Sample 
member member 



The systematic sample is a 
probability sample in which 
every kth person or element 
in the population is 




included in the sample. In 
this example, k = 3. 



random sample, we have a rule to follow, and personal judgment is not allowed 
to enter the picture. The use of such rules, and their elimination of personal judg- 
ment or bias in the selection process, is basic to all probability sampling proce- 
dures. If such rules are not followed, we could find ourselves avoiding interviews 
with mall shoppers who are carrying heavy bags, who appear to be in a hurry, 
or who in some other way cause us to be either less likely or more likely to inter- 
view them. 

One potential problem with the systematic sample is periodicity, a phe- 
nomenon where the order in which the population appears happens to include a 
cyclical variation in which the length of the cycle is the same as the value of k that 
we are using in selecting the sample. For example, if daily sales receipts are 
arranged in order from January 1 to December 31, it would not be wise to select 
k = 7 in choosing a systematic sample. Under these conditions, we could end up 
with a sample consisting entirely of Mondays, Wednesdays, or some other given 
day of the week. Likewise, a production machine may have some quirk by which 
every 14th unit produced happens to be defective. If we were to use k = 7 or 14 
in selecting our systematic sample, we would end up with a sample that is not rep- 
resentative of production quality. Although such periodicity is not a common 
problem, the possibility of its existence should be considered when undertaking a 
systematic sample. 

Other Probability Sampling Techniques 

A number of more advanced probability sampling methods are also available. 
Among them are the stratified and cluster sampling approaches. 

In the stratified sample, the population is divided into layers, or strata; then 
a simple random sample of members from each stratum is selected. Strata mem- 
bers have the same percentage representation in the sample as they do in the 
population, so this approach is sometimes referred to as the proportionate strati- 
fied sample. Part A of Table 4.2 is an example of the stratified sampling approach 
in which the stratification has been done on the basis of education level. The selec- 
tion of education as the variable on which to stratify could be important if our 
purpose is to measure the extent to which individuals provide charitable contri- 
butions to the performing arts. 

A sample can also be stratified on the basis of two or more variables at the 
same time. Part B of Table 4.2 is similar to part A, except that the stratification is 
based on two variables. This is referred to as a two-way stratified sample since the 
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TABLE 4.2 



Examples of stratified sam- 
pling using one variable 
(part A) and two variables 
(part B) for stratification. In 
each case, strata members 
have the same percentage 
representation in the 
sample as they do in the 
population, and sample 
members from each 
stratum are selected 
through the use of proba- 
bility sampling. 



A. Stratified Sample Based on One Variable: Highest Education Level 



Population: 
U.S. Employed Civilians 



Stratified Sample of 
n = 1000 



<high school graduate 


10,466,000 




95 




high school graduate 


34,011,000 


>- 


307 








some college 


31,298,000 




283 




college graduate 


34,874,000 




315 





Total = 1 10,649,000 Total = 1000 

Persons with a college diploma comprise 31.5% of the population. They will also 
comprise 31.5% of the sample. 



B. Stratified Sample Based on Two Variables: Education and Gender 



Population: 
U.S. Employed Civilians 
Male Female 



Stratified Sample of 
n = 1000 
Male Female 



<high school graduate 


6,397,000 


4,069,000 


high school graduate 


18,188,000 


15,823,000 


some college 


15,613,000 


15,685,000 


college graduate 


19,033,000 


15,841,000 



58 


37 


164 


143 


141 


142 


172 


143 



Total = 110,649,000 



Total = 1 000 



Males with some years of college comprise 14.1 % of the population. They will also 
comprise 14.1 % of the sample. 



Source: Based on data in Bureau of the Census, U.S. Department of Commerce, Statistical Abstract of the 
United States 2002, p. 385. 



sample has been forced to take on the exact percentage breakdown as the popu- 
lation in terms of two different measurements or characteristics. 

The stratified sample forces the composition of the sample to be the same as 
that of the population, at least in terms of the stratification variable(s) selected. 
This can be important if some strata are likely to differ greatly from others with 
regard to the variable(s) of interest. For example, if you were doing a study to 
determine the degree of campus support for increased funding for women's sports 
programs, you might be wise to use a sample that has been stratified according to 
male/female so that your sample has the same percentage male/female breakdown 
as the campus population. 

The cluster sample involves dividing the population into groups (e.g., based 
on geographic area), then randomly selecting some of the groups and taking 
either a sample or a census of the members of the groups selected. Such an 
approach is demonstrated in Figure 4.3. As in the stratified sample, members of 
the population may not have the same probability of inclusion, but these proba- 
bilities could be determined if we wished to exert the time and effort. 



Nonprobability Sampling 

In nonprobability sampling, not every unit in the population has a chance of 
being included in the sample, and the process involves at least some degree of 
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FIGURE 4.3 



The five segments of a geographic region 




The cluster sample involves 
dividing the population into 
groups or areas, then ran- 
domly selecting some of the 
groups or areas and taking 
either a sample or a census 
of each. Shown here is an 
example of how this 
approach would apply to a 
city that has been divided 
into areas A through E. 



I 



Randomly selected areas: A, C, E 




Sample 
or census 



personal subjectivity instead of following predetermined, probabilistic rules for 
selection. Although less important for the purposes of this text, such sampling 
can be useful in small-scale, exploratory studies where one wishes simply to gain 
greater familiarity with the population rather than to reach statistical conclu- 
sions about its characteristics. We will briefly examine four types of nonproba- 
bility samples: 

CONVENIENCE SAMPLE Members of such samples are chosen primarily because 
they are both readily available and willing to participate. For example, a fellow stu- 
dent may ask members of your statistics class to fill out a questionnaire for a study 
she is doing as an assignment in her sociology course. So-called person-on-the- 
street interviews by television reporters are often identified as "random," although 
they are usually conducted at shopping areas near the station or at other locations 
selected primarily because they happen to be convenient for the interviewer and 
camera crew. 

QUOTA SAMPLE This is similar to the stratified probability sample described pre- 
viously, except that members of the various strata are nor chosen through the use 
of a probability sampling technique. For example, if the sample is to include 30 
undergraduates, all 30 could arbitrarily be selected from the marching band. 
Although it looks good on paper (e.g., the male/female percentage breakdowns 
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A Sample of Sampling by Giving Away Samples 



A sampling approach used by a drug company ended up 
giving a federal judge a headache. The company had 
claimed its brand of pain reliever to be recommended by 
70,000 doctors. 

It seemed that the firm had provided free samples of its 
product to 250,000 physicians, with about half of them 
returning a postcard that entitled them to receive still more 
free samples. The company then surveyed 404 of those 
who had returned the postcard, and 65% of them said 
they recommended use of this brand of pain reliever. 



Applying the 65% figure to all of those who had returned 
the postcard, the company came up with its claim that the 
product was recommended by 70,000 doctors. 

Although some might view the company's claim as 
technically correct, the judge felt it was misleading and for- 
bade the company from engaging in further deceptive 
advertising. 

Source: Based on William Power, "A Judge Prescribes a Dose of Truth To Ease 
the Pain of Analgesic Ads," Wall Street Journal, May 13, 1987, p. 33. 



are the same for the population and the sample), this type of sample is far inferior 
to the stratified sample in terms of representativeness. 

PURPOSIVE SAMPLE In this sample, members are chosen specifically because 
they're not typical of the population. For example, the maker of a new pocket cal- 
culator may submit prototypes to members of an advanced doctoral seminar in 
mathematics at the Massachusetts Institute of Technology. If these individuals can- 
not figure out how to operate it, the calculator should obviously be simplified. 
Conversely, if a sample of individuals known to have IQs below 80 are able to oper- 
ate it, it should present no problem to the general population. 

JUDGMENT SAMPLE This sample is selected on the basis that the researcher 
believes the members to be representative of the population. As a result, the 
representativeness of such a sample is only as good as the judgment of the per- 
son who has selected the sample. The president of a manufacturing firm may 
really believe that products produced by the 1 1 :00 p.m.-7:00 a.m. shift are typi- 
cal of those produced by all three work shifts. This belief is all that is necessary 
for a sample of output from the 1 1-7 shift to fall into the category of a judg- 
ment sample. 



As this chapter has pointed out, statistically generalizing to the population on 
the basis of a nonprobability sample is not valid. Statistics in Action 4.2 describes 
how a company used a rather bizarre nonprobability sampling approach, then 
tried to sell the results as accurately describing the population. 




4.38 Attendees at an industrial trade show are given the 
opportunity to fill out a card that entitles them to receive a 
free software package from the sponsoring firm. The cards 
are placed in a bin, and 30 winners will be drawn. Consid- 
ering the 30 winners as a sample from all of the persons 
filling out such cards, is this a probability sample or a non- 



probability sample? What specific type of probability or 
nonprobability sample does this procedure represent? 

4.39 To obtain a sample of the invoices over the past 
year, a clerk uses the computer to select all invoice num- 
bers that are a multiple of 13. If the invoice numbers are 
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sequential, beginning with 1, what type of sample does 
this represent? 

4.40 Differentiate between a judgment sample and a con- 
venience sample. 

4.41 In what way are the quota sample and the stratified 
sample similar? In what way are they different? What 
effect does this difference have on the applications for 
which such samples might be appropriate? 

4.42 What is the difference between a probability sample 
and a nonprobability sample? Which type is necessary if 
we wish to statistically generalize from the sample to the 
population? 

4.43 For each of the following situations, would you rec- 
ommend a sample or a census? Explain your reasoning in 
each case. 

a. A purchasing agent has just received a shipment of 
shock-resistant watches and wants to find out 
approximately how far they can be dropped onto a 
concrete surface without breaking the crystal. 

b. General Mills wants to learn the age, gender, and 
income characteristics of persons who consume 
Cheerios breakfast cereal. 

c. The producers of NBC's Tonight show want to find 
out what percentage of TV viewers recognize a photo 
of host Jay Leno. 

d. A small company wants to determine whether U.S. 
companies that manufacture nuclear submarines 
might be interested in a new technique for purifying 
air when such craft are submerged. 

4.44 Using the random number table in Appendix A, 
select a simple random sample of 5 from the 50 states of 
the United States. 

4.45 What is meant by periodicity, and how can this pre- 
sent a problem when taking a systematic sample? Provide 
an example of a systematic sampling situation where peri- 
odicity could be a factor. 



4.46 The manufacturer of a fast-drying, highly effective 
glue is concerned about the possibility that some users 
might accidentally glue their fingers together, raising 
potential legal problems for the firm. Could purposive 
sampling help determine the extent to which people 
might use the glue improperly? If so, describe the compo- 
sition of at least one purposive sample that could be 
employed for this purpose. 

/ data Set / Note: Exercises 4.47-4.49 require a 
computer and statistical software capable of selecting 
random or systematic samples. 

4.47 Given the 250 household utility expenditures in 
Exercise 3.65 (file XR03065), generate a simple random 
sample consisting of 30 data values. 

4.48 Given the honeymoon expenditures for the 

300 couples in Exercise 3.66 (file XR03066), generate a 
simple random sample consisting of 30 data values. 

4.49 A study found that the average American buys 4.6 
movie tickets per year, compared to just 1.9 for the aver- 
age Spaniard. Assuming that the data values in file 
XR04049 represent the number of movie ticket purchases 
reported by 100 American respondents, source: Anne 

R. Carey and Quin Tian, "Irish Eyes Watching Movies," 7999 USA Snapshot 
Calendar for June 16. 

a. Select a simple random sample of 20 data values. 
Compute the sample mean. 

b. Select another simple random sample of 20 data 
values, and compute the mean. 

c. To two decimal places, compare the mean of the sam- 
ple in part (a) with the mean of the sample in part (b). 
Are the values identical? Would you expect them to 
be? How do they compare to the mean of the 100 
data values from which the samples were drawn? 
(We will learn more about this topic in Chapter 8, 
Sampling Distributions.) 



SUMMARY 4.8 



• Types of business research studies 

Business research studies can be categorized according to the objective involved 
and may be exploratory, descriptive, causal, or predictive. Prior to beginning the 
study, the problem or question should be defined in specific terms that can be 
answered by research. 

• Survey research methods, data, and error sources 

In business research, the data collected can be either primary or secondary. Pri- 
mary data are generated by the researcher for the problem or situation at hand, 
while secondary data have been collected by someone else for some other pur- 
pose. Secondary data can be classified as internal or external, depending on 
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whether the data have been generated from within or outside the researcher's firm 
or organization. Primary data tend to require more time and expense, but have 
the advantage of being more applicable to the research problem or situation. 

Survey, experimental, and observational research are among the most popular 
methods for collecting primary data. Surveys may be carried out by mail ques- 
tionnaires, personal interviews, and telephone interviews. The questionnaire, or 
data collection instrument, will contain any of three types of questions: multiple 
choice, dichotomous, and open-ended. In survey research, there are three major 
sources of error: sampling error, response error, and nonresponse error. 

• Experimental and observational studies 

Experimental studies attempt to identify cause-and-effect relationships between 
variables. Evaluation of the results of an experimental study involves the assess- 
ment of both internal and external validity. Observational studies emphasize what 
individuals do rather than what they say, with behavior observed and recorded on 
a data collection instrument resembling the questionnaire used in survey research. 

• Internal secondary data and data mining 

Internal sources of secondary data may take a variety of forms within the re- 
searcher's firm or organization, with accounting and financial records being espe- 
cially useful. The many sources of secondary data include government agencies, 
publications, commercial suppliers, and trade associations. The data warehouse is 
a large collection of data put together to apply the sophisticated analytical tools 
of data mining to identify patterns and interrelationships in the data. More than 
ever, the Internet is an indispensable source of information and expertise. 

• Sampling can be better than a census 

When a population contains only a small number of elements, a complete census 
of its members is easily done. In practice, however, most studies involve relatively 
large populations for which the taking of a sample can provide satisfactory results 
much more quickly and at considerably lower cost. In some cases, the mea- 
surement process can be destructive, necessitating the use of sampling instead of 
a census. Because of the time and money it saves, a sample can even be more accu- 
rate than a census. This occurs because some of the limited funds available for the 
study can be used to improve the measurement process itself, thereby reducing 
nonsampling error. 

• Sampling and nonsampling errors 

Error is the difference between the actual value of the population parameter (e.g., 
/i- or tt) and the value of the sample statistic (e.g., ~x or p) that is being used to 
estimate it. Sampling error is a random, nondirectional error that is inevitable 
whenever a sample is taken instead of a census, but the amount of this error can 
be estimated whenever a sample is of the probability type. For probability sam- 
ples, sampling error can be reduced by increasing the sample size. Nonsampling 
error is a tendency toward error in one direction or the other and can be present 
even if a complete census is taken. 

• Probability and nonprobability sampling 

In probability sampling, each person or element in the population has some 
(nonzero) known or calculable chance of being included in the sample. In the sim- 
ple random sample, each person or element has the same chance for inclusion. 
Other probability samples discussed in the chapter are the systematic sample, the 
stratified sample, and the cluster sample. 

In nonprobability sampling, not everyone in the population has a chance of 
being included in the sample, and the process involves at least some degree of 
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personal subjectivity instead of predetermined, probabilistic selection rules. Such 
samples can be useful in small-scale, exploratory studies where the researcher 
does not intend to make statistical generalizations from the sample to the popula- 
tion. Nonprobability techniques discussed in the chapter include convenience, 
quota, purposive, and judgment sampling. 




4.50 The Sonic Travel Agency has not kept pace with the 
current boom in family vacation air travel between the 
agency's northern city and sunny Florida. For each of the 
following, indicate whether the associated data are 
primary or secondary. If the data are secondary, further 
indicate whether they are internal or external. 

a. Sonic's research department conducts a survey of past 
customers to determine their level of satisfaction 
with the agency's service. 

b. The Florida Bureau of Tourism sends travel agencies 
across the nation the results of a study describing 
the characteristics of winter vacationers to the state, 
including where and how long people tended to stay. 

c. To get a better grasp of the problem, the president of 
Sonic examines the company's annual reports for the 
past 10 years. 

4.51 A labor union official has considered three possible 
negotiating stances that the union might take during 
upcoming contract negotiations. Members of the rank and 
file, on the other hand, could differ in terms of the approach 
they would most like to see. The union official is concerned 
with properly representing his membership in talks with 
company representatives. For this situation, formulate a 
specific question that research might be able to answer. 

4.52 For each of the following report titles, indicate 
whether the study involved was exploratory, descriptive, 
causal, or predictive, and explain your reasoning. 

a. "The Popularity of Bowling as a Participant Sport 
in Ohio" 

b. "The Effects of TV Violence on Crime in Small Towns" 

c. "A Preliminary Investigation of the Market for 
Pet Foods" 

d. "Family Vacationing in the 21st Century" 

4.53 In an attempt to measure the availability of 
component parts for a new product line, a manufacturer 
mails a questionnaire to 15 of the 25 companies that 
supply these components. Past experience with similar 
surveys has shown that such companies tend to exagger- 
ate their ability to supply desired components on time 
and in the quantities desired. Of the companies surveyed, 
only 8 complete and return the questionnaire. Given this 
scenario, identify possible sources of sampling error, 
response error, and nonresponse error. 



4.54 To collect information on how well it is serving its 
customers, a restaurant places questionnaire cards on the 
tables, with card deposit boxes located near the exit. 

a. Do you think that nonresponse error might influence 
the results of this effort? In what way(s)? 

b. In addition to being a source of information to 
management, are the cards providing any other 
benefits? In what way(s)? 

4.55 A mail survey of junior executives is designed to 
measure how much time they spend reading the Wall 
Street Journal each day and how they feel about the 
newspaper's content and layout. Formulate one question 
of each of the following types that might be included in 
the questionnaire: (a) multiple-choice; (b) dichotomous; 
(c) open-ended. 

4.56 The curator of a fossil museum finds there were 
1450 visitors in May and 1890 visitors in June. At the 
beginning of June, he had hired Bert McGruff, a popular 
local athlete, to serve as a tour guide. Mr. McGruff, 
pointing out the increase in attendance, has demanded a 
raise in pay. Considering this situation as an experiment: 

a. Identify the dependent and independent variables in 
this experiment. 

b. Are there any extraneous variables that might have 
influenced this experiment? 

c. Given your response to part (b), evaluate the internal 
validity of the experiment. Does Bert's request for a 
raise seem to be justified? 

4.57 Timetech, Inc., has been experimenting with 
different approaches to improving the performance of its 
field sales force. In a test involving salespersons in Maine, 
the company found that giving sales leaders a free snow- 
mobile caused a sales increase of 25% in that state. The 
experiment in Maine was so successful that company 
executives are now planning to increase sales nationwide 
by awarding free snowmobiles to sales leaders in all 

50 states. Critique the external validity of this experiment. 

4.58 A security officer uses a one-way mirror to watch 
department store shoppers and identify potential 
shoplifters. In addition to observing whether a person is 
attempting to steal merchandise, what other personal or 
behavioral characteristics can be observed in this setting? 
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4.59 Using either the Internet or the Encyclopedia of 
Associations, identify a group or organization that might 
be able to supply information on 

a. the popularity of bowling. 

b. child abuse in the United States. 

c. industrial accidents. 

d. antique and classic cars. 

4.60 When customers purchase goods at a Radio Shack 
store, they are typically asked for their name and phone 
number so their purchase information can be stored. This 
would be one portion of the vast internal and external 
data available to the company. How might a company 
like Radio Shack utilize data warehousing and data 
mining? 

4.61 In general, what are some of the considerations in 
evaluating the suitability of data that have been generated 
by someone else? For what types of sources should 
secondary data be regarded with special skepticism? 

4.62 Shirley will be graduating soon and is anticipating job 
offers from several national-level corporations. Depending 
on the company she joins, she could end up living in any of 
four different locations in the United States. 

a. What are some of the ways that Shirley could utilize 
the Internet for information to help her decide which 
company she would most prefer to join? 

b. What are some of the ways that company personnel 
could use the Internet to find out more about Shirley 
and her suitability for their company? Would you 
consider any of these methods to be unethical? If so, 
which one(s) and why? 

Note: Exercises 4.63-4.66 require a computer and access 
to the Internet. Web addresses were accurate at this writ- 
ing. If a site address is no longer applicable, use a search 
engine (e.g., google.com) to identify the company or orga- 
nization's new home page. 

4.63 Visit the U.S. Bureau of the Census ( http:// 
www.census.gov ), and find the following descriptive 
statistics for Kalamazoo County, Michigan: 

a. Of those who are 25 or older, the percentage who 
are high school graduates 

b. The number of persons who are 65 or older 

c. The median household income 

4.64 Using the U.S. Bureau of the Census ( http:// 
www.census.gov ) as your starting point, use the links it 
provides and find the most recent population total for 
Canada. 

4.65 One of the capabilities on the Internet is called 
"reverse lookup." Enter the residential telephone number 
for someone you know. Evaluate the accuracy of the 
name and address provided, along with the validity of the 
map that can be generated as additional information. 
Was the reverse lookup successful in identifying the 
person's e-mail address as well? 



4.66 Visit google.com and search discussion groups for 
postings that pertain to Sony digital cameras. Given the 
kinds of discussions that take place in such settings, what 
benefits could Sony obtain by having someone within the 
company "lurk" (i.e., observe without participating) in 
one or more groups where digital cameras are being 
discussed? 

4.67 A researcher wants to find out to what extent local 
residents participate in the recreation and entertainment 
activities available in their community. She selects a sim- 
ple random sample of numbers from the telephone direc- 
tory and proceeds to call these numbers between 6 p.m. 
and 9 p.m. If the line is busy or if there is no answer, she 
uses the next number on her list as a replacement. Com- 
ment on both the randomness and the representativeness 
of the sample of persons who end up being interviewed. 

4.68 Researchers at a university with an enrollment of 
12,000 take a census of the entire student population, 
asking, "Shoplifting is not only illegal, it raises the prices 
that we all have to pay. Have you ever shoplifted from 
the university bookstore?" Based on the preceding, 
discuss the possible presence of (a) sampling error and 
(b) nonsampling error. 

4.69 According to the U.S. Bureau of the Census, 28% of 
U.S. adults are college graduates. In a stratified sample of 
200 U.S. adults, approximately how many persons in 
each of the following categories should be questioned: 

(a) college graduate and (b) not a college graduate? 
source: Statistical Abstract of the United States 2006, p. 149. 

4.70 To test the effectiveness of a new type of plastic 
handcuffs, a law enforcement agency puts the cuffs on a 
sample of 30 persons who belong to the local weightlift- 
ing club. None is able to break free from the cuffs. What 
type of sampling technique does this represent? Explain 
your reasoning. 

4.71 Of a company's 1000 employees, 200 are managers, 
700 are factory employees, and 100 are clerical. In select- 
ing a sample of 100 employees, a researcher starts at the 
beginning of the alphabet and continues until she has col- 
lected the names of 20 managers, 70 factory employees, 
and 10 clerical workers. What type of sampling technique 
does this represent? Explain your reasoning. 

4.72 A college of business has 20 faculty members 

in Accounting, 30 in Marketing, 20 in Management, 15 
in Finance, and 15 in Information Systems. The dean of 
the college asks her secretary to arbitrarily select 4, 6, 4, 
3, and 3 persons from the respective departments to serve 
on a faculty committee. Is this a probability sample or a 
nonprobability sample? What specific type of probability 
or nonprobability sample does this procedure represent? 

/ data Set / Note: Exercises 4.73 and 4.74 require a 
computer and statistical software capable of selecting 
simple random samples. 
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4.73 In 2001, the average U.S. production employee 
worked 34.2 hours per week. An assistant in the human 
resources department of Acme Eyebolts, Inc., is curious as 
to how the workers in her company compared to this fig- 
ure during the first week of this month. The assistant will 
be reaching her conclusions based on a simple random 
sample of 30 data values out of the 300 Acme workers' 
weekly hours listed in file XR04073. Help the assistant 

by generating the simple random sample and computing 
its mean. Based on the sample, compare the Acme 
workweek with that of production workers nationally. 
source: Statistical Abstract of the United States 2002, p. 392. 

4.74 Unknown to a quality assurance technician, the 
tensile strengths (in pounds per square inch, psi) for all 



500 heavy-duty construction bolts in a recent shipment 
are as listed in file XR04074. Because a bolt must be 
broken to measure its strength, the testing process is 
destructive. The technician plans to collect a simple 
random sample of 20 bolts, then measure how much 
tension each one withstands before it breaks. Generate 
the simple random sample and compute its mean break- 
ing strength. Assuming that the bolt manufacturer has 
advertised that, on average, such bolts will withstand 
10,000 psi, refer to your sample result in commenting 
on the manufacturer's claim. (We will discuss the use of 
sample data in evaluating claims in much more detail in 
Chapter 10, Hypothesis Testing.) 



integrated case 




Thorndike Sports Equipment — Video Unit Two 



The first Thorndike video segment takes us back in time, 
to when Ted got his business degree and first joined his 
grandfather's company. At the time of this video, Luke 
isn't much in need of data mining — to reach the things at 
the bottom of his document heaps, he needs good old- 
fashioned shovel mining. 

To get a feel for the industry in which his grandfather 
operates, Ted wants to find out more about the sporting 
goods manufacturing business and collect some data on 
the competition that Thorndike Sports Equipment faces 
with some of its products, like racquetball and tennis rac- 
quets. Please give Ted some help by getting onto the Inter- 
net and providing him with the information requested 
here. Internet search engines and strategically selected key 
words will come in handy. 

1. Approximately how much money do Americans spend 
on sports equipment annually? If possible, identify 



which types of sports equipment are the biggest 
sellers. 

2. Who are some of the leading manufacturers of racquet- 
ball and tennis racquets? If possible, find out their rela- 
tive importance in terms of manufacturing volume or 
retail sales of these products. 

3. For two of the companies that manufacture equipment 
for racquet sports, visit the companies' websites and, if 
possible, read their most recent annual report. For each 
company, what have been the trends in overall com- 
pany sales and profitability over the most recent years 
reported? 

4. Select a company that makes racquetball racquets, 
then search discussion groups for comments that 
group participants may have made about the quality 
or reputation of racquets carrying that company's 
brand name. 



Sampling 

When this applet loads, it contains 100 blue circles. As you 
click on any 10 of them, their blue covers disappear and 
each circle changes to either red or green. After you have 
taken a sample of 1 0, click on Show All and the display at 
the top will show both the proportion of your sample that 
were red and the actual proportion of the population that 
were red. Of course, because of sampling error, it is very 
likely that your sample proportion will be different from the 
actual population value. Click the Reset button to return 
the circles to blue so you can try another sample. 

Applet Exercises 

3.1 Take 20 different samples and for each one record 
both the sample proportion and the actual population 



proportion. How many times was your sample proportion 
less than the actual population value? Equal to the actual 
population value? Greater than the actual population 
value? 

3.2 For each of the 20 different samples from Applet Exer- 
cise 3.1 , calculate the difference between your sample pro- 
portion and the actual population proportion. (Remember 
to retain the positive or negative sign associated with each 
difference.) What was the average difference for the sam- 
ples you took? 



Proportion Red = 0.4 Units Sampled = 10 



Maximum sample size reached 




Reset Show All 



Bumping Up Your Chances of Getting "Bumped" 

Among air travelers, it's common knowledge that airlines overbook flights to make 
up for no-shows. For the more clever air travelers, it becomes a challenge to get 
"bumped" from the flight in return for a free ticket, a voucher, or some other mone- 
tary goody. During just the first nine months of 2005, more than 460,000 passengers 
agreed to give up their seats on overbooked flights. 

There are fewer "no-shows" than in years past, and improved forecasting meth- 
ods have helped the airline companies more closely predict how many people will 
need to be "bumped" from a given flight. However, to improve your chances of being 
bumped and rewarded with free travel or other compensation, experts advise to travel 
during peak travel periods, book flights that are already nearly full, and favor heavily 
booked flights on relatively small-bodied aircraft, such as the 727s and 737s. It also 
helps to arrive early, ask the agent if the flight is fully booked, and tell her you are willing 
to consider being bumped. 

In 2006, about one of every 650 passengers was "bumped" from a flight. To 
boost your likelihood of becoming a "bumpee," follow these tips and pick up some 
more from the sources cited below. 



probability 

Facing 



, a loW-P 



Sources: Barbara De Lollis and Barbara Hansen, 
"Airlines Give Fliers Fewer Chances to Do the 
Bump," USA Today, December 20, 2005, p. 6B; 
www.Bestfares.com, July 26, 2006; 
www.BudgetTravel.com, July 26, 2006. 
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learning 
objectives 

After reading this 
chapter, you should 
be able to: 



Understand the basic concept of probability, including the classical, relative 
frequency, and subjective approaches. 

Interpret a contingency table in terms of the probabilities associated with the 
two variables. 

Determine the probability that an event will occur. 

Construct and interpret a tree diagram with sequential events. 

Use Bayes' theorem and additional information to revise a probability. 

Determine the number of combinations and the number of possible permuta- 
tions of n objects r at a time. 



5.1 INTRODUCTION 



Uncertainty plays an important role in our daily lives and activities as well as in 
business. In some cases, uncertainty even helps heighten our interest in these 
activities. For example, some sports fans refuse to watch the evening news when- 
ever they're going to view a taped replay of a sports event that occurred earlier in 
the day — the replay is more exciting if they don't know who won the game. 

In a business context, investment counselors cannot be sure which of two 
stocks will deliver the better growth over the coming year, and engineers try to 
reduce the likelihood that a machine will break down. Likewise, marketers may 
be uncertain as to the effectiveness of an ad campaign or the eventual success of a 
new product. The following situations provide other examples of the role of 
uncertainty in our lives: 

• In the last game of the World Series, the home team is down one run with the 
bases loaded and two out in the bottom of the ninth. The pitcher, whose 
batting average is only 0.105, is due up. The manager sends in a pinch hitter 
with a 0.320 batting average to hit for the pitcher. 

• Eddie is standing at an intersection, facing the "Don't Walk" light in a city 
where jaywalking carries a $20 fine. After looking around to be sure there are 
neither cars nor police, he crosses the street. 

• Arthur is 75 years old, has had three heart attacks, and works as a bomb 
disposal expert with his local police department. When he applies for life 
insurance, he is turned down as an unacceptable risk. 

In each of these cases, chance was involved. The baseball manager selected the 
batter who was more likely to get a hit. Eddie was trying to reduce the chance that 
something undesirable would happen to him. Arthur's life insurance company felt 
that his age, health, and occupation greatly reduced the likelihood of his surviving 
for many more years. 

We'll soon offer several more precise definitions, but for now let's just consider 
probability as "the chance of something happening." Although humans have been 
dealing with probabilities ever since cave dwellers discovered their chances for 
survival would be better if they carried spears, only within the past couple of 
centuries have probabilities really been a focal point of mathematical attention. 
Gamblers were among the first to take an interest in probabilities, followed by the 
insurance industry and its mortality tables that expressed the likelihood that a 
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person of a given age would live an additional year. More recently, probabilities 
have proven vital in many ways to both business and the social sciences. 

The next section considers the three basic approaches to probability. This will 
be followed by a discussion of unions and intersections of events, concepts that 
are fundamental to describing situations of uncertainty. Sections 5.4 and 5.5 will 
examine the rules for calculating probability under different circumstances. 

Sometimes, it is useful to revise a probability on the basis of additional infor- 
mation that we didn't have before. This will be the subject of Section 5.6. Finally, 
Section 5.7 will explain how permutations and combinations can help us calcu- 
late the number of ways in which things can happen. 



PROBABILITY: TERMS AND APPROACHES 5.2 



Basic Terms 

The discussion in this and later sections relies heavily on four important, interre- 
lated definitions: 

EXPERIMENT An activity or measurement that results in an outcome 

SAMPLE SPACE All possible outcomes of an experiment 

EVENT One or more of the possible outcomes of an experiment; a subset of the 
sample space 

PROBABILITY A number between 0 and 1 which expresses the chance that an 
event will occur 

The event in the preceding definitions could be almost anything of interest to 
the observer — e.g., rain tomorrow, a 30-point increase in the Dow Jones Indus- 
trial Average (DJIA) next week, or even that you'll take time out for a soft drink 
before finishing this page. As an example application, consider the possibility that 
the DJIA will increase by at least 30 points during the next week: 

• Experiment Next week's trading activities on the New York Stock Exchange 

• Sample space The sample space consists of two possible outcomes: (1) the 
DJIA goes up by at least 30 points, and (2) it does not. 

• Event A = the DJIA goes up by at least 30 points next week. 

• Probability The chance that the DJIA will increase by at least 30 points next 
week. Represented by P(A), the probability for this and any other event will 
have the following properties: 

0 < P(A) < 1 For any event, the probability will be no 
less than 0 and no greater than 1. 

P(A) + P(A') = 1 Either the event will occur (A) or it 
will not occur (A'). A' is called the 
complement of A. 

As indicated above, the probability for any event will be a number between 0 
and 1. These extreme values represent events that are either impossible or certain. 
For example: 

P(B) = 0 If B = you awake tomorrow morning and find yourself on Jupiter. 
P( C) = 1 If C = it will rain someplace in the world tomorrow afternoon. 



Part 2: Probability 

The Classical Approach 

The classical approach describes a probability in terms of the proportion of times 
that an event can he theoretically expected to occur: 

Classical probability: 

For outcomes that are equally likely, 

_. . .... Number of possible outcomes in which the event occurs 

Probability = : — 

lotal number of possible outcomes 

For example, in rolling a 6-sided die, the probability of getting a "three" is 

1 Out of 6 possible (and equally likely) 

outcomes, there is one in which a 



1 + 1 + 1 + 1 + 1 + 1 "three" occurs. 

Likewise, the probability of tossing a coin and having it land "heads" is -. In 
this case there are two possible (and equally likely) outcomes, only one of which 
includes the event "heads." Because we can determine the probability without 
actually engaging in any experiments, the classical approach is also referred to as 
the a priori ("before the fact") approach. Classical probabilities find their great- 
est application in games of chance, but they are not so useful when we are 
engaged in real-world activities where either (1) the possible outcomes are not 
equally likely or (2) the underlying processes are less well known. 

The Relative Frequency Approach 

In the relative frequency approach, probability is the proportion of times an event 
is observed to occur in a very large number of trials: 

Relative frequency probability: 

For a very large number of trials, 

Number of trials in which the event occurs 



Probability 



Total number of trials 



The relative frequency approach to probability depends on what is known as 
the law of large numbers: Over a large number of trials, the relative frequency 
with which an event occurs will approach the probability of its occurrence for a 
single trial. For example, if we toss a coin one time, the result will be either heads 
or tails. With just one toss, our relative frequency for heads will be either 0 or 1. 
If we toss it a great many times, however, the relative frequency will approach the 

0. 5 that it is theoretically "supposed to be." 

On a more serious note, the life insurance industry relies heavily on relative fre- 
quencies in determining both candidate acceptability and the premiums that must 
be charged. Life insurance premiums are based largely on relative frequencies that 
reflect overall death rates for people of various ages. Table 5.1 is a portion of a mor- 
tality table of the type used by the life insurance industry. Of particular interest: 

1. The death rate starts out relatively high for newborns, decreases slightly 
through the younger years, then steadily increases until (at the highest figure 
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Deaths per 1000 
Age Male Female 



0 


4.18 


2.89 


5 


0.90 


0.76 


10 


0.73 


0.68 


15 


1.33 


0.85 


20 


1.90 


1.05 


25 


1.77 


1.16 


30 


1.73 


1.35 



Deaths per 1000 
Age Male Female 



35 


2.11 


1.65 


40 


3.02 


2.42 


45 


4.55 


3.56 


50 


6.71 


4.96 


55 


10.47 


7.09 


60 


16.08 


9.47 


65 


25.42 


14.59 



Deaths per 1000 



Age 


Male 


Female 


70 


39.51 


22.11 


75 


64.19 


38.24 


80 


98.84 


65.99 


85 


152.95 


116.10 


90 


221.77 


190.75 


95 


329.96 


317.32 



TABLE 5.1 



Life insurance companies 
use mortality tables to 
predict how many policy 
holders in a given age 
group will die each year. 
When the death rate is 
divided by 1000, the result 
is the probability that a 
typical policyholder in the 
age group will pass away 
during the year. 



Source: American Council of Life Insurance, 1999 Life Insurance Fact Book, pp. 182-186. 



shown, 95 years) the likelihood of death within the coming year is about one 
chance out of three. 

2. Each death rate can be converted to a probability by dividing by 1000. The 
resulting probability isn't necessarily the exact probability of death for a 
specific individual but rather the probability for a "typical" individual in 
the age group. For example, a typical male policyholder of age 95 would 
have a 329.96/1000, or a 0.32996, probability of dying within the coming 
year. 

3. Examination of male versus female death rates shows the rate for males is 
higher for every age group. Especially noteworthy is the drastic increase in 
male death rates between the ages of 10 and 20, a period during which female 
death rates show a much smaller increase. 

The Subjective Approach 

The subjective approach to probability is judgmental, representing the degree to 
which one happens to believe that an event will or will not happen. Subjective 
probabilities can also be described as hunches or educated guesses. It has been esti- 
mated that wagers in 2000 amounted to $866 billion for all forms of gambling, 
ranging from casinos to bingo. 1 

Individuals place such bets only if they think they have some chance of win- 
ning. In a more negative direction, insurance customers paid $8.3 billion in fire 
insurance premiums during 2000. 2 They would not have paid such sums if they 
felt their buildings had a zero probability of catching fire. 

As the terms "hunch" and "educated guess" imply, subjective probabilities 
are neither based on mathematical theory nor developed from numerical analyses 
of the frequencies of past events. This doesn't make them meaningless, however. 
For example, the following subjective probabilities might be considered reason- 
able with regard to your activities today: 

Event Subjective Probability 

You will be mugged while reading this page. Low (Prob. < 0.001) 

You'll talk to someone on the phone today. Medium (Prob. > 0.600) 

Your shoes are on the right feet. High (Prob. > 0.999) 

Source: www.OnPointRadio.org, July 26, 2006. 

^Source: Bureau of the Census, Statistical Abstract of the United States 2002, p. 142. 
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As intelligent human beings, we are the sum total of our experiences and 
memories, and these enable us to make intuitive estimates of the probabilities of 
events with which we are somewhat familiar. Such probabilities, though subjective, 
can be important in drawing conclusions and making decisions. Although you 
may not be able to precisely quantify the probability that the fast-talking salesper- 
son in the plaid suit is really offering you a sound used car, this shouldn't prevent 
you from assigning a low probability to such an event. 

Probabilities and "Odds" 

The term odds is sometimes used as a way of expressing the likelihood that some- 
thing will happen. When someone says "the odds for this outcome are three to 
one," she is really saying that the chance of the event occurring is three times the 
chance that it will not occur. In this case, the odds might be expressed as "three to 
one" or "3:1." When you see odds listed in reference to an event, you can convert 
them to a probability as follows: 

Conversion from odds to probability, and vice versa: 

1. If the odds in favor of an event happening are A to B, or A:B, the probability 
being expressed is 

A 

A + B 

2. If the probability of an event is x (with 0 S x < 1), the odds in favor of the 
event are "x to (1 — x)," or "x:(l — x)." 

Odds are typically expressed in terms of the lowest applicable integers. For 
example, "1.5 to 4" would become "3 to 8." Also, the odds against the occur- 
rence of an event are the reverse of the odds in favor. For example, if the odds in 
favor of an event are 2 to 3, the odds against would be 3 to 2. 

Like the other probabilities we've discussed, odds can be classical, based on 
relative frequencies, or even subjective. For example: 

• Classical In flipping a fair coin, the probability of heads is 0.5. The resulting 
odds are "0.5 to 0.5," or "1 to 1." 

• Relative frequency For example, 16.1% of U.S. households have a cell 
phone with Internet access, a probability of 0.161 for a randomly selected 
household. 3 If the event is "has cell phone with Internet access," the odds in 
favor of the household having this device would be 0.161 to 0.839, or 
approximately 1 to 5. 

• Subjective In speculation about the upcoming 2000 major league baseball 
season, an oddsmaker assigned odds that each team would end up winning 
the World Series. The New York Yankees were given odds of 1 to 3 of 
winning, which translates into a subjective probability of 1/(1 + 3), or 0.25. 
The odds given the Milwaukee Brewers were much less favorable — just 1 to 
10,000,000. 4 



^Source: Adrienne Lewis, "Early Adopter Technologies," USA Today, October 17, 2005, p. 2B. 
4 Source: "World Series Odds," VSA Today, March 14, 2000, p. 11C. 
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2. 



As discussed earlier, when presenting the odds against an event happening, the 
odds are listed in the opposite order. For example, the published odds against 
the Yankees were "3:1." For purposes of consistency with our previous exam- 
ples, we've converted the published version to the "odds for" order. 
In sports events, points are often added to or subtracted from the score of one 
of the teams (creating a "point spread") in an attempt to convert the odds 
to 1:1. 



N 


0 


T 


E 





5.1 The president of a computer manufacturing firm 
states that there is a 70% chance that industry shipments 
of notebook computers will double in the next 5 years. Is 
this a classical probability, a relative frequency probabil- 
ity, or a subjective probability? 

5.2 If A = "Aeroflot, Russia's national airline, will begin 
making daily flights to Pocatello, Idaho, next year," iden- 
tify the sample space and possible events that could occur. 
Use your judgment in arriving at an approximate proba- 
bility for the event(s) and complement(s) you've identified. 

5.3 If B = "IBM's net profit will increase next year," iden- 
tify the sample space and possible events that could occur. 
Use your judgment in arriving at an approximate proba- 
bility for the event(s) and complement(s) you've identified. 

5.4 Electric meters on private homes in a community reg- 
ister the cumulative consumption of kilowatt-hours with 
a five-digit reading. If a home in the community were ran- 
domly selected, it's possible that the right-hand digit 
(which spins the fastest) would be a "7." Using the con- 
cept of classical probability, what is the probability that 
the digit is a "7"? 

5.5 Regarding Exercise 5.4, how might an observer 
employ the law of large numbers to verify the classical 
probability that you identified? 



5.6 If a die is rolled one time, classical probability would 
indicate that the probability of a "two" should be 1 . 

If the die is rolled 60 times and comes up "two" only 
9 times, does this suggest that the die is "loaded"? Why 
or why not? 

5.7 A newspaper article reported that Massachusetts had 
become the second state (after Montana) to prohibit sex 
to be used as a factor in determining insurance 
premiums. Considering the information in Table 5.1, 
would such unisex insurance tend to increase or decrease 
the life insurance premiums previously paid (a) by men? 
(b) by women? source: John R. Dorfman, "Proposals for Equal Insur- 
ance Fees for Men and Women Spark Battle," Wall Street Journal, August 
27, 1987, p. 23. 

5.8 It has been reported that about 35% of U.S. adults 
attended a sports event during the previous year. What 
are the odds that a randomly selected U.S. adult attended 
a sports event during the year? source: Bureau of the Census, 
Statistical Abstract of the United States 2006, p. 79 1 . 

5.9 If the odds are 4:7 that an event will occur, what is 
the corresponding probability? 



UNIONS AND INTERSECTIONS OF EVENTS 5.3 



As discussed previously, an event is one or more of the possible outcomes in an 
experiment, and the sample space is the entire set of possible outcomes. Sample 
spaces and events may be portrayed in either a table or a visual display, and these 
representations can contribute to our discussion of the probability concepts in 
sections to follow. 

The tabular method involves the contingency table, or cross-tabulation, intro- 
duced in Chapter 2. This shows either frequencies or relative frequencies for two 
variables at the same time. The visual mechanism, called a Venn diagram, simul- 
taneously displays both the sample space and the possible events within. We will 
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use the contingency table and the Venn diagram in demonstrating several key 
terms that are important to the consideration of probability: 

MUTUALLY EXCLUSIVE EVENTS If one event occurs, the other cannot occur. 
An event (e.g., A) and its complement (A) are always mutually exclusive. 

EXHAUSTIVE EVENTS A set of events is exhaustive if it includes all the possible 
outcomes of an experiment. The mutually exclusive events A and A' are exhaustive 
because one of them must occur. When the events within a set are both mutually 
exclusive and exhaustive, the sum of their probabilities is 1 .0. This is because one of 
them must happen, and they include all of the possibilities. The entries within a rela- 
tive frequency contingency table are mutually exclusive and exhaustive, and their 
sum will always be 1 .0. 

INTERSECTION OF EVENTS Two or more events occur at the same time. Such 
an intersection can be represented by "A and B," or "A and B and C," depending 
on the number of possible events involved. 

UNION OF EVENTS At least one of a number of possible events occurs. A union 
is represented by "A or B," or "A or B or C," depending on the number of events. 

The contingency tables in parts (1) and (2) of Table 5.2 show frequencies and 
relative frequencies describing the sex and age of persons injured by fireworks in 
1995. The relative frequencies in part (2) are obtained by dividing the corre- 
sponding entries in part (1) by 11,449, the total number of persons injured by 
fireworks during that year. For example, part (1) shows that 3477 of those 
injured were males under the age of 15. The relative frequency for persons in this 
sex/age category is 3477/11,449, or 0.304, as shown in the contingency table in 
part (2). 



TABLE 5.2 


(1) Frequencies 








Frequency and relative 
frequency contingency 
tables for fireworks victims 




Age 






B 

Under 15 


B' 

15 or Older 






Sex A Male 


3477 


5436 


8913 




A' Female 


1249 


1287 


2536 






4726 


6723 


1 1 ,449 




(2) Relative Frequencies 




Age 








B 

Under 15 


B' 

15 or Older 






Sex A Male 


0.304 


0.475 


0.779 




A' Female 


0.109 


0.112 


0.221 






0.413 


0.587 


1.000 




Source: U.S. Consumer Product Safety Commission, National Electronic Injury Surveillance System. 
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Referring to the frequencies in part (1) of Table 5.2, we can identify several 
that illustrate the terms defined earlier in the section: 

• Mutually exclusive events The victims were either male (event A, 8913 
persons) or female (event A', 2536 persons). 

• Exhaustive events The four mutually exclusive events (A and B), (A and B'), 
(A' and B), and (A' and £>') are also exhaustive since a victim must be in one 
of these sex/age categories. 

• Intersection of events There were 3477 victims who were males under the 
age of 15, the "A and B" category. 

• Union of events There were 10,162 victims who were either male, or under 
age 15, or both, the "A or B" category. This category includes everyone 
except females who were 15 or older and its frequency is calculated as 5436 + 
3477 + 1249 = 10,162. 

Figure 5.1 presents a series of Venn diagrams that also describe the age/sex 
breakdown of the fireworks victims. The information is the same as that con- 
tained in the contingency table of Table 5.2, part (1), but in a more visual form. 
Examples of calculations for finding the number of persons in the union or the 
intersection of events are shown in the figure, and the results are identical to those 
based on the contingency table of frequencies. 



1. Complement of an event, A ' 




2536 



FIGURE 5.1 



Venn diagrams describing 
persons injured by fireworks 



A = Injured person was a male (8913 persons 

in this category). 
A' = Injured person was not a male (2536 

persons in this category). 

A' = Complement of A. 



2. Intersection of two events, (A and B) 

(A' and B 1 ) 
1287 



5436 


(A and B) 










1249 



(A and B) = Injured person was a male under 
15 (3477 persons in this category). 

(A' and B') = Injured person was neither a male 
nor under 15 (1287 persons in this 
category). 



3. Union of two events, (A or B) 







(A'andB 1 ) 






1287 








(A and B) 






Mil 








1249 g 



(A or B) = Injured person was either a male, 
or under 15, or both (5436 + 3477 
+ 1249 = 10,162 persons in this 
category). 
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5.10 A sample space includes the events A, B, and C. 
Draw a Venn diagram in which the three events are 
mutually exclusive. 

5.11 A sample space includes the events A, B, and C. 
Draw a Venn diagram in which events A and B are 
mutually exclusive but events B and C are not mutually 
exclusive. 

5.12 The following contingency table of frequencies is 
based on a 5-year study of fire fatalities in Maryland. 
For purposes of clarity, columns and rows are identified 
by the letters A-C and D-G, respectively, source: National 
Fire Protection Association, The 1984 Fire Almanac, p. 151. 







>< 


.iJ 1 


b. 


Identify the region of the Venn dia 


yam 


that 


c. 


represents (N and D). 

Identify the region of the Venn dia; 


51-am 


that 


d. 


represents (N' and D'). 

Identify the region of the Venn dia 


yarn 


that 


c. 


represents (N or D). 

Identify the region of the Venn dia 

represents (N' or D'). 


yarn 


that 










Blood Alcohol 










Level of Victim 








A 


6 


C 


Age 


0.00% 


0.01-0.09% 


5=0.10% 


D 


0-19 


142 


7 


6 


E 


20-39 


47 


8 


41 


F 


40-59 


29 


8 


77 


G 


60 or over 


47 


7 


35 






265 


30 


159 



b. 



155 
96 

114 
89 

454 



For this table, identify any two events that are 
mutually exclusive. 

For this table, identify any two events that intersect. 



5.13 Using the table in Exercise 5.12, how many victims 
were in the category described by: 

a. (A and A')? b. (CorF)? 
c. (A'andG')? d. (B or G')? 

5.14 Using the table in Exercise 5.12, what is the proba- 
bility that a randomly selected victim would have been at 
least 60 years old and have had a blood alcohol level of 
at least 0.10%? 

5.15 The following table represents gas well completions 
during 1986 in North and South America, source: American 
Gas Association, 1987 Gas Facts, p. 50. 



5.16 Using the table in Exercise 5.15, assume that one 
well has been selected at random from the 48,673. 

a. What is the probability that the well was drilled in 
North America and was dry? 

b. What is the probability that the well was drilled in 
South America and was not dry? 

5.17 The owner of a McDonald's restaurant in France is 
considering the possibility of opening up a new franchise 
at the other end of her town. At the same time, the man- 
ager of the Pennsylvania Turnpike is deciding whether to 
recommend raising the speed limit by 10 miles per hour. 
Draw a single Venn diagram representing their possible 
decisions. Explain any assumptions you have made 
regarding mutual exclusivity of events. 

5.18 A shopping mall developer and a historical society 
are the only two bidders for a local historical landmark. 
The sealed bids are to be opened at next week's city 
council meeting and the winner announced. Draw a 
single Venn diagram describing purchase/nonpurchase of 
the landmark by these two parties. Explain any assump- 
tions you have made regarding mutual exclusivity of 
events. 

5.19 According to data from the U.S. Energy Informa- 
tion Administration, the 60.0 million U.S. households 
with personal computers were distributed as shown here 
with regard to geographic location and access to the 
Internet. The entries represent millions of households. 
What is the probability that a randomly selected 
computer household would be in the category described 
by (South or Midwest or Yes)? In the category (West and 
No) ? source: Bureau of the Census, Statistical Abstract of the United 
States 2006, p. 638. 







D 


D' 






Dry 


Not Dry 


N 


North America 


14,131 


31,575 


N' 


South America 


404 


2,563 






14,535 


34,138 



45,706 
2,967 

48,673 



a. Draw a Venn diagram that summarizes the informa- 
tion in the table. 



Access 
to 

Internet? 


North- 
east 


Region 
Midwest 


South 


West 




Yes 


9.7 


11.8 


16.9 


12.2 


50.6 


No 


1.2 


2.3 


3.8 


2.1 


9.4 




10.9 


14.1 


20.7 


14.3 


60.0 
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5.20 There are 100 males and 120 females in the gradu- 
ating class of a local high school. Thirty-five percent of 
the graduating males are on school sports teams, as are 
30% of the graduating females. A local businessperson 
is going to donate $1000 to the favorite charity of a 



randomly selected graduate. What is the probability that 
the graduate selected will be a male who is not a mem- 
ber of a sports team? What is the probability that the 
graduate will be either a female or a member of a sports 
team? 



ADDITION RULES FOR PROBABILITY 



There are occasions where we wish to determine the probability that one or more 
of several events will occur in an experiment. The determination of such proba- 
bilities involves the use of addition rules, and the choice of a given rule will 
depend on whether the events are mutually exclusive. To illustrate these rules, we 
will use the two Venn diagrams in Figure 5.2. 



When Events Are Mutually Exclusive 

When events are mutually exclusive, the occurrence of one means that none of the 
others can occur. In this case, the probability that one of the events will occur is 
the sum of their individual probabilities. In terms of two events, the rule can be 
stated as follows: 

Rule of addition when events are mutually exclusive: 

P(A or B) = P(A) + P(B) 

Part (1) of Figure 5.2 is a relative frequency breakdown of colors of 
sport/compact cars made in North America during the 2004 model year. As the 



1. Mutually exclusive events: Colors of sport/compact cars 







A 






B 

0.63 




0.17 







A = Silver 

B = Red 

(For 63% of the sport/compact 
cars, the color was something 
other than silver or red.) 



2. Nonmutually exclusive events: Injuries from fireworks 









0.475 


0.304 








0.109 



0.112 



A = Injured person was a male. 

B = Injured person was under age 15. 

(Of those injured, 11.2% were neither 

male nor under 15.) 



FIGURE 5.2 



Venn diagrams showing 
relative frequencies for 
(1) colors of sport/compact 
cars made in North America 
during the 2004 model 
year, and (2) persons 
injured by fireworks 



Sources: Part (1) Time Almanac 2006, p. 609. Part (2) U.S. Consumer Product Safety Commission, National 
Electronic Injury Surveillance System. 
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Venn diagram indicates, the possible events are mutually exclusive. For a given 
vehicle, only one color was applied. Using the letters A and B to identify the 
colors silver and red, respectively, and considering a sport/compact car selected at 
random from that model year, we can arrive at the following probabilities: 

• Probability that the car is either silver or red: 

P(A or B) = P(A) + P(B) = 0.20 + 0.17 = 0.37 

• Probability that the car is either silver or some color other than silver or red: 

Although we have not specifically identified "neither silver nor red" with a let- 
ter designation, it can be expressed as (A' and B'). Using this identification, 
we can determine the probability that the randomly selected car is either 
silver or some color other than silver or red: 

P(A or [A' and B']) = P(A) + P(A' and B') = 0.20 + 0.63 = 0.83 

When Events Are Not Mutually Exclusive 

When events are not mutually exclusive, two or more of them can happen at the 
same time. In this case, the general rule of addition can be used in calculating 
probabilities. In terms of two events, it can be stated as follows: 

General rule of addition when events are not mutually exclusive: 
P(A or B) = P{A) + P(B) - P(A and B) 

As befits its name, the general rule of addition can also be applied when the events 
are mutually exclusive. In this case, the final term in the expression will become zero 
because events that are mutually exclusive cannot happen at the same time. 

Relative frequencies for the fireworks victims in the various sex and age cate- 
gories are shown in the Venn diagram in part (2) of Figure 5.2. As the Venn dia- 
gram indicates, the events are not mutually exclusive. For example, a victim can 
be male and under the age of 15, represented by the category (A and B). 

In part (2) of Figure 5.2, the probability that a randomly selected victim 
is a male (event A) is 0.475 + 0.304, or P(A) = 0.779. The probability that 
this randomly selected victim is under age 15 (event B) is 0.109 + 0.304, or 
P(B) = 0.413. If we were to simply add P(A) = 0.779 and P(B) = 0.413, we'd get 
a value greater than 1 (a total of 1.192), which is not possible since a probability 
must be between 0 and 1. The problem is that the probability of the intersection, 
P(A and B) = 0.304, has been counted twice. To make up for this, we add the 
individual probabilities, then subtract the probability of their intersection, and 
the probability of a fireworks victim being male or under age 15 is: 

P(A or B) = P(A) + P(B) - P(A and B) = 0.779 + 0.413 - 0.304 = 0.888 

Using the general rule of addition in determining the probability that a 
randomly selected victim is female or at least 15 years of age, we will use the 
relative frequencies shown in part (2) of Table 5.2: 

P(female) = P(A') = 0.221 
P(15 or older) = P(P') = 0.587 
P(female and [15 or older]) = P(A' and £>') = 0.112 

and 

P(A' or B') = P(A') + P(B') - P(A' and B') 

= 0.221 + 0.587 - 0.112 = 0.696 
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5.21 A financial advisor frequently holds investment 
counseling workshops for persons who have responded 
to his direct mailings. The typical workshop has 10 atten- 
dees. In the past, the advisor has found that in 35% of 
the workshops, nobody signs up for the advanced class 
that is offered; in 30% of the workshops, one person 
signs up; in 25% of the workshops, two people sign up; 
and in 10% of the workshops, three or more people sign 
up. The advisor is holding a workshop tomorrow. What 
is the probability that at least two people will sign up for 
the advanced class? What is the probability that no more 
than one person will sign up? Draw a Venn diagram that 
includes the possible events and their relative frequencies 
for this situation. 

5.22 A survey of employees at a large company found the 
following relative frequencies for the one-way distances 
they had to travel to arrive at work: 

Number of Miles (One-Way) 

A B C D E F 

£5 6-10 11-15 16-20 21-30 a31 

Relative 

Frequency 0.38 0.25 0.16 0.09 0.07 0.05 

a. What is the probability that a randomly selected 
individual will have to travel 1 1 or more miles to 
work? 

b. What is the probability that a randomly selected 
individual will have to travel between 6 and 15 miles 
to work? 

c. Draw a Venn diagram that includes the relative 
frequency probabilities in the table. 

d. Using the letter identifications provided, calculate the 
following probabilities: P(A or B or E); P(A or F); 
P(A' or B); P(A or B or C). 

5.23 In 2005, McDonald's had 31,886 restaurants 
systemwide. Of these, 18,334 were operated by 
franchisees, 9283 by the company, and 4269 by 
affiliates. What is the probability that a randomly 
selected McDonald's restaurant is operated by either a 
franchisee or an affiliate? source: McDonald's Corporation, 
2005 Annual Report, p. 1 . 

5.24 For three mutually exclusive events, P(A) = 0.3, 
P(B) = 0.6, and P(A or B or C) = 1.0. What is the value 
of P(A or C)? 

5.25 It has been reported that the 44,300 employees of 
Northwest Airlines are distributed among the following 



corporate functions: 
2002, p. 1 1 . 



source: Northwest Airlines, Annual Report 



Pilots 

Agents/clerks 
Flight attendants 
Mechanics 
Other 



13% 


(A) 


22 


(B) 


21 


(C) 


17 


(D) 


27 


(E) 


100% 





For the sample space consisting of Northwest 
employees: 

a. Draw a Venn diagram representing events A, B, C, D, 
and E. 

b. What is the value of P(A)? 

c. What is the value of P(A or B)? 

d. What is the value of P(A or D')? 



5.26 In 2003, Entergy Corporation had 2,631,752 
electricity customers. Of these, 86.0% were in the 
residential category (R); 11.8% were commercial 
(C); 1.6% were industrial (I); and 0.6% were 
government and municipal (G). source: Entergy 
Corporation, 2003 Annual Report 

a. Draw a Venn diagram representing events R, C, I, 
and G. 

b. What is the value of P(R)? 

c. What is the value of P( C or G ) ? 

d. What is the value of P(R or C')? 

5.27 According to the Bureau of Labor Statistics, there 
are 777,000 men and 588,000 women in the life, physi- 
cal, and social science occupations category; and 786,000 
men and 204,000 women in the farming, fishing, and 
forestry occupations category. For these 2,355,000 indi- 
viduals, what is the probability that a randomly selected 
individual will either be a male or be in the life, physical, 
and social sciences category? source: Encyclopaedia Britannica 
Almanac 2006, p. 835. 

5.28 Using the information presented in the table 
in Exercise 5.12, calculate the following 
probabilities: 

a. P(A or D) b. P(B or F) 

c. P(CorG) d. P(BorCorG) 

5.29 Using the information presented in the table 
in Exercise 5.15, calculate the following 
probabilities: 

a. P(D or N) b. P(D' or N') 
c. P(D or N') d. P(D' or N) 
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5.5 



MULTIPLICATION RULES FOR PROBABILITY 



While addition rules are used to calculate the probability that at least one of several 
events will occur, in this section we'll consider rules for determining the probability 
that two or more of the events will all occur. These rules involve several important 
terms: 



MARGINAL PROBABILITY The probability that a given event will occur. No 
other events are taken into consideration. A typical expression is P (A). 

JOINT PROBABILITY The probability that two or more events will all occur. A 
typical expression is P(A and B). 

CONDITIONAL PROBABILITY The probability that an event will occur, given 
that another event has already happened. A typical expression is P(A\B), with the 
verbal description, "the probability of A, given B." A conditional probability may be 
determined as follows: 

Conditional probability of event A, given that event B has occurred: 
P(A and B) 



P{A\B) 



P(B) 



(Note: This applies only if P(B) > 0. 



The rules that follow are called multiplication rules; they determine the prob- 
ability that two events will both happen or that three or more events will all hap- 
pen. There are two multiplication rules, and the one that is applicable will depend 
on whether the events are independent or dependent: 



INDEPENDENT EVENTS Events are independent when the occurrence of one 
event has no effect on the probability that another will occur. 

DEPENDENT EVENTS Events are dependent when the occurrence of one event 
changes the probability that another will occur. 



When Events Are Independent 

When events are independent, their joint probability is the product of their indi- 
vidual probabilities. In the case of two events, the multiplication rule is as follows: 

Multiplication rule when events are independent: 

P(A and B) = P(A) X P(B) 

To illustrate this rule, we'll use a device called a tree diagram, which visually 
summarizes the occurrence or nonoccurrence of two or more events. The tree dia- 
gram is especially useful in visualizing events that occur in sequence. Figure 5.3 
illustrates two consecutive tosses of a fair coin. Regardless of the result of the first 
toss, the probability of heads on the next toss is going to be 0.5; the tree diagram 
in Figure 5.3 shows all the possibilities, along with their joint probabilities. 

For example, the probability of getting heads twice in a row is 0.5 X 0.5, or 
0.25. Likewise, the joint probability of any other given sequence (e.g., HI — Tl, 
Tl — T2, or Tl — H2) will also be 0.25. Since the diagram includes all four of the 
possible sequences, the joint probabilities add up to 1.00. 
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First toss 



Second toss 



Heads 



P(H2) = 0.5 



Joint probability 

P(H\ and HI) 
= P(Hl)xP(fl2) 
= 0.5 x 0.5 = 0.25 



FIGURE 5.3 



This tree diagram illustrates 
two coin tosses, a process 
involving independent 
events. The probability of 
heads on one toss has no 
effect on the probability of 
obtaining heads on the 
next. 



/>(#!) = 0.5 



P(T2) = 0.5 



Tails 



P(H\ and T2) 
-- P(Hl) x P(T2) 
-- 0.5 x 0.5 = 0.25 



Heads 



P(T\) = 0.5 



P(H2) = 0.5 



P(Tl and H2) 
■-P{T\)xP(H2) 
-- 0.5 x 0.5 = 0.25 



Tails 



P(T2) = 0.5 



Tails 



P(Tl and T2) 
= P(7T)xP(T2) 
= 0.5 x 0.5 = 0.25 

L00 



When Events Are Not Independent 

When events are not independent, the occurrence of one will influence the proba- 
bility that another will take place. Under these conditions, a more general multi- 
plication rule applies: 

Multiplication rule when events are not independent: 

P(A and B) = P(A) X P(B\A) 

Table 5.3 (page 148) shows marginal, joint, and conditional probabilities for 
the fireworks victims. It indicates that a fireworks victim is more likely to be male 
than female. The respective marginal probabilities are P(A) = 0.779 and P(A') = 
0.221. Of the victims, 30.4% were males under the age of 15, and the joint prob- 
ability for this category is P(A and B) = 0.304. 

The contingency table can be complemented by a tree diagram like the one in 
Figure 5.4. Using the tree diagram, we can visually describe the two variables and 
probabilities related to them. The tree diagram in the figure includes the marginal, 
joint, and conditional probabilities shown in Table 5.3. 

The first set of branches in the tree diagram shows the marginal probabilities. 
These probabilities are based on relative frequencies and have the following values: 

Probability that the victim is a male = P(A) = 0.779 
Probability that the victim is a female = P(A') = 0.221 

The second set of branches in the tree diagram includes conditional probabilities 
dependent on the result of each initial branch. For example, given that the victim is 
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TABLE 5.3 



Relative frequency contin- 
gency table and marginal, 
joint, and conditional prob- 
abilities for a randomly 
selected fireworks victim 



Sex 



A Male 
A' Female,_>. 



Age 



B 


B' 


Under 15 


15 or Older 


0.304 


0.475 


0.109 


0.112 


0.413 


0.587 



0.779- 
0.221 - 



1.000 



Joint probabilities 

P(A and B) = 0.304 
HA and B') = 0.475 
P(A' and B) = 0.109 
Wand SO = 0.112 



Marginal probabilities 

P(A) = 0.779 
HA') = 0.221 

HB) = 0.413 
P(B') = 0.587 



Conditional probabilities, e.g., 

HA and B) _ 0.304 _ 
HB) ~ 0.413 ~ 
HA' and B') 0.112 



P(A\B) 



0.736 



W|60 



P(60 



0.587 



= 0.191 



FIGURE 5.4 



This tree diagram illustrates 
the sex and age of 
fireworks accident victims. 
The events are not indepen- 
dent, so the probabilities in 
the second set of branches 
depend on what has 
happened in the first set. 



Sex of victim 



Age of victim 



0.390 



B = < 15 



A = Male 



P(A) = 0.779 




Joint probability 

P(A and B) 
= P(A) X P(B\A) 
= 0.779 X 0.390 = 0.304 



> 15 



P(A andB 1 ) 
= P(A)xP(B'\A) 
= 0.779x0.610 = 0.475 



B = < 15 



P(A') = 0.221 



P(B\A") = 0.493 



P(A'and B) 
= F(A')xP(B|A') 
= 0.221 X 0.493 = 0.109 




Female 



P(B'|A') = 0.507 



B' = > 15 



P(A'andfi') 
= P(A')xP(B'\A') 
= 0.221 x 0.507 = 0.112 

1.000 
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First bolt 



Second bolt 



Dl = 1st is defective 
P(D1) = 0.200 



Gl = 1st is good 
P(G1) = 0.800 



Dl = 2nd is defective 
P(D2|D1) = 0.111 




G2 = 2nd is good 
P(G2|G1) = 0.778 



Joint probability 

P(D1 and Dl) 
= P(D\) X P(D1\DI) 
= 0.200x0.111 =0.022 



P(D\ and G2) 
= P(Dl)xP(G2|Dl) 
= 0.200x0.889 = 0.178 



P(G1 and£>2) 
= P(G1) X P(D2|G1) 
= 0.800x0.222 = 0.178 



P(G1 and G2) 
= P(Gl)xP(G2|Gl) 
= 0.800 x 0.778 = 0.622 



FIGURE 5.5 



When sampling without 
replacement from a bin 
containing 8 good bolts 
and 2 defective bolts, the 
probability that the second 
bolt is defective will depend 
on whether or not the first 
one selected was defective. 
If the first bolt is defective, 
the probability that the sec- 
ond one will be defective is 
P(D2|D1) = 0.111. 



1.000 



a male, the conditional probability that he is under 15 years old is 



P(B\A) 



P(B and A) 0.304 



P(A) 



0.779 



0.390 



The other conditional probabilities are calculated in a similar manner; then 
the four different joint probabilities shown in the figure can also be computed. As 
in Figure 5.3, these are the only possibilities that can occur, so the sum of the joint 
probabilities is 1. 

Events can be dependent when we sample ivitbout replacement. Figure 5.5 repre- 
sents the selection of two bolts, without replacement, from a bin of 10 bolts in 
which 2 are defective. The probability that the first bolt is defective will be 
p(£) i ) = i, or 0.200. If the first bolt is defective, then the conditional probability that 
the second one will be defective is P(D2 |D1) = ;,or0. 111. This is because there will 
be just one defective in the 9 bolts that remain. If the first bolt is not defective, the prob- 
ability that the second one will be defective is P(D2 Gl ) = or 0.222. 





5.30 What is the difference between a marginal probability 
and a joint probability? 

5.31 It is possible to have a sample space in which P(A) = 
0.7, P(B) = 0.6, and P(A and B) = 0.35. Given this infor- 
mation, would events A and B be mutually exclusive? 
Would they be independent? 



5.32 If events A and B are independent, will P(A \B) be 
greater than, less than, or equal to P(A)? Explain. 

5.33 A study by the U.S. Energy Information Administration 
found that 84.3% of U.S. households with incomes under 
$10,000 did not own a dishwasher while only 21.8% of 
those in the over-$50,000 income range did not own a 
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dishwasher. If one household is randomly selected from 
each income group, determine the probability that source: 
Bureau of the Census, Statistical Abstract of the United States 2001 , p. 61 1. 

a. neither household will own a dishwasher. 

b. both households will own a dishwasher. 

c. the lower-income household will own a dishwasher, 
but the higher-income household will not. 

d. the higher-income household will own a dishwasher, 
but the lower-income household will not. 

5.34 A fair coin is tossed three times. What is the proba- 
bility that the sequence will be heads, tails, heads? 

5.35 A kitchen appliance has 16 working parts, each of 
which has a 0.99 probability of lasting through the prod- 
uct's warranty period. The parts operate independently, but 
if one or more malfunctions, the appliance will not work. 
What is the probability that a randomly selected appliance 
will work satisfactorily throughout the warranty period? 

5.36 An optometry practitioner group has found that 
half of those who need vision correction are patients 
who require bifocal lenses. 

a. For a randomly selected group of three people who 
require vision correction, what is the probability that 
all three will require bifocals? What is the probability 
that none of the three will require bifocals? 

b. If the three individuals had all been selected from an 
organization consisting of retirees over age 65, do 
you believe the joint probabilities calculated in part 
(a) would still be accurate? If not, why not? 

5.37 Through April 30 of the 1999 filing season, 13.7% 
of all individual U.S. tax returns were prepared by H&R 
Block, source: H&R Block, Inc., Annual Report 1999, p. 3. 

a. If two individuals are randomly selected from those 
filing tax returns during this period, what is the prob- 
ability that both of their tax returns were prepared by 
H&R Block? 

b. In part (a), what is the probability that neither return 
was prepared by H&R Block? 

c. In part (a), what is the probability that exactly one of 
the two returns was prepared by H&R Block? 

5.38 Of employed U.S. adults age 25 or older, 90.3% 
have completed high school, while 30.8% have 



completed college. For H = completed high school, 
C = completed college, and assuming that one must 
complete high school before completing college, 
construct a tree diagram to assist your calculation of 
the following probabilities for an employed U.S. adult: 
source: Bureau of the Census, Statistical Abstract of the United States 
2002, p. 140. 

a. P(H) b. P(C\H) 

c. P(HandC) d. P(H and C) 

5.39 A taxi company in a small town has two cabs. Cab 
A stalls at a red light 25% of the time, while cab B stalls 
just 10% of the time. A driver randomly selects one of 
the cars for the first trip of the day. What is the probabil- 
ity that the engine will stall at the first red light the driver 
encounters? 

5.40 Using the table in Exercise 5.12, calculate the condi- 
tional probability of C given each of the age groups, or 
P{C\D), P(C\E), etc. Compare these probabilities and 
speculate as to which age groups seem more likely than 
others to have been (according to the legal definition 

at that time, 0.10% blood alcohol content) intoxicated 
at the time they were victims. 

5.41 Charlie has read a survey result that says 60% of the 
adults in his town consider Wendy's hamburgers to taste 
good. Charlie drives into the local Wendy's and questions 
a young couple about to enter the restaurant. According 
to Charlie, there's only a 0.36 (i.e., 0.6 X 0.6) probability 
that both of these people will say Wendy's hamburgers 
taste good. Do you think Charlie is correct? Explain. 

5.42 Based on the information in Exercise 5.20, if the 
student chosen is known to be on a sports team, what is 
the probability that the student is a female? 

5.43 Based on the information in Exercise 5.21, if the 
advisor has at least one person sign up for the advanced 
class, what is the probability that at least three people 
have signed up? 

5.44 Based on the information in Exercise 5.22, if a 
person is known to travel a one-way distance of at least 
1 1 miles to work, determine the probability that he or 
she drives at least 31 miles to work. 



5.6 BAYES' THEOREM AND THE REVISION 

OF PROBABILITIES 



In the 1700s, Thomas Bayes developed a theorem that is an extension of the 
concept of conditional probability discussed in the previous section. In Bayes' 
application of conditional probability, the emphasis is on sequential events; in par- 
ticular, information obtained from a second event is used to revise the probability 
that a first event has occurred. The theorem will be demonstrated by means of an 
example, and key terms will be introduced along the way. 
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Bayes' Theorem 

Present Information 

A dryer manufacturer purchases heating elements from three different suppliers: 
Argostat, Bermrock, and Thermtek. Thirty percent of the heating elements are 
supplied by Argostat, 50% by Bermrock, and 20% by Thermtek. The elements 
are mixed in a supply bin prior to inspection and installation. Based on past expe- 
rience, 10% of the Argostat elements are defective, compared to only 5% of those 
supplied by Bermrock, and just 4% of those from Thermtek. An assembly worker 
randomly selects an element for installation. What is the probability that the ele- 
ment was supplied by Argostat? 



SOLUTION 

• Events 

A 1 = The element was produced by Argostat 
A 2 = The element was produced by Bermrock 
A 3 = The element was produced by Thermtek 

• Prior probability This is an initial probability based on the present level 
of information. On this basis, P(A 1 ) = 0.300 since Argostat supplies 30% 
of the heating elements. 

Additional Information 

Upon testing the element before installation, an inspector finds it to be defective. 

• Events 

B = A tested element is defective 
B' = A tested element is not defective 

• Posterior probability This is a revised probability that has the benefit of 
additional information. It is a conditional probability and can be expressed 
as PiA^B). 

Argostat's quality record is the worst of the three suppliers, so the finding of a 
defective element would tend to suggest that P(A l \B) is higher than P(A 1 ). From 
an intuitive standpoint, the posterior (revised) probability should be higher than 
0.300. But how much higher? As a first step in finding out, consider the tree dia- 
gram and joint probabilities in Figure 5.6 (page 152). 

Because the events are independent, the marginal probabilities in the first 
branches of the tree diagram are multiplied by the conditional probabilities in the 
second set of branches to obtain the joint probabilities of the final column. How- 
ever, there is a problem: in order to construct the tree diagram, we had to use a 
time sequence that proceeded from the supplier source to the eventual testing of 
the element. 

What we really need is the reverse of this tree diagram, that is, proceeding 
from the present (a known defective) to the past (the probability that it was sup- 
plied by Argostat). The joint probabilities of Figure 5.6 can be used in reversing the 
time sequence. In doing so, it is useful to look at the joint probabilities as relative 
frequencies out of 1000 cases. Taking this perspective, we observe that in 63 (i.e., 
30 + 25 + 8) of the 1000 cases, the element was defective, and in 30 of these 
the element came from Argostat. In other words, we have just determined that 



152 



Part 2: Probability 



FIGURE 5.6 



Heating-element tree 
diagram, shown in the 
order in which the elements 
are received and found to 
be either defective or good 



Supplier 



Condition of a tested 
heating element 



B = defective 



P(B\A 1 ) = 0.100 



: Argostat 



B' = good 



B = defective 




= Thermtek 



B' = good 



B = defective 



P(B'\A 3 ) = 0.960 



B' = good 



Joint probability 



P(A X andS) 
■- 0.300x0. 100 = 0.030 



P(Ai andB') 
■■P(A l )y.P(B')A{) 
■- 0.300 x 0.900 = 0.270 



P(A 2 and B) 
■- P(A 2 ) X P(B\A 2 ) 
■- 0.500 x 0.050 = 0.025 



P(A 2 and B') 
-P(A 2 )xP(B'\A 2 ) 
■- 0.500 x 0.950 = 0.475 



P(A 3 and B) 
-P(A 3 )xP(B\A 3 ) 
: 0.200 x 0.040 = 0.008 



P(A 3 andB') 
■-P(A 3 )XP(B'\A 3 ) 
■- 0.200x0.960 = 0.192 

1.000 



the revised probability, P(A 1 |B), is 30/63, or 0.476. This revised probability is 
among those shown in the reversed tree diagram in Figure 5.7. 



Probabilities can also be revised by using Bayes' theorem. The theorem deals 
with sequential events, using information obtained about a second event to revise 
the probability that a first event has occurred. 

Bayes' theorem for the revision of probability: 

• Events A and B Probability of A, given that event B has occurred: 

= P(A and B) = P(A) X P(B\A) 

1 j P(B) [P(A) X P(B\A)] + [P(A') X P(B\A')] 
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Condition of a tested 
heating element 



Supplier 



A l = Argostat 



P(A l \B) = 0.476 



B = defective 



Aj = Bermiock 



PCS) = 0.063 




FIGURE 5.7 



This reversal of the tree 
diagram in Figure 5.6 
shows that the probability 
that the defective heating 
element came from 
Argostat is 0.476. The prior 
probability, P(A % ) = 0.300, 
has been converted to 
a revised probability, 
P(A,|fi) = 0.476. 



A 3 = Thermtek 




• General form Probability of event A t , given that event B has occurred: 

P(A, and B) _ P(A,) X P(B \A,) 



P(A t \B) 



P(B) - 

2p(A ; .) x P{B\A t ) 

7 = 1 

where A- = the z'th of k mutually 

exclusive and exhaustive events 



Applying the general form of Bayes' theorem, the heating element had to come 
from one of k = 3 suppliers: Argostat (event Aj), Bermrock (event A 2 ), or 
Thermtek (event A 3 ). Additional information is available in the form of knowledge 
that the heating element is defective (event B). We must use the general form of the 
theorem because the number of suppliers is more than two. Given that the heating 
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element is defective, Bayes' theorem is used in determining the probability that it 
was supplied by Argostat: 



P(AAB) 



P(A,) X P(B\A t ) 
XP(B\A f ) 

7 = 1 



P(Aj) = 0.300 

Argostat supplies 
30% of the elements. 



T 



PiBlAJ = 0.100 

Of the elements supplied by 
Argostat, 10% are defective. 



PiA.lB) 



P(A t ) x P(B|Aj 



[P(AA X P(B|A 1 )] + [P(A 2 ) X P(B|A 2 )] + [P(A 3 ) X P(B|A 3 )] 



e.g., P(A 3 ) = 0.200 

Thermtek supplies 
20% of the elements. 



e.g.,P(B|A 3 ) 



0.040 



Of the elements supplied by 
Thermtek, 4% are defective. 



0.300 X 0.100 



[0.300 X 0.100] + [0.500 X 0.050] + [0.200 X 0.040] 



and 



P(A 1 \B) 



0.030 
0.063 



0.476 



As in our calculations based on the tree diagrams in Figures 5.6 and 5.7, the 
probability that the defective heating element was supplied by Argostat is found 
to be 0.476. If the additional information had revealed a nondefective heating ele- 
ment, we could use the same approach in determining the probability that it had 
come from a given supplier. In this case, we would simply replace B with B'. For 
example, if we wished to find the probability that a nondefective heating element 
had come from Thermtek, this would be calculated as 

P(A 3 ) x P(B'\A 3 ) 



P(A 3 \B') 



and 



[P(A X ) X PiB'lA,)] + [P(A 2 ) x P(B'\A 2 )] + [P(A 3 ) x P(B'|A 3 )] 



0.200 X 0.960 



[0.300 X 0.900] + [0.500 X 0.950] + [0.200 X 0.960] 



P(AJB') 



0.192 
0.937 



0.205 



For the heating-element example, Table 5.4 shows a contingency table with 
relative frequencies for suppliers and defectives. Like Table 5.3, Table 5.4 shows 
marginal, joint, and conditional probabilities. Given that a randomly selected ele- 
ment is defective, we can determine the conditional probability that it was sup- 
plied by an individual supplier. This is exactly what we did earlier by using tree 
diagrams (Figures 5.6 and 5.7) and by using Bayes' theorem for the revision of a 
probability. In each case, we have taken advantage of the fact that a revised prob- 
ability can be viewed as a conditional probability. 
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The element is 

B Defective 
B' Good 



Joint probabilities, e.g. 

P(A, and B) = 0.030 
P(A 2 and B') = 0.475 
P(A 3 and B) = 0.008 



Supplier 



A, 


A 2 


*3 


Argostat 


Bermrock 


Thermtek 


0.030 


0.025 


0.008 


0.270 


0.475 


0.192 


0.300 


0.500 


0.200 



t _L i 



0.063- 
0.937- 

1.000 



Marginal probabilities, e.g. 

PCA,) = 0.300 
P(A 2 ) = 0.500 
P(fi) = 0.063 



TABLE 5.4 



A contingency table show- 
ing relative frequencies 
and marginal, joint, and 
conditional probabilities for 
the heating-element example 
of Figures 5.6 and 5.7 



Conditional probabilities, e.g., 

P(A, and B) 0.030 



W, I S) 



P(fi) 

PQ4 3 and 5Q 



0.063 
0.192 
" 0.937 



= 0.476 



0.205 



The concept of additional information and revised probabilities is an 
important part of decision making, an activity in which both statistics and addi- 
tional information are important. This topic will be covered in greater detail 
in Chapter 19 and its CD appendix. 






5.45 For U.S. live births, P(boy) and P(girl) are approxi- 
mately 0.51 and 0.49, respectively. 3 According to a 
newspaper article, a medical process could alter the 
probabilities that a boy or a girl will be born. 
Researchers using the process claim that couples who 
wanted a boy were successful 85% of the time, while 
couples who wanted a girl were successful 77% of the 
time. b Assuming that the medical process does have an 
effect on the sex of the child: Source: Bureau of the Census, 
Statistical Abstract of the United States 2002 , p. 60. Source: Steve 
Twedt, "Boy-Girl Selection Available Soon Here," Pittsburgh Press, 
December 31, 1987, pp. 1,10. 

a. Without medical intervention, what is the probability 
of having a boy? 

b. With medical intervention, what is the conditional 
probability that a couple who wants a boy will have 
a boy? 

c. With medical intervention, what is the conditional 
probability that a couple who wants a girl will 
have a girl? 



5.46 Using the information in Exercise 5.45, assume that 
a couple who wanted a girl was randomly placed into 
either the treatment group (receiving the medical process 
described) or the control group (no treatment 
administered) in a test involving the medical procedure. 

a. What is the couple's prior probability for being in 
the treatment group? 

b. The couple's newborn child is a girl. Given this addi- 
tional information, what is the revised probability 
that the couple was in the treatment group? 

5.47 A magician has two coins: one is unbalanced and lands 
heads 60% of the time; the other is fair and lands heads 
50% of the time. A member of the audience randomly selects 
one of the coins and flips it. The result is heads. 

a. What is the prior probability that the fair coin was 
selected? 

b. Given additional information in the form of the single 
flip that came up heads, what is the revised probability 
that the coin is the fair one? 
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5.48 For the information provided in Exercise 5.39, the 
cabbie finds that his car stalls at the first red light he 
encounters. Given this additional information, what is the 
probability that he has selected cab A? 

5.49 Machine A produces 3% defectives, machine B pro- 
duces 5% defectives, and machine C produces 10% 
defectives. Of the total output from these machines, 60% 



of the items are from machine A, 30% from B, and 10% 
from C. One item is selected at random from a day's 
production. 

a. What is the prior probability that the item came from 
machine C? 

b. If inspection finds the item to be defective, what is the 
revised probability that the item came from machine C? 



COUNTING: PERMUTATIONS AND COMBINATIONS 



Closely related to the concept of probability is the topic of counting. In this section, 
we'll see that counting can be much more than adding up football scores or keep- 
ing track of how many people enter a theater. 



Fundamentals of Counting 

If there are m ways in which a first event can happen, and n ways in which a sec- 
ond event can then occur, the total number of possibilities ism X n. For example, 
if you have two different cars and four different routes to travel to work, you can 
arrive in 2 X 4, or 8, different ways. This can be described as the principle of 
multiplication: 

The principle of multiplication: 

• If, following a first event that can happen in n x ways, a second event can then 
happen in n 2 ways, the total number of ways both can happen is n^n T 

• Alternatively, if each of k independent events can occur in n different ways, 
the total number of possibilities is n^. 

As an example of the second approach, assume that a state's auto license plates 
include 6 different numbers or letters, and that all 26 letters and 10 digits are avail- 
able. As a result, there can be (36) 6 , or 2,176,782,336 different license plates. If we 
include the possibility of blank spaces, this becomes (37) 6 , or 2,565,726,409, more 
than enough to satisfy even the most populous state. As a matter of fact, this number 
of unique plates is more than three times as large as the number of motor vehicles in 
the entire world. 5 

In an application similar to the selection of letters and numbers for license 
plates, consonants and vowels can be randomly selected to either create words or 
fill in blank spaces that are remaining in a word. Statistics in Action 5.1 shows an 
example in which the computer randomly fills in the remaining three letters of the 
word " TECH." 

In some situations, several events all have different possibilities, but the choice 
you make for one limits the number of choices for subsequent events. This might 
occur, for example, if you have five different errands to run and must choose the 



s Source: As of 2002, 787 million motor vehicles were registered throughout the world; New York 
Times Almanac 2006, p. 415. 





Computer Assistance in the Name Game 



When selecting a name for a company or a product, one alter- 
native is to randomly generate combinations of letters, then 
scan the list for "words" that either look attractive or inspire 
further ideas. A computer might be programmed to randomly 
list five-letter words that begin with "k" and end with "x," 
have a desired order of consonants or vowels, or even include 
desired sequences like "corp," "tronics," or "electro." 

For example, we might generate seven-letter words of 
the form ci/cTECH, where c = any of 21 consonants and 



i/= any of 5 vowels. In so forming " TECH," the number 

of possibilities is 21 x 5 x 21, or 2205. The computer 
becomes a high-speed variant of the proverbial typewriting 
monkey who will eventually generate a sentence. As this 
alphabetized partial listing shows, most of the "words" are 
not suitable for naming a company, a product, or even a 
fish. A few of them, however, are fairly reasonable and 
illustrate the possibilities of this method. 



BUGTECH 
JARTECH 
METTECH 
RELTECH 
VUZTECH 



CAVTECH 
JETTECH 
MIPTECH 
ROSTECH 
WIJTECH 



DUFTECH 
KAPTECH 
NESTECH 
SAXTECH 
XOGTECH 



FOZTECH 
KUBTECH 
NOGTECH 
SOTTECH 
YESTECH 



GAGTECH 
LOJTECH 
PASTECH 
TAGTECH 
ZETTECH 



HOWTECH 
MAZTECH 
QUKTECH 
TOYTECH 
ZIQTECH 



order in which you will do them. In this case, the number of possibilities becomes 

5X4X3X2X1 = 120 

t 

There are 5 choices available for the errand to be done first. 
Once the 1st errand is chosen, 4 choices are left. 
Once the 2nd errand is chosen, 3 choices are left. 
Once the 3rd errand is chosen, 2 choices are left. 
Once the 4th errand is chosen, only 1 choice remains. 



The product (5x4x3x2x1) is called a factorial and can be described by 
the symbol "5!." For example, 3! is the same as 3 X 2 X 1. The exclamation point 
is just a mathematical way of saving space. This approach to counting can be 
described as the factorial rule of counting: 

Factorial rule of counting: 

n\ = n X (n - 1) X (n - 2) X ••• X 1 (Note: 0! is defined as 1.) 

Permutations 

Permutations refer to the number of different ways in which objects can be arranged 
in order. In a permutation, each item can appear only once, and each order of the 
items' arrangement constitutes a separate permutation. The number of possible 
arrangements can be determined as follows: 

Number of permutations of n objects taken r at a time: 

n\ 

(n — r)\ 



Part 2: Probability 



example 



Permutations 

Bill has 6 textbooks, but can fit only 4 of them onto a small shelf. If the other 2 
books have to sit on the desk, in how many ways can Bill arrange the shelf? 

SOLUTION 

In this example, there are n = 6 objects, taken r = 4 at a time, and the number of 
choices available for the shelf arrangement would be 



r)\ (6-4)! 



6X5X4X3X2X1 
2X1 



360 



The situation can be viewed as follows: (1) the first shelf position can be filled 
in any of 6 ways, (2) the second shelf position can then be filled in any of 5 ways, 
(3) the third shelf position can then be filled in any of 4 ways, and (4) the fourth 
shelf position can then be filled in any of 3 ways. The total number of ways of fill- 
ing these four shelf positions is thus 6 X 5 X 4 X 3, or 360. In the equation above, 
note that the product (2x1) appears in both the numerator and the denomina- 
tor and is therefore canceled out, reducing the equation to 6x5x4x3 = 360. 

This might seem like a rather large number of possibilities for just 6 books on 
a 4-book shelf, but each different order of arrangement is regarded as a different 
permutation. In permutations, the exact order in which the objects are arranged 
is paramount. 



example 



Permutations 

A tax auditor has 9 different returns to audit, but will have time to examine only 
5 of them tomorrow. In how many different orders can tomorrow's task be carried 
out? 

SOLUTION 

In this case, there will be n = 9 objects, taken r = 5 at a time, and the number of 
different orders for doing tomorrow's work will be 

nl 9! 9X8X7X6X5X4X3X2X1 



- r)\ (9-5)! 4X3X2X1 



15,120 



Combinations 



Unlike permutations, combinations consider only the possible sets of objects, 
regardless of the order in which the members of the set are arranged. The number 
of possible combinations of n objects taken r at a time will be as follows: 
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In the bookshelf example, there would be 15 different combinations by which 
Bill could select 4 of the 6 books for storage on the shelf. This is calculated by 



«! 



6! 



6X5X4X3X2X1 
r\(n - rjl ~ 4!(6 - 4)! ~ (4 X 3 X 2 X 1)(2 X 1) 



15 



For greater ease of computation, this calculation can be simplified by taking 
advantage of the fact that 4! is actually a component of 6!. By making this substi- 
tution, the calculation becomes simpler because 4! is canceled out by appearing in 
both the numerator and denominator: 



\r r\(n 



6! 



6x5x4! 6X5 



r)! 4!(6 - 4)! 4!(2 X 1) 2x1 



15 



In the tax return example, if the tax auditor does not consider the order in 
which tomorrow's audits are carried out, there will be 126 different compositions 
for the group of persons whose returns are audited that day. This is calculated as 



\r r\(n 



9! 



9X8X7X6X5! 3024 



r)\ 5!(9 - 5)! 



5! X 4! 



24 



126 



As these examples suggest, for given values of n and r, with n > r > 1, the 
number of permutations will always be greater than the number of combinations. 
This is because there will be r! permutations for every possible combination. For 
example, if the auditor selects a given set of 5 returns to audit tomorrow, she can 
conduct the audits in 5! different orders. Thus, the number of combinations will 
be just 1/(5!) as large as the number of permutations. This can also be seen by 
comparing the respective formulas: they are identical except that the formula for 
combinations has r\ in the denominator, while that for permutations does not. 




5.50 A tax accountant has three choices for the method 
of treating one of a client's deductions. After this choice 
has been made, there are only two choices for how a sec- 
ond deduction can be treated. What is the total number 
of possibilities for the treatment of the two deductions? 

5.51 A committee consists of eight members, each of 
whom may or may not show up for the next meeting. 
Assuming that the members will be making independent 
decisions on whether or not to attend, how many differ- 
ent possibilities exist for the composition of the meeting? 

5.52 Ten prominent citizens have been nominated for 
a community's "Citizen of the Year" award. First- and 
second-place trophies are awarded to the two persons 
receiving the highest and second-highest number of votes. 
In how many different ways can the trophies be awarded? 



5.53 In advertising her oceanfront cottage for summer 
rental, a property owner can specify (1) whether or not 
pets are permitted, (2) whether the rent will be $700, 
$900, or $1200 per week, (3) whether or not children 
under 10 are allowed, and (4) whether the maximum stay 
is 1, 2, or 3 weeks. How many different versions of the 
ad can be generated? 

5.54 A state's license plate has 6 positions, each of which 
has 37 possibilities (letter, integer, or blank). If the 
purchaser of a vanity plate wants his first three positions 
to be ROY, in how many ways can his request be 
satisfied? 

5.55 When we are considering n objects taken r at a time, 
with r > 1, why will the number of combinations be less 
than the number of permutations? 
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5.56 An investment counselor would like to meet with 
12 of his clients on Monday, but he has time for only 

8 appointments. How many different combinations 
of the clients could be considered for inclusion into his 
limited schedule for that day? 

5.57 How many different combinations are possible if 

6 substitute workers are available to fill 3 openings created 
by employees planning to take vacation leave next week? 



5.58 A roadside museum has 25 exhibits but enough 
space to display only 10 at a time. If the order of arrange- 
ment is considered, how many possibilities exist for the 
eventual display? 

5.59 A sales representative has 35 customers throughout 
the state and is planning a trip during which 20 will 

be visited. In how many orders can the visits be 
made? 



5.8 SUMMARY 



• The concept of probability 

Uncertainty, an inherent property of both business and life, can be expressed in 
terms of probability. For any event, the probability of occurrence will be no less 
than 0 and no more than 1. All possible outcomes of an experiment are included 
in the sample space. 

• Classical, subjective, and relative frequency probabilities 

The classical approach describes probability in terms of the proportion of times 
that an event can be theoretically expected to occur. Because the probability can 
be determined without actually engaging in any experiments, this is known as the 
a priori approach. In the relative frequency approach, the probability of an event 
is expressed as the proportion of times it is observed to occur over a large num- 
ber of trials. 

The subjective approach to probability represents the degree to which one 
believes that an event will happen. Such probabilities can also be described as 
hunches or educated guesses. Another way to express the probability of an event 
is to indicate the odds that it will or will not occur. 

• Venn diagrams and contingency tables 

Events are mutually exclusive if, when one occurs, the other(s) cannot. They are 
exhaustive if they include all of the possible outcomes. An intersection of events 
represents the occurrence of two or more at the same time, while their union 
means that at least one of them will happen. The Venn diagram displays both the 
sample space and the events within. Frequencies or relative frequencies can also 
be summarized in a contingency table. 

• Determining probabilities 

The addition rules describe the probability that at least one of several events will 
occur, e.g., P(A or B). If the events are mutually exclusive, this probability will 
be the sum of their individual probabilities. When the events are not mutually 
exclusive, the general rule of addition applies. This rule takes into account the 
probabilities that two or more of the events will happen at the same time. 

The multiplication rules describe the joint probability that two or more events 
will occur, e.g., P(A and B). When the events are independent, this probability is the 
product of their individual (marginal) probabilities. When the events are not indepen- 
dent, the general rule of multiplication applies. This rule considers the conditional 
probability that an event will occur, given that another one has already happened. 

• Bayes' theorem and tree diagrams 

Bayes' theorem, an application of the conditional probability concept, applies to two 
or more events that occur in sequence. It begins with an initial (prior) probability 
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for an event, then uses knowledge about a second event to calculate a revised 
(posterior) probability for the original event. The application of Bayes' theorem 
can be visually described by a tree diagram which shows the possible sequences of 
events along with their marginal, conditional, and joint probabilities. 

• Permutations and combinations 

Closely related to the concept of probability are the principles of counting. Two 
such rules are the principle of multiplication and the factorial rule of counting. 
The concept of permutations is concerned with the number of possible arrange- 
ments of n objects taken r at a time. Unlike permutations, combinations consider 
only the particular set of objects included, regardless of the order in which they 
happen to be arranged. 



requations 



5 ?$Ti V 



Classical Approach to Probability 

For outcomes that are equally likely, 

„ , , .,. Number of possible outcomes in which the event occurs 

Probability = ^— — 

total number ot possible outcomes 

Relative Frequency Approach to Probability 

For a very large number of trials, 

Number of trials in which the event occurs 

Probability = : - — —, 

1 otal number ot trials 

Probabilities and "Odds" 

1. If the odds in favor of an event happening are A to B, or A:B, the probability being 
expressed is 

A 

A + B 

2. If the probability of an event is x (with 0 < x < 1), the odds in favor of the event are x 
to (1 — x), or x:(l — x). (Note: In practice, the ratio is converted to a more visually 
pleasing form in which the numbers are the lowest-possible integers.) 

Addition Rule When Events Are Mutually Exclusive 

P(A orB) = P(A) + P(B) 
Addition Rule When Events Are Not Mutually Exclusive 

P(A or B) = P(A) + P(B) - P(A and B) 
Conditional Probability 

P(A |B) = P<A p^ B) Note: This applies only if P(B) > 0. 
Multiplication Rule When Events Are Independent 

P(A andB) = P(A) X P(B) 
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Multiplication Rule When Events Are Not Independent 

P(A andB) = P(A) X P(B\A) 
Bayes' Theorem for the Revision of a Probability 

• Events A and B Probability of A, given that event B has occurred: 
P(A and B) P(A) X P(B \A) 



P(A\B) 



P{B) [P(A) X P(B\A)} + [P(A') X P{B\A')] 
General form Probability of event A., given that event B has occurred: 

P(A-andB) P(A,) X P(B \A f ) 



P(AAB) 



P(B) 



2P(A,-) X P(B|A.) 

7=1 

where /A ; = the jth of k mutually exclusive and exhaustive events 
The Principle of Multiplication 

If, following a first event that can happen in n 1 ways, a second event can then happen in 
« 2 ways, the total number of ways both can happen is n 1 n 2 . If each of k independent 
events can occur in n different ways, the total number of possibilities is n k . 

The Factorial Rule of Counting 

n\ = n X (n - 1) X (n - 2) X ••• X 1 Note: 0! is defined as 1. 

Number of Permutations of n Objects Taken r at a Time 



Number of Combinations of n Objects Taken r at a Time 



r\(n — r)\ 




5.60 It has been estimated that the odds of being struck by 
lightning in a given year are 1 to 240,000. Using this infor- 
mation: source: www.lightningtalks.com, July 27, 2006. 

a. Express the odds in terms of "odds against." 

b. What is the probability that a randomly selected indi- 
vidual will be struck by lightning this year? 

c. What is the probability that the person in part (b) will 
be struck by lightning at least once during the next 
20 years? 

5.61 A fast-food chain gives each customer a coupon, 
the surface of which can be scratched to reveal whether 
a prize will be received. The odds for winning $1000 per 
week for life are listed as 1 to 200,000,000, while the 
odds for winning a free hamburger are 1 to 15. Sheila is 
going to have lunch at her local franchise of the chain 



and will receive one coupon during her visit. Determine 
the probability that she will win 

a. $1000 per week for life. 

b. a free hamburger. 

c. either the lifetime $1000/week or a free hamburger. 

d. both the lifetime $1000/week and a free hamburger. 

5.62 For the situation in Exercise 5.61, what is the prob- 
ability that Sheila will visit the franchise 

a. 5 times without winning a free hamburger? 

b. 10 times without winning a free hamburger? 

c. 20 times without winning a free hamburger? 

d. 40 times without winning a free hamburger? 

5.63 The "daily number" of a state lottery is a 3-digit 
integer from 000 to 999. 
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a. Sam buys a ticket for the number 333. What is the 
probability that he will win? 

b. Is the probability found in part (a) a classical, relative 
frequency, or subjective probability? Explain your 
answer. 

c. Shirley buys a ticket for the number 418 since it came 
up yesterday and she thinks it's "hot." Is there a flaw 
in her logic? Explain. 

5.64 The U.S. Bureau of Justice released the following 
probabilities for those arrested for committing various 
felony crimes in the United States: 



Probability of Being 



Crime 


Prosecuted 


Convicted 


Jailed for 
>1 Year 


Homicide 


0.91 


0.75 


0.73 


Assault 


0.79 


0.64 


0.15 


Burglary 


0.88 


0.81 


0.28 


Arson 


0.88 


0.72 


0.28 


Drug offenses 


0.78 


0.69 


0.19 


Weapons 


0.83 


0.70 


0.13 


Public disorder 


0.92 


0.85 


0.12 



Allen has been arrested for burglary, Bill has been 
arrested for a weapons offense, and Charlie has been 
arrested on a public-disorder charge. Assuming these 
individuals are typical perpetrators and the decisions 
regarding their respective fates are unrelated, determine 
the probability that source: U.S. Bureau of Justice, as reported in 
Sam Meddis, "Felony Arrests: Short Terms," USA Today, January 18, 
1988, p. 9A. 

a. Allen will be jailed for more than a year. 

b. either Allen or Bill (or both) will be convicted. 

c. none of the three will be jailed for more than a year. 

d. Allen and Bill will be convicted, but Charlie will be 
found innocent. 

e. none will be prosecuted. 

5.65 For the three perpetrators in Exercise 5.64, 
determine the number of possibilities in which 

a. just one person is convicted. 

b. exactly two of the three persons are convicted. 

c. all three persons are convicted. 

5.66 Dave has been arrested for arson and has been 
convicted. Edgar is currently being prosecuted for a 
drug offense. Applying the U.S. Bureau of Justice data 
in Exercise 5.64, determine the probability that 

a. Dave will spend more than a year in jail. 

b. Edgar will be convicted. 

c. Edgar will spend more than a year in jail. 

5.67 A firm has two computer systems available for 
processing telephone orders. At any given time, system A 
has a 10% chance of being "down," while system B has 
just a 5% chance of being "down." The computer 



systems operate independently. For a typical telephone 
order, determine the probability that 

a. neither computer system will be operational. 

b. both computer systems will be operational. 

c. exactly one of the computer systems will be operational. 

d. the order can be processed without delay. 

5.68 An industrial hoist is being used in an emergency 
job where the weight exceeds the design limits of two of 
its components. For the amount of weight being lifted, 
the probability that the upper attachment hook will fail 
is 0.20. The probability that the lower hook will fail is 
0.10. What is the probability that the hoisting job will be 
successfully completed? 

5.69 The following relative frequency distribution 
describes the household incomes for families living in a 
suburban community: 

Income Relative Frequency 

Less than $20,000 0.10 

$20,000-under $40,000 0.20 

$40,000-under $60,000 0.30 

$60,000-under $80,000 0.25 

$80,000 or more 0.15 

a. For a randomly selected household, what is the prob- 
ability that its annual income is less than $80,000? 

b. If a household is known to have an income of at least 
$20,000, what is the probability that its annual 
income is in the $60,000-under $80,000 category? 

c. Two households are randomly selected from those 
whose income is at least $40,000 per year. What is 
the probability that both of these households are in 
the $80,000 or more category? 

5.70 Collecting data on traffic accident fatalities, the 
National Highway Traffic Safety Administration has 
found that 47.1% of the victims have 0.0% blood 
alcohol content (BAC), 7.0% of the victims have from 
0.01 to 0.07% BAC, and 45.9% of the victims have at 
least 0.08% BAC. For a randomly selected victim: 
source: Statistical Abstract of the United States 2006, p. 720. 

a. What is the probability that the victim's BAC was at 
least 0.01%? 

b. Given that the victim had been drinking prior to the 
accident, what is the probability that this victim's 
BAC was at least 0.08%? 

5.71 During fiscal 1995, Ashland Exploration drilled 

7 wells in the United States and 3 in Nigeria. One of the 
wells in Nigeria was productive and 2 of the wells in the 
United States were productive. The remaining wells were 
dry. For a well selected at random from those drilled 
during fiscal 1995, determine the following probabilities. 
source: Ashland Oil, Inc., 1995 Fiscal Year Operational Supplement, p. 20. 

a. P(drilled in the United States) 

b. P(drilled in the United States or dry) 

c. P(dry drilled in Nigeria) 
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5.72 Avis, Inc., has reported that its fleet consists of 
200,000 vehicles. If the vehicles are independent in terms 
of accident incidence and each has a 0.99999 chance of 
making it through the year without being in a major acci- 
dent, what is the probability that the entire fleet will 
avoid a major accident during the year? source: Avis.com, 
July 27, 2006. 

5.73 Data from the Federal Bureau of Investigation show 
that 1 of every 189 motor vehicles was stolen during 
2000. Applying this statistic to 5 motor vehicles 
randomly selected from the nation's vehicle population: 

source: Bureau of the Census, Statistical Abstract of the United States 
2002, pp. 183, 850. 

a. What is the probability that none of the 5 motor vehi- 
cles will be stolen? 

b. What is the probability that all 5 motor vehicles will 
be stolen? 

c. How many possibilities exist in which 2 of the 5 motor 
vehicles are stolen? 

5.74 Of the participants in a corporate meeting, 20% 
have had Red Cross training in cardiopulmonary resusci- 
tation (CPR). During a break in the session, one partici- 
pant is randomly selected to go across the street and pick 
up some hoagies from the local deli. A patron at the deli 
is stricken by a severe heart attack. His survival prob- 
ability is just 0.40 if he does not receive immediate CPR, 
but will be 0.90 if CPR can be administered right away. 
The meeting participant and the stricken person are the 
only customers in the deli, and none of the employees 
knows CPR. 

a. Without additional information, what is the probabil- 
ity that the stricken patron will survive? 

b. Given that the stricken patron survives, what is the 
probability that the meeting participant knew how to 
administer CPR? 

c. Given that the stricken patron does not survive, what 
is the probability that the meeting participant knew 
how to administer CPR? 

5.75 Of the adults in Jefferson County, 10% have had 
CPR training. For a randomly selected group of adults 
from this county, how large would the group have to be 
in order for the probability that at least one group mem- 
ber has had CPR training to be at least 0.90? 

5.76 Over the years, a realtor estimates that 50% of 
those who contact her about a listing end up allowing her 
to provide a guided tour of the property, 20% of those 
who take a guided tour follow up with a request for more 
information about the property, and 2% of those who 
make the follow-up informational request actually end 
up making an offer on the property. The realtor has just 
taken a young couple on a guided tour of one of her 
listed properties. Based on the probability estimates 
provided above, what is the probability that the young 
couple will actually make an offer on the property? 



5.77 According to Sears, two-thirds of U.S. homeowners 
have an appliance from Sears. In a randomly selected 
group of three homeowners, what is the probability that 
all three would have an appliance from Sears? That at least 
one of the three would have an appliance from Sears? 
source: Sears advertisement insert, Indiana Gazette, May 24, 2003. 

5.78 A computer specialty manufacturer has a two-person 
technical support staff who work independently of each 
other. In the past, Tom has been able to solve 75% of the 
problems he has handled, and Adam has been able to solve 
95% of the problems he has handled. Incoming problems 
are randomly assigned to either Tom or Adam. If a techni- 
cal problem has just been assigned to the support depart- 
ment, what is the probability that it will be handled by 
Adam? If it turns out that the problem was solved, what is 
the probability that it was handled by Adam? 

5.79 When a machine is properly calibrated, 0.5% of its 
output is defective, but when it is out of adjustment, 6% 
of the output is defective. The technician in charge of 
calibration can successfully calibrate the machine in 90% 
of his attempts. The technician has just made an attempt 
to calibrate the machine. 

a. What is the probability that the machine is in adjust- 
ment? 

b. The first unit produced after the calibration effort is 
found to be defective. Given this information, what is 
the probability that the machine is in adjustment? 

5.80 In examining borrower characteristics versus loan 
delinquency, a bank has collected the following informa- 
tion: (1) 15% of the borrowers who have been employed 
at their present job for less than 3 years are behind in their 
payments, (2) 5% of the borrowers who have been 
employed at their present job for at least 3 years are 
behind in their payments, and (3) 80% of the borrowers 
have been employed at their present job for at least 3 
years. Given this information: 

a. What is the probability that a randomly selected loan 
account will be for a person in the same job for at 
least 3 years who is behind in making payments? 

b. What is the probability that a randomly selected loan 
account will be for a person in the same job for less 
than 3 years or who is behind in making payments? 

c. If a loan account is behind, what is the probability 
that the loan is for a person who has been in the 
same job for less than 3 years? 

5.81 A security service employing 10 officers has been 
asked to provide 3 persons for crowd control at a local 
carnival. In how many different ways can the firm staff 
this event? 

5.82 A corporate board of directors consisting of 15 per- 
sons is to form a subcommittee of 5 persons to examine 
an environmental issue currently facing the firm. How 
many different subcommittees are possible? 
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5.83 The chairperson of the accounting department has 
three summer courses available: Accounting 201, 
Accounting 202, and Accounting 305. Twelve faculty 
members are available for assignment to these courses, 
and no faculty member can be assigned to more than one 
course. In how many ways can the chairperson assign 
faculty to these courses? 

5.84 A test to compare the taste of 6 soft drinks is being 
arranged. Each participant will be given 3 of the soft 
drinks and asked to indicate which one tastes best. 

a. How many different possibilities exist for the set 
of soft drinks that a given participant will taste? 



b. If we consider the order in which the participant in 
part (a) tastes the 3 soft drinks, how many possibilities 
exist? 

5.85 A certain brand of bicycle lock requires that each of 
four adjacent wheels be rotated to exactly the correct 
position in order to open the lock and separate the 
chain to which it is attached. If each wheel is labeled 
with the digits 0 through 9, how many different 
possibilities are available to the manufacturer in 
"programming" the sequence of numbers for opening 
any given lock? 



integrated case 





Thorndike Sports Equipment 



Ted Thorndike has been with his grandfather's firm for 5 
weeks and is enjoying his job immensely. The elder 
Thorndike is also pleased — so pleased that he took the 
morning off to do a little fishing. However, while old Luke 
is catching some carp, young Ted is catching some flak. 

At the other end of this morning's telephone call is an 
irate customer whose aluminum tennis racquet has cracked 
in half after just 14 months. The caller is an avid player 
who gets out on the court about two or three times a week, 
and he claims the racquet has never been abused in any 
way. He not only wants Thorndike Sports Equipment to 
send him another racquet, he also demands that the com- 
pany reimburse him the $30 he lost after his racquet broke 
and he was no longer competitive with his opponent. 

Ted assures the indignant caller that the company will 
replace his racquet at no cost, since it is only slightly over 
the 12-month warranty and the company values his loy- 
alty to its products. Unfortunately, the $30 wager cannot 
be covered, even though it was a triple-or-nothing bet from 
the caller's losses in the two previous matches. 

On Mr. Thorndike's return to the office, Ted mentions 
the interesting phone call, and the elder Thorndike is not a 
bit surprised. He says the firm has been getting a lot of 
complaints in recent months, mainly from longtime cus- 
tomers who claim their latest Thorndike aluminum racquet 
didn't hold up as well as the ones they had used in the past. 

Speculating, Mr. Thorndike goes on to point out that 
the company has had two aluminum suppliers for many 
years, but added a third supplier just a year and a half ago. 
He suspects that the most recent supplier may be shipping 



an aluminum alloy that is more brittle and prone to failure. 
He's not sure, but he thinks the racquets that are failing are 
more likely than not to be constructed from the aluminum 
purchased from the newest of the three suppliers. 

When the company sends out a replacement racquet, it 
does so only after it has received the defective one. All of the 
racquets that have been returned over the past 5 years or so 
are in a wooden crate in the basement. Mr. Thorndike isn't 
sure why, but 6 years ago he mandated that each racquet 
produced have a serial number that identifies when it was 
made and who supplied the aluminum for its construction. 
If someone were to ferret through the wooden crate, maybe 
he could shed some light on this business of customers calling 
up with broken racquets. 

Mr. Thorndike goes on, "Say, Ted, why don't you come 
in over the weekend and take a look at those broken rac- 
quets downstairs, and see if maybe there's something going 
on here that I should know about? Maybe I should have 
stayed with the two suppliers I've been using. The prices the 
new folks were offering seemed too good to pass up, but 
maybe it was all too good to be true. Net profits from our 
tennis business aren't going to be very strong if we have to 
keep on sending people all these free racquets." 

As Ted is rummaging through the broken racquets in 
the basement, what kind of information should he be seek- 
ing and how might he structure it in the form of a contin- 
gency table? What kinds of probabilities might be useful 
regarding racquet failure, and what implications could 
Ted's findings have for helping his grandfather better sat- 
isfy the company's racquet customers? 
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Springdale Shopping Survey 

The contingency tables and relative frequency probabili- 
ties in this exercise are based on the Springdale Shopping 
Survey database. Data are in the computer file SHOPPING. 
Information like that gained from the two parts of this 
exercise could provide helpful insights into the nature of 
the respondents, their perceptions, and their spending 
behaviors. In particular, part 2 examines how conditional 
probabilities related to spending behavior might vary, 
depending on the gender of the respondent. 
1. Based on the relative frequencies for responses to each 

variable, determine the probability that a randomly 

selected respondent 

a. [variable 4] spends at least $15 during a trip to 
Springdale Mall. 

b. [variable 5] spends at least $15 during a trip to 
Downtown. 

c. [variable 6] spends at least $15 during a trip to 
West Mall. 

Comparing the preceding probabilities, which areas 
seem strongest and weakest in terms of the amount of 
money a shopper spends during a typical shopping visit? 

d. [variable 11] feels that Springdale Mall has the 
highest-quality goods. 

e. [variable 11] feels that Downtown has the highest- 
quality goods. 

f. [variable 11] feels that West Mall has the highest- 
quality goods. 



Comparing the preceding probabilities, which areas 
are strongest and weakest in terms of the quality of 
goods offered? 

2. Set up a contingency table for the appropriate vari- 
ables, then determine the following probabilities: 

a. [variables 4 and 26] Given that the random 
respondent is a female, what is the probability that 
she spends at least $15 during a trip to Springdale 
Mall? Is a male more likely or less likely than a 
female to spend at least $15 during a visit to this 
area? 

b. [variables 5 and 26] Given that the random 
respondent is a female, what is the probability 
that she spends at least $15 during a trip to Down- 
town? Is a male more likely or less likely than a 
female to spend at least $15 during a visit to this 
area? 

c. [variables 6 and 26] Given that the random 
respondent is a female, what is the probability that 
she spends at least $15 during a trip to West Mall? 
Is a male more likely or less likely than a female to 
spend at least $15 during a visit to this area? 

Based on the preceding probabilities, at which shop- 
ping areas are males and females most likely and least 
likely to spend $15 or more during a shopping visit? 




Baldwin Computer Sales (B) 



In the Baldwin Computer Sales case, visited previously in 
Chapter 3, one of the areas of concern regarding the stu- 
dent purchase program was whether the probability of 
defaulting on the computer payments might be related to 
the university attended by the student. Using the BALDWIN 
data file on the text CD, first construct an appropriate con- 
tingency table of frequencies and determine the overall 
percentage of students who defaulted on their computer 
payments. Then address the following: 

1. Given that a student attended university number 1, 
determine the conditional probability that he or she 
defaulted on the computer payments. 

2. Repeat question 1 for each of the other universities. 
Do any of the conditional probabilities seem especially 
high or low compared to the others? In general, which 
school is associated with those most likely to default, 




and which school is associated with those least likely 
to default? 

3. If a student is randomly chosen from those who have 
defaulted on their computer payments, determine the 
revised probability that the student is from the "most 
likely to default" university identified in question 2. 



ATM Users Are Discrete As Well As Discreet 

Yes, we know automatic teller machine (ATM) customers use discretion as to when 
and how they use the ATM. For example, many zealously shield the keyboard when 
entering their personal ID number into the machine. 

But ATM users are also discrete: They are members of a 
discrete probability distribution. Forx = the 

number of persons arriving at the . ^ e p r jday aft ern ° on 

machine over a period of time, x will be stuC u„hiijtv distribution 
a discrete variable — 0, 1 , 2, 3, and so on. pl 

The next time you're in the vicinity of 
your local ATM, relax and enjoy a soft 
drink for 5 minutes while you count how 
many people arrive at the machine during 
that time. Do this on 9 more occasions. 
Now figure out what percent of the time x 
was equal to 0, what percent of the time x 
was equal to 1, and so on. Congratulations, 
you've just created a discrete probability dis- 
tribution, a type of distribution that is the 
topic of this chapter. 

More specifically, the phenomenon you've 
been observing tends to be well represented by 
a theoretical discrete probability distribution 
called the Poisson distribution, discussed later in 
the chapter. This distribution is useful in both 
describing and predicting the number of people 
arriving at an ATM, a tollbooth, an emergency 
room, and many other kinds of service facilities 
over a given period of time. 



168 



Part 2: Probability 



Understand both the concept and the applications of a probability distribution 
for a random variable. 

Determine whether the random variable in a probability distribution is of the 
discrete type or the continuous type. 

Differentiate among the binomial, hypergeometric, and Poisson discrete proba- 
bility distributions and their applications. 

Understand what is meant by a Bernoulli process and how this applies to the 
consecutive trials associated with the binomial distribution. 

Use the appropriate probability distribution in determining the probability that 
a discrete random variable will have a given value or a value in a given range. 



6.1 INTRODUCTION 



Chapter 2 introduced the relative frequency distribution as a way of describing 
the proportion of actual observations that either had one of several possible val- 
ues or fell into one of a number of different ranges of values. In Chapter 5, we dis- 
cussed classical probability as a probability that could be theoretically determined 
without engaging in any experiments [e.g., P(heads) = 0.5 for a coin flip]. In this 
chapter, we'll see how a probability distribution is really a very special combina- 
tion of these two concepts. 

In general, we can define a probability distribution as the relative frequency 
distribution that should theoretically occur for observations from a given popula- 
tion. In business and other contexts, it can be helpful to proceed from (1) a basic 
understanding of how a natural process seems to operate in generating events to 
(2) identifying the probability that a given event may occur. By using a probabil- 
ity distribution as a model that represents the possible events and their respective 
likelihoods of occurrence, we can make more effective decisions and preparations 
in dealing with the events that the process is generating. 

Random Variables: Discrete versus Continuous 

A random variable is a variable that can take on different values according to the 
outcome of an experiment. Random variables can be either discrete or continu- 
ous, a distinction discussed briefly in Chapter 1: 

DISCRETE RANDOM VARIABLE A random variable that can take on only certain 
values along an interval, with the possible values having gaps between them. For 
example, in a given group of five children, the number who got at least one elec- 
tronic toy for Christmas would be 0, 1, 2, 3, 4, or 5. It could not be a number 
between any of these values, such as 2.338. 

CONTINUOUS RANDOM VARIABLE A random variable that can take on a value 
at any point along an interval. The exact temperature outside as you're reading this 
book could be 1 3.568, 78.352, or 83.81 5 degrees Fahrenheit, or any of an infinity 
of other values in the range of temperatures where colleges and universities are lo- 
cated. (The temperature examples have been expressed to just three decimal places 
here, but there would be countless possibilities along the temperature continuum.) 



learning 
objectives # 

After reading this 
chapter, you should 
be able to: 
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A random variable is described as random because we don't know ahead of 
time exactly what value it will have following the experiment. For example, when 
we toss a coin, we don't know for sure whether it will land heads or tails. Like- 
wise, when we measure the diameter of a roller bearing, we don't know in ad- 
vance what the exact measurement will be. In this chapter, the emphasis will be 
on discrete random variables and their probability distributions. In the next, we'll 
cover variables of the continuous type. 

The Nature of a Discrete Probability Distribution 

A discrete probability distribution is a listing of all possible outcomes of an experi- 
ment, along with their respective probabilities of occurrence. We'll begin our 
discussion of such distributions with a small-scale example. 



Example 




Discrete Probability Distribution 

An experiment is conducted in which a fair coin is flipped twice. The result of the 
experiment will be the random variable x = the number of times that heads 
comes up. 

SOLUTION 

Since the two tosses are independent, the probability of heads remains constant at 
P(heads) = 0.5 from one toss to the next. Each of four possible sequences is 
equally likely, and each has a probability of 0.25: 



Two tosses of a coin: possible sequences and their probabilities 



Sequence x = Number of Heads 



Probability 



HH 
HT 
TH 
TT 



0.25 
0.25 
0.25 
0.25 
1.00 



(0.5 X 0.5) 



Because the possible sequences are mutually exclusive (only one of them can 
happen for a given series of two tosses), the sum of the joint probabilities is 1.00. 
Two of the sequences (HT and TH) involve one head, and we can use the addition 
rule from Chapter 5 to find the probability that x = 1. This can be expressed as 
P(HT or TH) = P(HT) + P{TH) = 0.25 + 0.25 = 0.50. We have just generated 
the following probability distribution for the discrete random variable x = number 
of heads in two coin tosses: 



Two tosses of a coin: discrete probability distribution 
for the random variable x = number of heads 



P(x) 

0.25 
0.50 
0.25 
1.00 
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FIGURE 6.1 



Discrete probability distribu- 
tion for the random variable 
x = number of heads in 
two consecutive tosses of 
a coin 





This distribution can also be described graphically, as in Figure 6.1. This dis- 
crete probability distribution reflects several important characteristics shared by 
all such distributions: 

Characteristics of a discrete probability distribution: 

1. For any value of x, 0 S P(x) ^ 1. 

2. The values of x are exhaustive: the probability distribution includes all possi- 
ble values. 

3. The values of x are mutually exclusive: only one value can occur for a given 
experiment. 

4. The sum of their probabilities is one, or 2P(x ; ) = 1.0. 

Although discrete probability distributions are often based on classical prob- 
abilities (as in Figure 6.1), they can be the result of relative frequencies observed 
from past experience or even from subjective judgments made about future events 
and their likelihood of occurrence. As an example of a discrete probability distri- 
bution based on relative frequencies, consider the following situation. 



example 



Discrete Probability Distribution 

A financial counselor conducts investment seminars with each seminar limited to 
6 attendees. Because of the small size of the seminar group and the personal atten- 
tion each person receives, some of the attendees become clients following the sem- 
inar. For the past 20 seminars she has conducted, x (for x = the number of 
investors who become clients) has had the relative frequency distribution shown 
in Figure 6.2. 

SOLUTION 

The probability distribution for x = the number of attendees who become 
clients can also be described in tabular form. The events and their respective 
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FIGURE 6.2 



Relative frequency distribu- 
tion for the number of 
investors in the seminar 
group who become clients 
of the seminar leader 



probabilities are as follows: 

x P(x) 

0.05 
0.10 
0.20 
0.25 
0.15 
0.15 
0.10 
1.00 

As in the classical relative frequency distribution of Figure 6.1, the events in 
Figure 6.2 are mutually exclusive and exhaustive. Interpreting their relative fre- 
quencies as probabilities, as shown in Figure 6.2, the sum of these probabilities 
will be 1.0. In a given seminar, the probability that nobody will become a client 
is P(0) = 0.05, while the probability that at least 4 will become clients is 
P(x > 4) = P(x = 4) + P(x = 5) + P{x = 6), or 0.15 + 0.15 + 0.10 = 0.40. 



Twenty investment seminars; discrete probabil- 
ity distribution for the random variable x = 
number of investors who become clients 



0 
1 
2 
3 
4 
5 
6 



The Mean and Variance of a Discrete 
Probability Distribution 

For the observed data in Chapter 3, we used measures of central tendency and 
dispersion to describe typical observations and scatter in the data values, respec- 
tively. Similarly, it can be useful to describe a discrete probability distribution in 
terms of its central tendency and dispersion. For example, the investment coun- 
selor of Figure 6.2 may be interested both in how many attendees from a seminar 
typically become clients and in the extent to which the number who become 
clients varies. 

The mean of a discrete probability distribution for a discrete random variable 
is called its expected value, and this is referred to as E(x), or yu,. In the following 
formula, it can be seen to be a weighted average of all the possible outcomes, with 
each outcome weighted according to its probability of occurrence. The variance is 
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the expected value of the squared difference between the random variable and its 
mean, or E[(x — fx) 2 ]: 

General formulas for the mean and variance of a discrete probability distribution: 

• Mean: 

fx = E(x) or fx = ^XjP(Xj) for all possible values of x 

• Variance: 

o 2 = E[{x - fx) 1 } or a 2 = 2(*. - fx) 2 F{x} 
for all possible values of x, and the standard deviation is 

a = Vo 2 




An alternative formula for a 2 is a 2 = (S l x 2 P(x j )] — fx 2 . It provides greater ease 
of calculation when the data values are large or when there are a great many pos- 
sible values of x. 



To demonstrate how the mean and variance are determined, we will compute 
these descriptors for the discrete probability distribution in Figure 6.2. 

Mean and Variance for x = the Number of Investors Who Become Clients 
(Represented by the Discrete Probability Distribution of Figure 6.2) 

• Mean: 

fx = E(x) = 2^P(x ( ) 
= 0(0.05) + 1(0.10) + 2(0.20) + 3(0.25) + 4(0.15) + 5(0.15) + 6(0.10) 
= 3.2 investors who become clients 

• Variance: 

a 2 = E[(x - fx) 2 ] = 2(*,. - fx) 2 P{x t ) 

= (0 - 3.2) 2 (0.05) + (1 - 3.2) 2 (0.10) + (2 - 3.2) 2 (0.20) 

+ (3 - 3.2) 2 (0.25) + (4 - 3.2)2(0.15) + (5 - 3.2) 2 (0.15) 

+ (6 - 3.2)2(0.10) 
= 2.66 

and 

.66 = 1.63 investors who become clients 

In a given seminar, there will never be exactly x = 3.2 investors who become 
clients. However, this is the expected value, or mean of x, and it will tend to be 
the average result if the probability distribution remains unchanged and a great 
many such seminars are conducted. 

The formulas in this section can be used to determine the mean and variance for 
any discrete probability distribution. For some distributions, however, the mean and 
variance can be described and computed by using expressions that are both more 
practical and more easily calculated. As we discuss the distributions to follow, such 
expressions for the mean and variance will be presented as they become relevant. 

Before we move on to examine three particular discrete probability distribu- 
tions, the binomial, the hypergeometric, and the Poisson, consider the probability 
distribution represented by a state's three-digit daily lottery number. There are 




it 



RfEi, 




The Discrete Uniform Distribution That Took a Day Off 



A feature of the Pennsylvania State Lottery is the "daily 
number," a three-digit number selected at 7 p.m. each 
weekday evening. The number is obtained from the first 
ball that pops from each of three machines containing balls 
numbered from 0 to 9. The big event is televised, with a 
senior-citizen witness and a lottery official present to verify 
the number of each ball. 

Since the daily number can range from 000 to 999, the 
chance of any specific result is just 0.001 . Lottery officials 
rigorously weigh the balls and conduct extensive testing to 
make sure some numbers don't have a higher probability 
than others. A number of years ago, an individual was con- 
victed for injecting a latexlike substance that made some of 
the balls heavier. (The author and many others watching 



television noticed that some of the bouncing balls seemed 
a little "tired.") Asa result of this one-day nonuniformity of 
the probability distribution, security was heightened con- 
siderably, and officials are now more scrupulous than ever 
in their efforts to ensure that the probability distribution is 
indeed uniform. 

Despite the uniformity of this discrete probability distri- 
bution, some players still insist on playing a "lucky" num- 
ber based on their birth date, wedding date, or digits from 
an auto license number. Some even avoid playing yester- 
day's number because they think it's been "used up" for 
awhile. Many seem to prefer 333 or 777, although 923, 
147, or any other choice would have the same probability 
of being selected. 



1000 possible outcomes (000 to 999) and, because all of the outcomes are equally 
likely, this is an example of what is called a discrete uniform distribution. 
Attempts to tamper with this uniformity are not well received by state officials. 
See Statistics in Action 6.1. 




6.1 Why is a random variable said to be "random"? 

6.2 Indicate whether each of the following random 
variables is discrete or continuous. 

a. The diameter of aluminum rods coming off a 
production line 

b. The number of years of schooling employees have 
completed 

c. The Dow Jones Industrial Average 

d. The volume of milk purchased during a supermarket 
visit 

6.3 Indicate whether each of the following random 
variables is discrete or continuous. 

a. The number of drive-through customers at your local 
McDonald's today 

b. The amount spent by all customers at the local 
McDonald's today 

c. The time, as reported by your digital watch 

d. The length of a pedestrian's stride 

6.4 The values of x in a discrete probability distribution 
must be both exhaustive and mutually exclusive. What is 
meant by these two terms? 

6.5 A discrete random variable can have the values x = 3, 
x = 8, or x = 10, and the respective probabilities are 0.2, 



0.7, and 0.1. Determine the mean, variance, and standard 
deviation of x. 

6.6 Determine the mean, variance, and standard deviation 
of the following discrete probability distribution: 

x 0 1 2 

P(x) 0.60 0.30 0.10 

6.7 Determine the mean, variance, and standard 
deviation of the following discrete probability 
distribution: 

x 0 1 2 3 4 

P(x) 0.10 0.30 0.30 0.20 0.10 

6.8 Ed Tompkins, the assistant dean of a business school, 
has applied for the position of dean of the school of 
business at a much larger university. The salary at the 
new university has been advertised as $120,000, and Ed 
is very excited about this possibility for making a big 
career move. He has been told by friends within the 
administration of the larger university that his resume is 
impressive and his chances of getting the position he 
seeks are "about 60%." If Ed stays at his current 
position, his salary next year will be $70,000. Assuming 
that his friends have accurately assessed his chances of 
success, what is Ed's expected salary for next year? 
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6.9 Scam artists sometimes "stage" accidents by 
purposely walking in front of slow-moving luxury cars 
whose drivers are presumed to carry large amounts of lia- 
bility insurance. The "victim" then offers to accept a small 
monetary cash settlement so the driver can avoid an insur- 
ance claim, a blot on his driving record, and increased 
insurance premiums. Marlin has just tried to stage such a 
scheme by walking into a slow-moving Mercedes and 
faking a minor injury. Assume that the possible outcomes 
for Marlin are —$500 (i.e., he is arrested and fined for his 
scam), $0 (the driver recognizes the scam and refuses to 
pay), and $100 (the driver simply pays up to get rid of 
Marlin). If the probabilities for these outcomes are 0.1, 0.3, 
and 0.6, respectively, what is Marlin's expected monetary 
outcome for trying to scam the Mercedes driver? 

6.10 A regional manager visits local fast-food franchises 
and evaluates the speed of the service. If the manager 
receives her meal within 45 seconds, the server is given 

a free movie-admission coupon. If it takes more than 
45 seconds, the server receives nothing. Throughout the 
company's franchises, the probability is 0.60 that a meal 
will be served within 45 seconds. What is the expected 
number of coupons a counter employee will receive when 
serving the regional manager? 

6.11 A consultant has presented his client with three al- 
ternative research studies that could be conducted. If the 
client chooses alternative a, the consultant will have to 
hire 1 additional employee. Two new employees will have 
to be hired if the client chooses b, and 5 if the client 
chooses c. The probability that the client will select alter- 
native a is 0.1, with P(b) = 0.5 and P(c) = 0.4. Identify 
the discrete random variable in this situation and 
determine its expected value. 

6.12 A contractor must pay a $40,000 penalty if construc- 
tion of an expensive home requires more than 16 weeks. 
He will receive a bonus of $10,000 if the home is comple- 
ted within 8 weeks. Based on experience with this type of 
project, the contractor feels there is a 0.2 chance the home 
will require more than 16 weeks for completion, and there 
is a 0.3 chance it will be finished within 8 weeks. If the 
price of the home is $350,000 before any penalty or bonus 
adjustment, how much can the buyer expect to pay for her 
new home when it is completed? 

6.13 A music shop is promoting a sale in which the 
purchaser of a compact disk can roll a die, then deduct a 
dollar from the retail price for each dot that shows on the 
rolled die. It is equally likely that the die will come up 
any integer from 1 through 6. The owner of the music 
shop pays $5.00 for each compact disk, then prices them 
at $9.00. During this special promotion, what will be the 
shop's average profit per compact disk sold? 

6.14 Laura McCarthy, the owner of Riverside Bakery, 
has been approached by insurance underwriters trying to 
convince her to purchase flood insurance. According to 



local meteorologists, there is a 0.01 probability that the 
river will flood next year. Riverside's profits for the com- 
ing year depend on whether Laura buys the flood insur- 
ance and whether the river floods. The profits (which take 
into consideration the $10,000 premium for the flood 
insurance) for the four possible combinations of Laura's 
choice and river conditions are: 

The River 

Does Not 

Flood Floods 

Insurance No Flood 

Decision Insurance $200,000 -$1,000,000 
Get Flood 

Insurance $190,000 $200,000 

a. If Laura decides not to purchase flood insurance, use 
the appropriate discrete probability distribution to 
determine Riverside's expected profit next year. 

b. If Laura purchases the flood insurance, what will be 
Riverside's expected profit next year? 

c. Given the results in parts (a) and (b), provide Laura 
with a recommendation. 

6.15 In 2004, nearly 24 million tons of steel mill 
products went to construction and contracting 
companies. Transco Steel, a hypothetical manufacturer 
specializing in the production of steel for this market, is 
faced with aging production facilities and is considering 
the possible renovation of its plant. The company has re- 
duced its alternatives to just three: (1) no renovation at 
all, (2) minor renovation, and (3) major renovation. 
Whichever strategy it chooses, the company's profits will 
depend on the size of the construction and contracting 
industry in future years. Company executives have a sub- 
jective probability of 0.3 that the industry will not grow 
at all, a probability of 0.5 that it will grow moderately, 
and a probability of 0.2 that it will show a high rate of 
growth. The company's estimated profits (in millions of 
dollars) for each combination of renovation strategy and 
industry growth possibility are: 
source: Statistical Abstract of the United States 2006, p. 661 . 



Industry Growth 







None 


Moderate 


High 


Level of Plant 


None 


$35 


$38 


$42 


Renovation 


Minor 


$28 


$45 


$55 




Major 


$21 


$40 


$70 



Given the executives' subjective probabilities for the 
three possible states of industry growth, what will be the 
company's expected profit if it chooses not to renovate 
the plant? If it chooses minor renovations to the plant? If 
it chooses major renovations to the plant? Based on these 
expected values, provide the company's management with 
a recommendation as to which level of renovation they 
should choose. 
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THE BINOMIAL DISTRIBUTION 6.2 



One of the most widely used discrete distributions, the binomial distribution deals 
with consecutive trials, each of which has two possible outcomes. This latter char- 
acteristic is the reason for the "bi" part of the descriptor binomial. The outcomes 
can be identified in general terms, such as "success" versus "failure" or "yes" ver- 
sus "no," or by more specific labels, such as "milk in the refrigerator" versus "no 
milk in the refrigerator." The binomial distribution relies on what is known as the 
Bernoulli process: 

Characteristics of a Bernoulli process: 

1. There are two or more consecutive trials. 

2. In each trial, there are just two possible outcomes — usually denoted as success 
or failure. 

3. The trials are statistically independent; that is, the outcome in any trial is not 
affected by the outcomes of earlier trials, and it does not affect the outcomes 
of later trials. 

4. The probability of a success remains the same from one trial to the next. 



Description and Applications 

In the binomial distribution, the discrete random variable, x, is the number of suc- 
cesses that occur in n consecutive trials of the Bernoulli process. The distribution, 
which has a wide variety of applications that will be demonstrated through exam- 
ples and chapter exercises, can be summarized as follows: 



The binomial probability distribution: 






The probability of exactly x successes in n 


trials is 




n\ 

P(X) = — 7T*(1 - 77f-* 

x\(n — xjl 


where n 

X 


= number of trials 
= number of successes 




77 


= the probability of 

success in any given trial 




(1 " 77) 


= the probability of failure 
in any given trial 


• Mean: 






/x = E(x) 


= W77 




• Variance: 






a 1 = E[(x - ix) 2 ] 


= mr(l — 


77) 



In the binomial distribution, we can view the n trials as comprising (1) the x 
trials that are successes and (2) the (n — x) trials that are failures. The second 
component in the formula, tt x , is the probability that x of x trials will all be suc- 
cessful. The third component, (1 — Tr) n ~ x , is the probability that (n — x) of 
(n — x) trials will all be failures. The first component is the number of possible 
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combinations for n items taken x at a time; in the context of the binomial 
distribution, this is the number of possible sequences where there are exactly x 
successes in n trials. 

The binomial distribution is really a family of distributions, and the exact 
member of the family is determined by the values of n and it. The following ob- 
servations may be made regarding the Bernoulli process and the requirement that 
the probability of success (tt) remain unchanged: 

1. If sampling is done with replacement (the person or other item selected from 
the population is observed, then put back into the population), it will be con- 
stant from one trial to the next. 

2. If, when sampling without replacement, the number of trials (n) is very small 
compared to the population (N) of such trials from which the sample is taken, 
as a practical matter tt can be considered to be constant from one trial to the 
next. This would occur, for example, if n = 5 interview participants were ran- 
domly selected from a population of N = 1000 persons. As a rule of thumb, 
if the population is at least 20 times as large as the number of trials, we will 
assume the "constant tt" assumption has been satisfactorily met. 





Exampl 


E 





Binomial Distribution 



Of the 18,000,000 part-time workers in the United States, 20% participate in 
retirement benefits. 1 A group of 5 people is to be randomly selected from this 
group, and the discrete random variable is x = the number of persons in the 
group who participate in retirement benefits. 

SOLUTION 

This example can be considered a Bernoulli process, with n = 5 trials and 
tt = 0.2. The size of the population (N = 18,000,000) is extremely large com- 
pared to the number of trials, so tt can be assumed to be constant from one trial 
to the next. A worker either participates in retirement benefits or does not 
participate, so there are only two possible outcomes for each of the 5 trials. 

What Is the Expected Value of x? 

The expected number of people in the group who participate can be calculated as 
E(x) = yiTT = 5(0.2), or E(x) = 1.0. The expectation would be that 1.0 persons in 
the group would participate in retirement benefits. (In other words, if we were to 
randomly select a great many such groups, we would expect the average value of 
x to be 1.0.) 

What Is the Probability That the Group Will Include Exactly Two Participants? 

Using the binomial formula, with n = 5, tt = 0.2, and (1 — tt) = (1.0 — 0.2) = 
0.8, the probability of x = exactly 2 participants is 

P(X) = ~, Ul ,, 77*(1 ~ 77)""* 

x\\n — x)\ 



Source: Statistical Abstract of the United States 2006, pp. 367, 635. 
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and 

m) = nw (0.2) 2 (0.8) 5 - 2 



2!(5 - 2)! 



0.205 



5 • 4 • 3 • 2 ■ 1 5-4 



i.e., 



(2-l)(3-2-l) 2-1 



For Each of the Possible Values of x. What Is the Probability That Exactly 
This Number of People Out of the Five Will Be Participants? 

The values x = 0 through x = 5 include all of the possibilities for the number of 
persons in the sample who are participants; their probabilities would be 

P(0) = ^, (0.2)°(0.8) 5 = I 0.328 



0!(5 - 0)! 

(Note: Remember that 0! = 1 and that any nonzero number raised to the zero 
power is also equal to 1.) 



P(l) 
P(2) 
P(3) 
P(4) 
P(5) 



5! 



1!(5 - 


1)! 


5! 




2!(5 - 


2)! 


5! 




3!(5 - 


3)1 


5! 




4!(5 - 


4)! 


5! 




5!(5 - 


5)1 


that 


M = 



(o^j^o.s) 4 

(0.2) 2 (0.8) 3 
(0.2) 3 (0.8) 2 

(o^o.s) 1 

(0.2) 5 (0.8)° 



0.410 



0.205 



(as calculated earlier) 



0.051 



0.006 



0.000 



(rounded) 



E(x) = ^x l P(x t ). Because we have rounded the individual probabilities, the result 
differs slightly from the actual value, /jl = 1.000 persons: 

E(x) = Zx,P(x,) 

= 0(0.328) + 1(0.410) + 2(0.205) + 3(0.051) + 4(0.006) + 5(0.000) 

= 0.997 persons (differs from /x = 1.000 due to rounding of the 
individual probabilities) 

What Is the Probability That the Group Will Include at Least Three Persons 
Who Are Participants? 

Because x can take on only one of the possible values between 0 and 5, the out- 
comes are mutually exclusive. (Notice that the sum of the six probabilities just 
calculated is 1.000.) Thus P{x > 3) will be 

P(x > 3) = [P(x = 3) + P(x = 4) + P(x = 5)] 
= 0.051 + 0.006 + 0.000 = 0.057 

The binomial distribution for the discrete random variable in this example is 
shown in Figure 6.3 (page 178). The distribution is positively skewed. Binomial 
distributions can have different shapes, including being positively skewed, nega- 
tively skewed, or even symmetrical. 
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FIGURE 6.3 



Binomial probability 
distribution for text 
example with n = 5 and 
it = 0.20. Of 18 million 
part-time workers, only 
20% participate in 
retirement benefits. 
Because the population is 
very large compared to the 
number in the sample, tt 
can be considered as 
remaining constant from 
one trial to the next. 




Using the Binomial Tables 

As the preceding example indicates, calculating binomial probabilities can be te- 
dious. Fortunately, binomial distribution tables, such as those in Appendix A, 
make the task a lot easier. We will continue with our example in demonstrating 
their use. 

A given binomial probability distribution depends on the values of tt and n, so 
a separate listing is necessary for various combinations of their values. For the five 
randomly selected part-time workers, n = 5 trials, and we will rely on the individ- 
ual binomial probabilities table (Table A.l), a portion of which is reproduced 
here as Table 6.1. The "tt" columns in the table range from 0.1 to 0.9 in incre- 
ments of 0.1. The value of tt in our example was tt = 0.2, so this will be the 



TABLE 6.1 



A portion of the table of 
individual binomial proba- 
bilities from Appendix A. 
For n = 5 trials, each entry 
is P(x = k) for the specified 
value of the population 
proportion, tt. 



tt 



Binomial Distribution, Individual Probabilities 
forx = number of successes in n trials, prob(x = k) 



0.1 



0.2 



0.3 



0.4 



0.5 



0.6 



0.7 



0.8 



0.9 



0 0.5905 0.3277 0.1681 0.0778 0.0313 0.0102 0.0024 0.0003 0.0000 

1 0.3281 0.4096 0.3601 0.2592 0.1562 0.0768 0.0284 0.0064 0.0005 
k 2 0.0729 0.2048 0.3087 0.3456 0.3125 0.2304 0.1323 0.0512 0.0081 

3 0.0081 0.0512 0.1323 0.2304 0.3125 0.3456 0.3087 0.2048 0.0729 

4 0.0005 0.0064 0.0284 0.0768 0.1562 0.2592 0.3601 0.4096 0.3281 



5 0.0000 0.0003 0.0024 0.0102 0.0313 0.0778 0.1681 0.3277 0.5905 



For n = 5 and tt = 0.2, P(x = 2) is 0.2048. 
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appropriate column for determining the probabilities that we've just calculated 
the hard way. Briefly summarizing our earlier treatment of this example, n = 5, 
tt = 0.2, and the probabilities calculated for the possible values of x were 

x 0 1 2 3 4 5 

P(x) 0.328 0.410 0.205 0.051 0.006 0.000 

Using the individual binomial probabilities in Table 6.1, we refer to the 
"77 = 0.2" column within the "« = 5" section. For example, Table 6.1 shows 
P{x = 0) to be 0.3277. Likewise, P(x = 1) and P{x = 2) are 0.4096 and 0.2048, 
respectively. 

Appendix A also includes a table for cumulative binomial probabilities 
(Table A. 2), a portion of which is reproduced as Table 6.2. Using this table, we 
find that P(x < 3) = 0.9933 for x = the number of participants among the five 
randomly selected workers. 

Using the table of cumulative binomial probabilities is the best way to deter- 
mine "greater than or equal to" probabilities, such as P(x £ 2). For example, to 
find P{x > 2), 



P(x < 1) + P(x > 2) = 1.000 



and 



P(x > 2) = 1.0000 - P(x < 1) 



From the table of cumulative 
probabilities, P(x < 1) = 0.7373. 



so 



P(x > 2) = 1.0000 - 0.7373 = 0.2627 

As an alternative, the individual binomial probabilities in Table 6.1 could also 
be used in finding P(x s 2). Using this table would be more cumbersome, 



Binomial Distribution, Cumulative Probabilities 
forx = number of successes in n trials, prob(x == k) 



n = 5 



tt 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 

0 0.5905 0.3277 0.1681 0.0778 0.0313 0.0102 0.0024 0.0003 0.0000 

1 0.9185 0.7373 0.5282 0.3370 0.1875 0.0870 0.0308 0.0067 0.0005 
k 2 0.9914 0.9421 0.8369 0.6826 0.5000 0.3174 0.1631 0.0579 0.0086 

3 0.9995 0.9933 0.9692 0.9130 0.8125 0.6630 0.4718 0.2627 0.0815 

4 1.0000 0.9997 0.9976 0.9898 0.9688 0.9222 0.8319 0.6723 0.4095 

5 1.0000 1.0000 1.0000 1.0000 1.0000 1.0000 1.0000 1.0000 



TABLE 6.2 



A portion of the table of 
cumulative binomial proba- 
bilities from Appendix A. 
For n = 5 trials, each entry 
is P(x < k) for the specified 
value of the population 
proportion, tt. 



For n = 5 and tt = 0.2, P{x < 3) is 0.9933. 
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however, since we would have to add up each of the probabilities for x = 2 
through x = 5, or 

P(x > 2) = P(x = 2) + P(x = 3) + P(x = 4) + P(x = 5) 
= 0.2048 + 0.0512 + 0.0064 + 0.0003 
= 0.2627 

Excel and Minitab can provide either individual or cumulative binomial 
probabilities. The procedures and the results are shown in Computer Solutions 6.1. 




Binomial Probabilities 

These procedures show how to get binomial probabilities associated with n = 5 trials when the probability 
of a success on any given trial is is = 0.20. 
For each software package: 

Procedure I: Individual or cumulative probabilities for x = 2 successes. 
Procedure II: Complete set of individual or cumulative probabilities for x = 0 successes 
through x = 5 successes. 

EXCEL 





A 


B 


C 


1 


X 


P(X = x) 


P(X <= x) 


2 


0 


0.3277 


0.3277 


3 


1 


0.4096 


0.7373 


4 


2 


0.2048 


0.9421 


5 


3 


0.0512 


0.9933 


6 


4 


0.0064 


0.9997 


7 


5 


0.0003 


1.0000 



I. Excel procedure for just one individual or cumulative binomial probability 

1. Click on the cell into which Excel will place the probability. Click f x . Click Statistical in the Category box. Click 
BINOMDIST in the Function box. Click OK. 

2. Enter the number of successes (2) into the Number_s box. Enter the number of trials (5) into the Trials box. Enter the 
probability of success on any given trial (0.20) into the Probability_s box. To obtain the individual probability, P(x = 2), 
enter false into the Cumulative box. To obtain the cumulative probability, P(x < 2), enter true into the Cumulative 
box. Click OK. 

II. Excel procedure for a complete set of individual or cumulative binomial probabilities 

1 . Enter the column labels into A1:C1 as shown in the printout above. Enter the lowest number of successes in the set 
(in this case, 0) into A2. Enter 1 into A3. Click on cell A2 and highlight cells A2:A3. Click at the lower right corner of 
A3 (the cursor will turn to a " + " sign) and drag downward so cells A2:A7 will contain the values 0 through 5. 

2. To get the set of individual probabilities: Click on cell B2 (this is where the first individual probability will go). Follow the 
remainder of the steps in Excel procedure I, except enter A2 into the Number_s (number of successes) box and enter 
false into the Cumulative box. The first probability now appears in B2. Click to select cell B2. Click the lower right cor- 
ner of B2 (the cursor turns to a " + " sign) and drag down until cells B2:B7 contain the set of individual probabilities. 



(continued) 
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3. To get the set of cumulative probabilities: Click on cell C2 (this is where the first cumulative probability will go). Follow 
the remainder of the steps in Excel procedure I, except enter A2 into the Number_s (number of successes) box and 
enter true into the Cumulative box. The first probability now appears in C2. Click to select cell C2. Click the lower right 
corner of C2 (the cursor turns to a "+" sign) and drag down until cells C2:C7 contain the set of cumulative probabilities. 



MINITAB 

Probability Density Function 

Binomial with n = 5 and p = 0.2 



PC X = x ) 
0.32768 
0.40960 
0.20480 
0.05120 
0.00640 
0.00032 



Cumulative Distribution Function 

Binomial with n = 5 and p = 0.2 

x PC X <= x ) 



0.32768 
0.73728 
0.94208 
0.99328 
0.99968 
1.00000 



I. Minitab procedure for just one individual or cumulative binomial probability 

1 . Click Calc. Select Probability Distributions. Click Binomial. 

2. In the Binomial Distribution menu, enter 5 into the Number of trials box. Enter 0.20 into the Probability of success 

box. Select Input constant and enter the desired x value (e.g., 2) into the box. Select either Probability or Cumulative 

probability, depending on whether you want an individual probability or a cumulative probability. Click OK. 

II. Minitab procedure for a complete set of individual or cumulative binomial probabilities 

Enter the desired x values (in this case 0 through 5) into column C1 . Follow steps 1 and 2 above, except select Input 
column and enter C1 into the box. 



Additional Comments on the Binomial Distribution 

Under certain conditions, the binomial distribution can be approximated by other 
probability distributions. This is especially useful since there is a practical limit to 
the sheer volume of printed tables that can be included in a statistical appendix. A 
later section of this chapter discusses how another discrete probability distribution 
called the Poisson distribution can be used to approximate the binomial. 
Introduced in the next chapter, a continuous distribution called the normal distri- 
bution is especially useful as a satisfactory substitute in many practical applications. 




EXEfCISE! 




6.16 What is necessary for a process to be considered a 
Bernoulli process? 

6.17 When we are sampling without replacement, under 
what conditions can we assume that the constant tt 
assumption has been satisfactorily met? 

6.18 Seven trials are conducted in a Bernoulli process in 
which the probability of success in a given trial is 0.7. If 



x = the number of successes, determine the following: 
a. E(x) b. a c. P(x = 3) 

d. P(3 < x < 5) e. P(x > 4) 

6.19 Twelve trials are conducted in a Bernoulli process 
in which the probability of success in a given trial is 0.3. 
If x = the number of successes, determine the following: 
a. E(x) b. <r x c. P(x = 3) 

d. P(2sxs 8) e. P(x > 3) 
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6.20 A city law-enforcement official has stated that 20% 
of the items sold by pawn shops within the city have been 
stolen. Ralph has just purchased 4 items from one of the 
city's pawn shops. Assuming that the official is correct, 
and for x = the number of Ralph's purchases that have 
been stolen, determine the following: 

a. P(x = 0) b. P(2 < x) 

c. P(l < x < 3) d. P(x < 2) 

6.21 The regional manager in Exercise 6.10 plans to evalu- 
ate the speed of service at five local franchises of the restau- 
rant. For the five servers, determine the probability that 

a. none will receive a movie coupon. 

b. two of the five will receive a coupon. 

c. three of the five will receive a coupon. 

d. all five will receive a coupon. 

6.22 According to the National Marine Manufacturers 
Association, 50.0% of the population of Vermont were 
boating participants during the most recent year. For a 
randomly selected sample of 20 Vermont residents, with 
the discrete random variable x = the number in the sam- 
ple who were boating participants that year, determine 
the following: source: National Marine Manufacturers Association, 
NMMA Boating Industry Report, p. 3. 

a. E(x) b. P(x < 8) c. P(x = 10) 

d. P(x = 12) e. P(7 < x < 13) 

6.23 In the town of Hickoryville, an adult citizen has a 

1 % chance of being selected for jury duty next year. If jury 
members are selected at random, and the Anderson family 
includes three adults, determine the probability that 

a. none of the Andersons will be chosen. 

b. exactly one of the Andersons will be chosen. 

c. exactly two of the Andersons will be chosen. 

d. all three Andersons will be chosen. 

6.24 It has been reported that the overall graduation rate 
for football players at Division I-A colleges and universi- 
ties is 65%. source: "How Football Rates," USA Today, December 
20, 2005, p. 12C. 

a. If five players are randomly selected from the entire 
population of Division I-A football players, what is 
the probability that at least 3 of the 5 will graduate? 

b. In part (a), what is the probability that all 5 of the 
players will graduate? 

6.25 An association of coffee producers has estimated 
that 50% of adult workers drink coffee at least occasion- 
ally while on the job. Given this estimate: 

a. For a randomly selected group of 4 adult workers, 
what is the probability that no more than 3 of them 
drink coffee at least occasionally while on the job? 

b. In part (a), what is the probability that at least 2 of 
these individuals drink coffee at least occasionally 
while on the job? 

6.26 The U.S. Department of Labor has reported that 
30% of the 2.1 million mathematical and computer 
scientists in the United States are women. If 3 individuals 
are randomly selected from this occupational group, and 



x = the number of females, determine P(x = 0), P(x = 1), 

P(x = 2), and P(x = 3). source: Bureau of the Census, Statistical 
Abstract of the United States 2002, p. 381 . 

6.27 Alicia's schedule includes three Tuesday/Thursday 
courses in which each professor uses a single coin flip to 
decide whether to give a short quiz each class day. Alicia is 
active on the university's student advisory council, works 
two part-time jobs, and is helping to build a float for the 
upcoming homecoming parade. As a result of these activi- 
ties and slightly inadequate planning on her part, she has 
had very little time to prepare for her Thursday classes 
being held tomorrow. What is the probability that Alicia 
will be "lucky" and have no surprise quizzes tomorrow? 
What is the probability that the worst will occur and she 
will have surprise quizzes in all three classes? What is the 
probability that she will escape with minimal damage and 
have a quiz in only one of the three classes? 

6.28 OfficeQuip is a small office supply firm that is cur- 
rently bidding on furniture and office equipment 
contracts with four different potential customers who are 
of comparable size. For each contract, OfficeQuip would 
gain a profit of $50,000 if that contract were accepted, so 
the company could make as little as $0 or as much as 
$200,000. The four potential customers are making inde- 
pendent decisions, and in each case the probability that 
OfficeQuip will receive the contract is 0.40. When all the 
decisions have been made, what is the probability that 
OfficeQuip will receive none of the contracts? Exactly 
one of the contracts? Exactly two of the contracts? 
Exactly three of the contracts? All four contracts? Over- 
all, what is OfficeQuip's expected profit in this business- 
procurement venture? 

6.29 Four wheel bearings are to be replaced on a 
company vehicle, and the mechanic has selected the four 
replacement parts from a large supply bin. Unknown to 
the mechanic, 10% of the bearings are defective and will 
fail within the first 100 miles after installation. What is 
the probability that the company vehicle will have a 
breakdown due to defective wheel bearings before it com- 
pletes tomorrow's 200-mile delivery route? 

6.30 It has been estimated that one in five Americans suf- 
fers from allergies. The president of Hargrove University 
plans to randomly select 10 students from the undergrad- 
uate population of 1400 to attend a dinner at his home. 
Assuming that Hargrove students are typical in terms of 
susceptibility to allergies and that the college president's 
home happens to contain just about every common aller- 
gen to which afflicted persons react, what is the probabil- 
ity that at least 8 of the students will be able to stay for 
the duration of the dinner event? source: World Almanac and 
Book of Facts 2006, p. 171. 

6.31 Airlines book more seats than are actually available, 
then "bump" would-be passengers whenever more people 
show up than there are seats. In 2001, the rate at which 
passengers were bumped was 1.87 per 1000 passengers. 
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Assuming that, on average, the probability of any given 
passenger being bumped is 1.87/1000, or 0.00187: 

source: Time Almanac 2003, p. 216. 

a. Emily is flying nonstop to visit Besco, Inc., for a job 
interview. What is the probability that she will be 
bumped? 

b. Ten members of the Besco board of directors will be 
flying nonstop to the firm's annual meeting, and their 
flights are independent of each other. What is the 
probability that at least one of them will be bumped? 

c. Besco sales personnel made a total of 220 nonstop 
flights last year, and they experienced a total of four 



bumpings. Could the company's experience be 
considered as very unusual? 

6.32 Every day, artists at Arnold's House of Fine 
Figurines produce 5 miniature statues that must be 
sanded and painted. Past experience has shown that 10% 
of the statues have a defect that does not show up until 
after the statue has been sanded and painted. Whenever a 
statue is found to have this defect, Arnold must pay a 
specialist $20 to correct the defect and repaint the statue. 
Describe the probability distribution for the daily amount 
that Arnold must pay the specialist, and determine the 
average daily amount the specialist will receive. 



THE HYPERGEOMETRIC DISTRIBUTION 



6.3 



Description and Applications 

The hypergeometric probability distribution is similar to the binomial in focusing 
on the number of successes in a given number of consecutive trials, but there are 
two major differences: First, the consecutive trials are not independent; that is, the 
outcome in a trial can be affected by the outcomes in earlier trials, and can in turn 
affect the outcomes in later trials. Second, the probability of success does not re- 
main constant from one trial to the next. Unlike the binomial distribution, the hy- 
pergeometric distribution does not rely on a Bernoulli process. 

The distinction between the binomial and hypergeometric distributions can 
also be viewed in terms of sampling with replacement versus sampling without 
replacement. In sampling without replacement, the object selected from the pop- 
ulation is not returned to the population. When sampling is done without replace- 
ment and the population is not very large, the probability of a success can change 
quite drastically from one trial to the next. 

Situations like these can be analyzed by means of tree diagrams, along with 
their marginal, conditional, and joint probabilities, but the hypergeometric distri- 
bution is a lot easier. The distribution can be described as follows: 



The hypergeometric distribution: 

The probability of exactly x successes in n trials that are not independent is 

'si In - s 



P(x) 



\n — x 



N 



where N = size of the population 
n = size of the sample 
s = number of successes in 

the population 
x = number of successes in the sample 



Mean: 
Variance: 



p, = E(x) 



a 2 = E[(x- p.) 2 ] 



ns 
N 



ns{N-s) N-n 



N ■ 
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example 



Hypergeometric Distribution 

Twenty businesses located in a small community have filed tax returns for the 
preceding year. Six of the returns were filled out incorrectly. An Internal Revenue 
Service (IRS) auditor randomly selects four returns for further examination, and 
the random variable is x = the number of examined returns that are incorrectly 
filled out. 

What Is the Expected Value of x? 

For a sample size of n = 4, with s = 6 incorrect returns in the population, and 
a population size of N = 20, the expected number of incorrect returns in the 
sample is 



E(x) 



us 
~N 



4(6) 
20 



1.20 incorrect returns 



What Is the Probability That There Will Be Exactly Two Incorrect Returns 
Among the Four That Are Chosen for Further Examination? 

The quantity to be determined is P(x = 2), with N = 20 businesses in the popu- 
lation, s = 6 incorrect returns in the population, and n = 4 returns randomly se- 
lected from the population. Using the hypergeometric formula for P(x): 



P(x) 



s 




N-s 


x 




n - x 



N 



and P(2) 



l6\ _ 


'20 


- 6 


\2j ' 


, 4 






20 






4 





14 

2 



20 
4 



so 



6! 



14! 



5 14-13 



P(2) 



2!(6 - 2)! 2!(14 - 2)! 2-1 2-1 



20! 



1365 



20 • 19 • 18 • 17 4845 



0.2817 



4!(20 - 4)! 



4 • 3 • 2 • 1 



The probability, P(x = 2), is 0.2817 that exactly two of the four returns will 
be incorrect. If we were to repeat this procedure for the other possible values of x, 
we would find the following probabilities for x = 0 through 4 incorrect returns in 
the sample: 



x = Number of 

incorrect returns: 0 1 2 3 4 

P(x): 0.2066 0.4508 0.2817 0.0578 0.0031 



As the preceding example illustrates, the calculation of hypergeometric proba- 
bilities is rather cumbersome. Unfortunately, with all the possible combinations of 
the determining characteristics, N, n, and s, tables of hypergeometric probabilities 
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are not a realistic alternative. It is much easier to use the computer, and Computer 
Solutions 6.2 describes how we can use Excel or Minitab to determine individual 
or cumulative hypergeometric probabilities. 



i computer solutions B. 2 J*a> 



Hypergeometric Probabilities 

These procedures show how to get hypergeometric probabilities associated with x = the number of 
successes in n = 4 trials from a population of N = 20 when there are s = 6 successes within the population. 
The trials are without replacement. For each software package: 

Procedure I: Individual or cumulative probabilities for x = 2 successes. 
Procedure II: Complete set of individual or cumulative probabilities for x = 0 successes 
through x = 4 successes. 

EXCEL 





A 


B 


C 


1 


X 


P(X = x) 


P(X <= x) 


2 


0 


0.2066 


0.2066 


3 


1 


0.4508 


0.6574 


4 


2 


0.2817 


0.9391 


5 


3 


0.0578 


0.9969 


6 


4 


0.0031 


1.0000 



I. Excel procedure for obtaining an individual hypergeometric probability 

1 . Click on the cell into which Excel is to place the individual probability. Click f x . Click Statistical in the Category box. 
Click HYPGEOMDIST in the Function box. Click OK. 

2. Enter the number of successes in the sample (2) into the Sample_s box. Enter the number of trials (4) into the 
Number sample box. Enter the number of successes in the population (6) into the Population_s box. Enter the size 
of the population (20) into the Number_pop box. Click OK. The probability, P(x =2) = 0.281 7, will appear in the cell 
selected in step 1 . 

[Alternatively, you can use Excel worksheet template tmhyper.xls on the CD that accompanies the text.] 
II. Excel procedure for a complete set of individual or cumulative hypergeometric probabilities 

1 . Enter the column labels into A1:C1 as shown in the printout above. Enter the lowest possible number of successes in 
the sample (in this case, 0) into A2. Enter 1 into A3. Click on cell A2 and highlight cells A2:A3 Click at the lower right 
corner of A3 (the cursor will turn to a " + " sign) and drag downward so cells A2:A6 will contain the values 0 through 
4. (Note that the highest possible number of successes in the sample is 4.) 

2. To get the set of individual probabilities: Click on cell B2 (this is where the first individual probability will go). Click f x . 
Click Statistical in the Category box. Click HYPGEOMDIST in the Function box. Click OK. Now follow step 2 in 
Excel procedure I, except enter A2 into the Sample_s (number of successes in the sample) box. The first individual 
probability now appears in B2. 

3. Click to select cell B2. Click the lower right corner of B2 (the cursor turns to a " + " sign) and drag down until cells 
B2:B6 contain the set of individual probabilities from P(x = 0) through P(x = 4). 

4. To get the set of cumulative probabilities: Copy the value in cell B2 to cell C2 (this will be the first cumulative proba- 
bility). Click on cell C3 and enter the equation =C2+B3. Click the lower right corner of cell C3 (the cursor turns to a 
" + " sign) and drag down until cells C2:C6 contain the set of cumulative probabilities. 
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MINITAB 

Probability Density Function 

Hypergeometri c N = 20, M = 6, and n = 
x P( X = x ) 

0 0.206605 

1 0.450774 

2 0.281734 

3 0.057792 

4 0.003096 



Cumulative Distribution Function 

4 Hypergeometri c N = 20, M = 6, and n = 4 
x PC X <=x) 

0 0.20660 

1 0.65738 

2 0.93911 

3 0.99690 

4 1.00000 



I. Minitab procedure for just one individual or cumulative hypergeometric probability 

1 . Click Calc. Select Probability Distributions. Click Hypergeometric. 

2. In the Hypergeometric Distribution menu, enter 20 into the Population size (N) box. Enter 6 into the Event count 
in population (M) box. Enter 4 into the Sample size (n) box. Select Input constant and enter the desired x value 
(e.g., 2) into the box. Select either Probability or Cumulative probability, depending on whether you want an in- 
dividual probability or a cumulative probability. Click OK. 



II. Minitab procedure for a complete set of individual or cumulative hypergeometric probabilities 

Enter the desired x values (in this case, 0 through 4) into column C1. Follow steps 1 and 2 above, except select Input 
column and enter C1 into the box. 




6.33 Under what circumstances should the hypergeo- 
metric distribution be used instead of the binomial 
distribution? 

6.34 Using the hypergeometric distribution, with N = 4, 
n = 2, and s = 3, determine the following: 

a. P(x = 0) b. P(x = 1) c. P(x = 2) 

6.35 Using the hypergeometric distribution, with N = 5, 
n = 2, and s = 3, determine the following: 

a. P(x = 0) b. P(x = 1) c. P(x = 2) 

6.36 In a criminal trial, there are 25 persons who have 
been approved by both parties for possible inclusion 
in the eventual jury of 12. Of those who have been 
approved, there are 14 women and 11 men. If the 
judge forms the final jury of 12 by randomly selecting 
individuals from the approved listing, what is the 
probability that at least half of the eventual jurors 
will be males? 

6.37 A computer firm must send a group of three 
specialists to give a technical presentation to a potential 



customer. Five such individuals are anxious to attend, 
but only two are considered to have strong interpersonal 
skills. Since the specialists have equal levels of seniority, 
they have agreed to place their names in a hat and allow 
management to randomly select the three to attend the 
presentation. What is the probability that there will be at 
least one person with strong interpersonal skills among the 
group that is sent? 

6.38 Unknown to a rental car office, 3 of the 12 subcom- 
pact models now available for rental are subject to a 
safety recall soon to be announced by the National 
Highway Traffic Safety Administration. Five sub- 
compacts will be rented today, and the cars will be 
randomly selected from those available in the pool. 
What is the probability that exactly one of the recall- 
affected cars will be rented today? What is the probabil- 
ity that all three of the recall-affected cars will be rented 
today? 
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THE POISSON DISTRIBUTION 



Description and Applications 

The Poisson distribution is a discrete probability distribution that is applied to 
events for which the probability of occurrence over a given span of time, space, or 
distance is extremely small. The discrete random variable, x, is the number of 
times the event occurs over the given span, and x can be 0, 1, 2, 3, and so on, with 
(theoretically) no upper limit. Besides its ability to approximate the binomial dis- 
tribution when n is large and it is small, the Poisson distribution tends to describe 
phenomena like these: 

• Customer arrivals at a service point during a given period of time, such as the 
number of motorists approaching a tollbooth, the number of hungry persons 
entering a McDonald's restaurant, or the number of calls received by a com- 
pany switchboard. In this context it is also useful in a management science 
technique called queuing (waiting-line) theory. 

• Defects in manufactured materials, such as the number of flaws in wire or 
pipe products over a given number of feet, or the number of knots in wooden 
panels for a given area 

• The number of work-related deaths, accidents, or injuries over a given num- 
ber of production hours 

• The number of births, deaths, marriages, divorces, suicides, and homicides 
over a given period of time 

Although it is closely related to the binomial distribution, the Poisson distri- 
bution has a number of characteristics that make it unique. Like the binomial 
distribution, it is a family of distributions, but its shape is determined by just 
one descriptor, its mean. In the Poisson distribution, the mean is called A (the 
Greek lowercase letter lambda) instead of fi. Also, the mean of the distribution 
is numerically equal to its variance. The distribution can be summarized as 
follows: 

The Poisson distribution: 

The probability that an event will occur exactly x times over a given span of 
time, space, or distance is 

A* • e" A 

P( x ) = j — where A = the mean, or E(x); the expected number 

of occurrences over the given span 
e = the mathematical constant, 2.71828 
(e is the base of the natural logarithm 
system) 

(Note: In the Poisson distribution, the mean and the variance are equal.) 



i. 



2. 



When using a pocket calculator to make the computation, remember that 
2.71828" x is the same as 1 divided by 2.71828 x . 

Appendix A also contains tables for individual and cumulative Poisson prob- 
abilities. The application of these tables will be discussed after we use an ex- 
ample in demonstrating how the Poisson distribution works. 



N 


O 


T E 


S 



Example 



Part 2: Probability 




Poisson Distribution 

In an urban county, health care officials anticipate that the number of births this 
year will be the same as last year, when 438 children were born — an average of 
438/365, or 1.2 births per day. Daily births have been distributed according to 
the Poisson distribution. 

SOLUTION 

What Is the Mean of the Distribution? 

E(x) = A can be expressed in a number of ways, because it reflects here the 
number of occurrences over a span of time. Accordingly, the "span of time" 
can be months (A = 438/12 = 36.5 births per month), weeks (A = 438/52 = 
8.42 births per week), or days (A = 438/365 = 1.2 births per day). For pur- 
poses of our example, we will be using A = 1.2 births per day in describing the 
distribution. 



For Any Given Day, What Is the Probability That No Children Will Be Born? 

Using the Poisson formula, with A = 1.2 births per day, to find P(x = 0): 



P(x) 



and P(0) 



A x • e" 



o . „-1.2 



1.2° • e 



1 • 0.30119 



0! 



0.3012 



Calculate Each of the Following Probabilities: P(x = 1), P(x = 2), P(x = 3), 
P(x = 4), and P(x = 5) 

Using the same approach as in the preceding calculations, with A = 1.2 mean 
births per day and values of x from 1-5 births per day: 



P(x = 1 birth) : 
P(x = 2 births) 
P(x = 3 births) 
P(x = 4 births) 
P(x = 5 births) 



1.2 1 


• e 


1.2 


(1.2000)(0.30119) 




1! 




1 


1.2 2 


• e 


-1.2 


(1.4400)(0.30119) 




2! 




2 • 1 


1.2 3 


• e 


-1.2 


(1.7280)(0.30119) 




3! 




3-2-1 


1.2 4 


• e 


-1.2 


(2.0736)(0.30119) 




4! 




4-3-2-1 


1.2 s 


• e 


-1.2 


(2.4883)(0.30119) 




5! 




5 • 4 • 3 • 2 • 1 



0.3614 



0.2169 



0.0867 



0.0260 



0.0062 



If we were to continue, we'd find that P(x = 6) = 0.0012 and 
P(x = 7) = 0.0002, with both rounded to four decimal places. Including these 
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FIGURE 6.4 



For the example discussed 
in the text, the Poisson 
probability distribution 
for the number of births 
per day is skewed to the 
right. The mean of the 
distribution is 1.2 births per 
day, and this descriptor is all 
that is needed to determine 
the Poisson probability for 
each value of x. 



probabilities with those just calculated, the result would be the following Poisson 
probability distribution. As Figure 6.4 shows, the distribution is positively 
skewed. 

x 0 123456 7 

P{x) 0.3012 0.3614 0.2169 0.0867 0.0260 0.0062 0.0012 0.0002 

What Is the Probability That No More Than One Birth Will Occur 
on a Given Day? 

Since the events are mutually exclusive, this can be calculated as P(x = 0) + 
P{x = 1), or 0.3012 + 0.3614, and P{x < 1) = 0.6626. 



Using the Poisson Tables 

The Poisson calculations are much easier than those for the binomial. For 
example, you need only compute e~ x just one time for a given value of X, since the 
same term is used in finding the probability for each value of x. Using either the in- 
dividual or the cumulative Poisson distribution table in Appendix A (Tables A.3 
and A. 4, respectively) makes the job even easier. Table 6.3 (page 190) shows a por- 
tion of Table A.3, while Table 6.4 shows its cumulative counterpart. If the prob- 
lem on which you are working happens to include one of the A values listed, you 
need only refer to the P(x = k) or the P(x ^ k) values in the appropriate table. 

In the previous example, A was 1.2 births per day. Referring to the A = 1.2 
column of Table 6.3, we see the same probabilities just calculated for P(x = 1) 
through P(x = 7). For example, in the fifth row of the A = 1.2 column, the entry 
for P(x = 4) is 0.0260. To quickly find the probability that there would be no 
more than 2 births on a given day, we can refer to the cumulative Poisson proba- 
bilities in Table 6.4 and see that P(x < 2) is 0.8795. 

In some cases, the sum of the individual probabilities in a particular Poisson 
table will not add up to 1.0000. This is due to (1) rounding and (2) the fact that 
there is no upper limit to the possible values of x. Some values not listed in the 
table have P(x) > 0, but would appear as 0.0000 because of their extremely small 
chance of occurrence. 
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TABLE 6.3 



A portion of the table of 
individual Poisson probabili- 
ties from Appendix A 



Poisson Distribution, Individual Probabilities 
forx = number of occurrences, prob (x = k) 



1.1 



1.2 



1.3 



1.4 



1.5 



1.6 



1.7 



1.8 



1.9 



2.0 



0 0.3329 0.3012 0.2725 0.2466 0.2231 0.2019 

1 0.3662 0.3614 0.3543 0.3452 0.3347 0.3230 

2 0.2014 0.2169 0.2303 0.2417 0.2510 0.2584 

3 0.0738 0.0867 0.0998 0.1128 0.1255 0.1378 

4 0.0203 0.0260 0.0324 0.0395 0.0471 0.0551 

5 0.0045 0.0062 0.0084 0.0111 0.0141 0.0176 

6 0.0008 0.0012 0.0018 0.0026 0.0035 0.0047 

7 0.0001 0.0002 0.0003 0.0005 0.0008 0.001 1 

8 0.0000 0.0000 0.0001 0.0001 0.0001 0.0002 

9 0.0000 0.0000 0.0000 0.0000 

10 A 



0.1827 
0.3106 
0.2640 
0.1496 
0.0636 
0.0216 
0.0061 
0.0015 
0.0003 
0.0001 
0.0000 



0.1653 
0.2975 
0.2678 
0.1607 
0.0723 
0.0260 
0.0078 
0.0020 
0.0005 
0.0001 
0.0000 



0.1496 
0.2842 
0.2700 
0.1710 
0.0812 
0.0309 
0.0098 
0.0027 
0.0006 
0.0001 
0.0000 



0.1353 
0.2707 
0.2707 
0.1804 
0.0902 
0.0361 
0.0120 
0.0034 
0.0009 
0.0002 
0.0000 



For A = 1 .2 births per day, the probability that 
there will be exactly 3 births on a given day is 
P(x= 3), or 0.0867. 



TABLE 6.4 



A portion of the table 
of cumulative Poisson 
probabilities from 
Appendix A 



Poisson Distribution, Cumulative Probabilities 
forx = number of occurrences, prob (x < k) 



1.1 



1.2 



1.3 



1.4 



1.5 



1.6 



1.7 



1.8 



1.9 



2.0 



0.3329 0.3012 0.2725 0.2466 0.2231 0.2019 

0.6990 0.6626 0.6268 0.5918 0.5578 0.5249 

0.9004 0.8795 0.8571 0.8335 0.8088 0.7834 

0.9743 0.9662 0.9569 0.9463 0.9344 0.9212 

0.9946 0.9923 0.9893 0.9857 0.9814 0.9763 

0.9990 0.9985 0.9978 0.9968 0.9955 0.9940 

0.9999 0.9997 0.9996 0.9994 0.9991 0.9987 

1.0000 1.0000 0.9999 0.9999 0.9998 0.9997 

1.0000 1.0000 1.0000 1.0000 



0.1827 
0.4932 
0.7572 
0.9068 
0.9704 
0.9920 
0.9981 
0.9996 
0.9999 
1.0000 



0.1653 
0.4628 
0.7306 
0.8913 
0.9636 
0.9896 
0.9974 
0.9994 
0.9999 
1 .0000 



0.1496 
0.4337 
0.7037 
0.8747 
0.9559 
0.9868 
0.9966 
0.9992 
0.9998 
1.0000 



0.1353 
0.4060 
0.6767 
0.8571 
0.9473 
0.9834 
0.9955 
0.9989 
0.9998 
1 .0000 




For A = 1 .2 births per day, the probability that there 
will be no more than 3 births on a given day is 
P(x<3), or 0.9662. 
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As with the binomial distribution, Excel and Minitab can provide either indi- 
vidual or cumulative Poisson probabilities. The procedures and the results are 
shown in Computer Solutions 6.3. 

The Poisson Approximation to the Binomial 
Distribution 

When n is relatively large and 77 (the probability of a success in a given trial) is 
small, the binomial distribution can be closely approximated by the Poisson 
distribution. As a rule of thumb, the binomial distribution can be satisfactorily 
approximated by the Poisson whenever n > 20 and tt ^ 0.05. Under these condi- 
tions, we can just use A = mr and find the probability of each value of x using the 
Poisson distribution. 



cc 

• 


Dmputer solutions G.3 




T 








\ 









Poisson Probabilities 

These procedures show how to get probabilities associated with a Poisson distribution with mean = 1.2. 



For each software package: 

Procedure I: Individual or cumulative probabilities for x = 2 occurrences. 
Procedure II: Complete set of individual or cumulative probabilities for x = 0 occurrences 
through x = 8 occurrences. 



EXCEL 





A 


B 


C 


1 


X 


P(X = x) 


P(X <= x) 


2 


0 


0.3012 


0.3012 


3 


1 


0.3614 


0.6626 


4 


2 


0.2169 


0.8795 


5 


3 


0.0867 


0.9662 


6 


4 


0.0260 


0.9923 


7 


5 


0.0062 


0.9985 


8 


6 


0.0012 


0.9997 


9 


7 


0.0002 


1.0000 


10 


8 


0.0000 


1.0000 



I. Excel procedure for just one individual or cumulative Poisson probability 

1. Click on the cell into which Excel will place the probability. Click f Click Statistical in the Category box. Click 
POISSON in the Function box. Click OK. 

2. Enter the number of occurrences (2) into the X box. Enter the mean of the distribution (1.2) into the Mean box. To ob- 
tain the individual probability, P(x = 2), enter false into the Cumulative box. To obtain the cumulative probability, 
P(x < 2), enter true into the Cumulative box. Click OK. 
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II. Excel procedure for a complete set of individual or cumulative Poisson probabilities 

1 . Enter the column labels into A1:C1 as shown in the printout. Enter the lowest number of occurrences in the set (in this 
case, 0) into A2. Enter 1 into A3. Click on cell A2 and highlight cells A2:A3 Click at the lower right corner of A3 (the 
cursor will turn to a " + " sign) and drag downward so cells A2:A10 will contain the values 0 through 8. 

2. To get the set of individual probabilities: Click on cell B2 (this is where the first individual probability will go). Follow the 
remainder of the steps in Excel procedure I, except enter A2 into the X (number of occurrences) box and enter false 
into the Cumulative box. The first probability now appears in B2. Click to select cell B2. Click the lower right corner of 
B2 (the cursor turns to a " + " sign) and drag down until cells B2:B10 contain the set of individual probabilities. 

3. To get the set of cumulative probabilities: Click on cell C2 (this is where the first cumulative probability will go). Follow 
the remainder of the steps in Excel procedure I, except enter A2 into the X (number of occurrences) box and enter true 
into the Cumulative box. The first probability now appears in C2. Click to select cell C2. Click the lower right corner 
of C2 (the cursor turns to a " + " sign) and drag down until cells C2:C10 contain the set of cumulative probabilities. 



MINITAB 

Probability Density Function Cumulative Distribution Function 

Poisson with mean = 1.2 Poisson with mean = 1.2 



X 


P( X = x ) 


X 


P( X <= x ) 


0 


0.301194 


0 


0.30119 


1 


0.361433 


1 


0.66263 


2 


0.216860 


2 


0.87949 


3 


0.086744 


3 


0.96623 


4 


0.026023 


4 


0.99225 


5 


0.006246 


5 


0.99850 


6 


0.001249 


6 


0.99975 


7 


0.000214 


7 


0.99996 


8 


0.000032 


8 


1.00000 



I. Minitab procedure for just one individual or cumulative Poisson probability 

1 . Click Calc. Select Probability Distributions. Click Poisson. 

2. In the Poisson Distribution menu, enter 1.2 into the Mean box. Select Input constant and enter the desired x value 
(e.g., 2) into the box. Select either Probability or Cumulative probability, depending on whether you want an in- 
dividual probability or a cumulative probability. Click OK. 

II. Minitab procedure for a complete set of individual or cumulative binomial probabilities 

Enter the desired x values (in this case, 0 through 8) into column C1. Follow steps 1 and 2 above, except select Input 
column and enter C1 into the box. 



example 



Poisson Approximation to Binomial 

Past experience has shown that 1.0% of the microchips produced by a certain 
firm are defective. A sample of 30 microchips is randomly selected from the firm's 
production. If x = the number of defective microchips in the sample, determine 
P{x = 0), P(x = 1), P{x = 2), P(x = 3), and P(x = 4). 
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SOLUTION 

The Binomial Approach 

The descriptors for the binomial distribution are tt = 0.01, the chance that any 
given microchip will be defective, and n = 30, the number of microchips in the 
sample. Although we will skip the actual binomial calculations, we would use 
tt = 0.01 and n = 30 in arriving at the probabilities shown below. 

The Poisson Approximation Approach 

The only descriptor for the Poisson distribution is A = mr = 30(0.01), or A = 0.3 
microchips expected to be defective in the sample. Referring to the Poisson table 
for A = 0.3, we find the following probabilities for x = 0 through x = 4. Note 
the very close similarity between each binomial probability and its Poisson 
approximation: 



P(x), Using the 



x = Number of 

Defective Microchips Binomial Poisson 

in the Sample Distribution Approximation 

0 0.7397 0.7408 

1 0.2242 0.2222 

2 0.0328 0.0333 

3 0.0031 0.0033 

4 0.0002 0.0003 



exercises 






6.39 For a discrete random variable that is Poisson 
distributed with A = 2.0, determine the following: 
a. P(x = 0) b. P(x = 1) c. P(x < 3) 
d. P(x > 2) 

6.40 For a discrete random variable that is Poisson 
distributed with A = 9.6, determine the following: 
a. P(x = 7) b. P(x = 9) c. P(x < 12) 
d. P(x > 10) 

6.41 In 2000, there were about 530 motor vehicle thefts 
for every 100,000 registrations. Assuming (1) a Poisson 
distribution, (2) a community with a comparable theft 
rate and 1000 registered motor vehicles, and (3) x = the 
number of vehicles stolen during the year in that commu- 
nity, determine the following: source: Statistical Abstract, 

pp. 183, 850. 

a. E(x) b. P(x = 3) c. P(x = 5) 

d. P(x < 8) e. P(3 < x < 10) 

6.42 Arrivals at a walk-in optometry department in 
a shopping mall have been found to be Poisson 
distributed with a mean of 2.5 potential customers arriv- 
ing per hour. If x = number of arrivals during a given 



hour, determine the following: 

a. E(x) b. P(x = 1) c. 

d. P(x < 5) e. P(2 < x < 6) 



P(x = 3) 



6.43 The U.S. divorce rate has been reported as 3.8 
divorces per 1000 population. Assuming that this rate 
applies to a small community of just 500 people and is 
Poisson distributed, and that x = the number of 
divorces in this community during the coming year, 
determine the following: source: New York Times Almanac 
2006, p. 286. 

a. E(x) b. P(x = 1) c. P(x = 4) 

d. P(x < 6) e. P(2 < x < 5) 

6.44 During the 12 P.M.-l P.M. noon hour, arrivals at a 
curbside banking machine have been found to be Poisson 
distributed with a mean of 1.3 persons per minute. If 

x = number of arrivals during a given minute, determine 
the following: 

a. E(x) b. P(x = 0) c. P(x = 1) 

d. P(x < 2) e. P(l < x < 3) 

6.45 During the winter heating season in a northern state, 
Howard's Heating and Cooling Service receives an 
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average of 3.1 emergency service calls per day from heat- 
ing customers in their area. Howard has increased person- 
nel and equipment resources to better handle such calls, 
and his company is now able to satisfy a maximum of 8 
emergency service calls per day. During a typical day dur- 
ing the heating season, and assuming the daily number of 
emergency service calls to be Poisson distributed, what is 
the probability that Howard will receive more emergency 
service calls tomorrow than his company is able to 
handle? Considering this probability, would it seem advis- 
able for Howard to increase or to decrease the level of per- 
sonnel and equipment he devotes to handling such calls? 

6.46 Over the past year, a university's computer system 
has been struck by a virus at an average rate of 0.4 
viruses per week. The university's information technology 
managers estimate that each time a virus occurs, it costs 
the university $1000 to remove the virus and repair the 
damages it has caused. Assuming a Poisson distribution, 
what is the probability that the university will have the 
good fortune of being virus-free during the upcoming 
week? During this same week, what is the expected 
amount of money that the university will have to spend 
for virus removal and repair? 

6.47 According to the Mortgage Bankers Association of 
America, the foreclosure rate on home mortgages in 2001 
was 1%. Assuming that this rate is applicable to a 
community where 1000 homes have mortgages, use the 
Poisson approximation to the binomial distribution to 
determine the following for x = the number of 
foreclosures in this community during the coming year: 
source: Statistical Abstract of the United States 2006, p. 768. 

a. E(x) b. P(x = 7) c. P(x = 10) 

d. P(x < 12) e. P(8 < x < 15) 

6.48 The U.S. Department of Transportation reports 
that America West Airlines got 7.65 complaints of 
mishandled baggage per 1000 passengers in March 2000. 
Assuming this rate applies to the next 500 passengers 
who depart from Portland, Oregon, on America West 
flights, what is the probability that at least 5 people will 



have their baggage mishandled? That no more than 3 
people will have their baggage mishandled? source: Bob 
Laird, "Airlines With the Most Reports of Mishandled Baggage," USA 
Today, May 11, 2000, p. 1B. 

6.49 Taxicab drivers are more likely to be murdered on 
the job than members of any other occupation, even po- 
lice officers and security guards. The annual murder rate 
for cab drivers is 30.0 homicides per 100,000 workers, 
compared to just 0.6 per 100,000 for all occupations. 
Assume that taxicab companies in a large Northeastern 
city employ a total of 3000 drivers and that the family of 
any driver murdered on the job receives $50,000 from the 
city. What is the probability that at least two of the city's 
taxicab drivers will be murdered on the job during the 
next year? What is the approximate expected value for 
the amount of money the city will be providing to fami- 
lies of murdered taxicab drivers during the next year? 
source: "Occupational Homicide Rate Is Highest Among Taxi Drivers," USA 
Today, May 2, 2000, p. 3A. 

6.50 A manufacturing plant's main production line 
breaks down an average of 2.4 times per day. Whenever 
the line goes down, it costs the company $500 in mainte- 
nance, repairs, and lost production. What is the probabil- 
ity that the production line will break down at least 3 
times tomorrow? What is the approximate expected value 
for the amount of money that production line 
breakdowns will cost the company each day? 

6.51 The company in Exercise 6.50 has disciplined a 
worker who was suspected of pilfering tools and supplies 
from the plant. The very next day, the production line 
broke down 9 times. Management has confronted the 
union with accusations of sabotage on the line, but the 
union president says it's just a coincidence that the 
production line happened to break down so many times 
the day after the worker was disciplined. Using probabili- 
ties appropriate to your discussion, comment on how 
much of a coincidence this high number of breakdowns 
would appear to be. 



6.5 SIMULATING OBSERVATIONS FROM A DISCRETE 

PROBABILITY DISTRIBUTION 



Besides describing the exact probability distribution for a discrete random variable, 
statistical software can also randomly select observations from the distribution it- 
self. For example, we can use Excel or Minitab to randomly select observations 
from a given Poisson distribution, then display them in the form of a frequency 
distribution. Computer Solutions 6.4 describes the necessary procedures. Because 
we are using two different software packages, there will be two different sets of ran- 
domly selected data. We will display and discuss the results for Minitab. 
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computer solutions G.4 




Simulating Observations from a Discrete Probability Distribution 

These procedures simulate 10,000 observations from a Poisson distribution with mean = 1.2. The 
procedures for simulating observations from a binomial distribution will differ only slightly from those 
described here. 

EXCEL 

1 . Start with a blank worksheet. Click Tools. Click Data Analysis. Click Random Number Generation. Click OK. 

2. Enter 1 into the Number of Variables box. Enter 10000 into the Number of Random Numbers box. Within the 
Distribution box, select Poisson. Enter the mean (1.2) into the Lambda box. Select Output Range and enter A1 
into the box. Click OK. The 10,000 simulated observations will be located in A1:A10000. 



MINITAB 

1 . Click Calc. Select Random Data. Click Poisson. Enter 10000 into the Number of rows of data to generate box. 
Enter C1 (the destination column) into the Store in columns box. Enter 1.2 into the Mean box. Click OK. The 10,000 
simulated observations will be located in C1. 

2. To obtain the frequencies for the different values of x, click Stat, select Tables, and click Tally Individual Variables. 
Enter C1 into the Variables box and select Counts. Click OK. The relative frequencies shown in part B of the right side 
of Table 6.5 are obtained by dividing each of the frequencies by 1 0,000. 

The frequency distributions in Table 6.6 can be generated using the Minitab procedure described previously in Computer 
Solutions 2.1 . As shown in this earlier procedure, we have the option of displaying the frequencies on the chart istelf. Note 
that users of the Minitab Student Edition are limited to a worksheet size of 5000 data values, so the number of simulated 
observations must be no more than 5000 into an otherwise empty worksheet. 



Table 6.5 (page 196) shows the individual P(x) probabilities as computed by 
Minitab for three different Poisson distributions (A = 0.1, A = 1.2, and A = 5.0). 
Next to each theoretical probability is the relative frequency with which the x 
value occurred when Minitab randomly selected 10,000 observations from that 
distribution. 

Note that the relative frequencies for the randomly selected x values are very 
close, but not necessarily the same as the theoretical probabilities listed in the 
probability distribution. As we discussed in the previous chapter, P(heads) = 0.5 
for a coin flip, but flipping a coin 10 times doesn't mean that you're certain to get 
heads exactly 5 times. However, due to the law of large numbers, observed rela- 
tive frequencies will more closely approach the theoretical probability distribu- 
tion as the number of observations grows larger. 

Parts A through C of Table 6.6 show Minitab frequency distributions for 
10,000 observations randomly selected from the corresponding probability distri- 
bution of Table 6.5. In comparing the three frequency distributions of Table 6.6, 
notice how they become more symmetrical as A is increased when we proceed 
from A to C. Although the Poisson distribution is always positively skewed, it be- 
comes more symmetrical as its mean, A, becomes larger. 

The ability to select random observations from a known or assumed probabil- 
ity distribution also has applications in a number of management science areas. 
Such applications typically involve using one or more such distributions to math- 
ematically represent real-world phenomena of interest. In general, their purpose 
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TABLE 6.5 



Besides describing a proba- 
bility distribution, the com- 
puter can also randomly 
select observations from 
the distribution itself. 



Minitab output. Poisson probabil- 
ity distribution for the discrete ran- 
dom variable, x: 



Probability Density Function 

Poisson with mean = 0.1 



X 


PC x 


= x ) 


0 


0. 


904837 


1 


0. 


090484 


2 


0. 


004524 


3 


0. 


000151 


4 


0. 


000004 



Probability Density Function 

Poisson with mean = 1.2 



X 


PC X 


= x ) 


0 


0 


301194 


1 


0 


361433 


2 


0 


216860 


3 


0 


086744 


4 


0 


026023 


5 


0 


006246 


6 


0 


001249 


7 


0 


000214 


8 


0 


000032 



Probability Density Function 

Poisson with mean = 5 



X 


PC X 


= x ) 


0 


0 


006738 


1 


0 


033690 


2 


0 


084224 


3 


0 


140374 


4 


0 


175467 


5 


0 


175467 


6 


0 


146223 


7 


0 


104445 


8 


0 


065278 


9 


0 


036266 


10 


0 


018133 


11 


0 


008242 


12 


0 


003434 


13 


0 


001321 


14 


0 


000472 


15 


0 


000157 


16 


0 


000049 



When Minitab randomly 
selected 10,000 
observations from the 
distribution, the relative 
frequency with which each x 
value appeared: 



0 


9056 


0 


0892 


0 


0050 


0 


0002 


0 


0000 



0 


3060 


0 


3588 


0 


2085 


0 


0918 


0 


0273 


0 


0063 


0 


0011 


0 


0002 


0 


0000 




0 


0067 


0 


0343 


0 


0848 


0 


1424 


0 


1824 


0 


1693 


0 


1433 


0 


1059 


0 


0637 


0 


0345 


0 


0195 


0 


0086 


0 


0029 


0 


0010 


0 


0005 


0 


0001 


0 


0001 
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A. 10,000 random observations from Poisson distribution with mean, A = 0.1 



9000 
8000 
7000- 
6000- 



§ 5000- 
9 

| 4000- 



3000 
2000 
1000- 
0 



TABLE 6.6 



Minitab frequency distribu- 
tions for 1 0,000 
observations randomly 
taken from each of the 
three Poisson probability 
distributions in Table 6.5 



B. 1 0,000 random observations from Poisson distribution with mean, A = 1 .2 




(continued) 
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is to help the management scientist gain insight into how the processes interact, as 
well as to determine how these interactions might be influenced by managerial de- 
cisions. Most introductory management science texts include extensive treatment 
of the nature, the construction, and the applications of such mathematical mod- 
els of the real world. 




Note: These exercises require a computer and statistical 
software capable of selecting random observations from 
discrete probability distributions. 

6.52 Using your statistical software package: 

a. Simulate 100 observations from a Poisson 
distribution having A = 2.35. 

b. Generate the individual Poisson probabilities for a 
Poisson distribution having A = 2.35. 

c. Repeat part (a). 

d. Repeat part (b). 

e. When part (a) is repeated, the results are now different. 
However, when part (b) is repeated, the results are the 
same. What does this say about frequency 
distributions versus probability distributions? 



6.53 Using your statistical software package: 

a. Simulate 50 observations from a binomial distri- 
bution for which n = 5 and n = 0.2. 

b. Generate a frequency distribution for these observa- 
tions. Is the distribution symmetrical? If not, is it 
positively skewed or negatively skewed? 

c. Repeat parts (a) and (b), but simulate 500 
observations. Is the frequency distribution more 
symmetrical or less symmetrical than the one 
previously obtained? 

d. Comment on how the shape of the distribution would 
change if you were to continue increasing the number 
of simulated observations. 



Chapter 6: Discrete Probability Distributions 



199 



SUMMARY 6.6 



• Discrete random variable and its probability distribution 

A discrete random variable can take on only certain values along a range or con- 
tinuum, and there are gaps between these possible values. On the other hand, a 
continuous random variable can have any value along a range or continuum. This 
chapter has focused on discrete random variables, while the next concentrates on 
continuous random variables. 

The probability distribution for a discrete random variable lists all possible 
values the variable can assume, along with their respective probabilities of occur- 
rence. There are several types of discrete distributions, each with its own charac- 
teristics and applications. 

• Binomial distribution and the Bernoulli process 

The binomial distribution deals with consecutive trials, each of which has only 
two possible outcomes, generally referred to as success or failure. This distribu- 
tion assumes a Bernoulli process, in which the trials are independent of each 
other, and the probability of success remains constant from one trial to the next. 

• Hypergeometric distribution 

Like the binomial distribution, the hypergeometric distribution deals with consec- 
utive trials, and there are only two possible outcomes for each trial. The trials are 
not statistically independent, however, and the probability of a success is not con- 
stant from one trial to the next. 



• Poisson distribution 

The Poisson distribution applies to events for which the probability of occurrence 
over a given span of time, space, or distance is extremely small. In this distribu- 
tion, the random variable is the number of occurrences in the specified span of 
time, space, or distance. For a Bernoulli process in which the number of trials is 
large and the probability of success on any given trial is very small, the Poisson 
distribution can be used as an approximation to the binomial distribution. 

• Determining probabilities for a discrete random variable 

Appendix A contains tables for the binomial and Poisson distributions. However, 
computer statistical packages enable the user to determine the probabilities asso- 
ciated with a given distribution without the need to refer to such tables. 

• Simulation of observations from a discrete probability distribution 

Some computer packages can randomly select a sample of observations from a 
probability distribution that has already been specified. In addition to being 
useful in more advanced management science applications such as simulation 
models intended to represent real-world phenomena, this capability enables the 
user to easily view the approximate shape of a given probability distribution. 
Based on the law of large numbers, the observed relative frequencies will more 
closely approach the theoretical probability distribution as the number of obser- 
vations grows larger. 
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Characteristics of a Discrete Probability Distribution 

• For any value of x, 0 £ P(x) £ 1. 

• The values of x are exhaustive: The probability distribution includes all possible 
values. 

• The values of x are mutually exclusive: Only one value can occur for a given 
experiment. 

• The sum of their probabilities is 1, or 2P(x I ) = 1.0. 

General Formulas: Mean and Variance of a Discrete Probability Distribution 

• Mean: 

fx = E(x) or fx = Ex ; P(x ; ) for all possible values of x 

• Variance: 

a 2 = E[(x — fx) 2 ] or a 2 = — fx) 2 F(x t ) for all possible values of x, 

and the standard deviation 

T 2 



The Binomial Probability Distribution 

The probability of exactly x successes in n trials is 
w! 



P(x) 



• Mean: 



Variance: 



tt x (1 — tt)" x where n = number of trials 
x -\ n x l- x = number of successes 

7r = the probability of success in 
any given trial 
(1 — it) = the probability of failure in 
any given trial 



fX = E(x) = W7T 



O- 2 = E[(X - fX) 2 ] = M77(l - 77) 



The Hypergeometric Distribution 

The probability of exactly x successes in n trials that are not independent is 



P(x) 



IN 
\n - 



N 
1 n 



where N = size of the population 
n = size of the sample 
s = number of successes in the population 
x = number of successes in the sample 



Mean: 



= E(x) 



N 
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Variance: 



The Poisson Probability Distribution 

The probability that an event will occur exactly x times over a given span of time, space, 
or distance is 

A* • e~ A 

P(x) = j where X = the mean, or E(x); the expected number of 

occurrences over the given span 
e = the mathematical constant, 2.71828 (e is the 
base of the natural logarithm system) 

(Note: In the Poisson distribution, the mean and the variance are equal.) 



chapter Exercises 



6.54 The owner of a charter fishing boat has found that 
12% of his passengers become seasick during a half- 
day fishing trip. He has only two beds below deck to 
accommodate those who become ill. About to embark 
on a typical half-day trip, he has 6 passengers on 
board. What is the probability that there will be 
enough room below deck to accommodate those who 
become ill? 

6.55 During 2001, the crash rate for commuter air carri- 
ers was 2.1 per 100,000 flying hours. Assuming this rate 
continues, what is the probability that there will be no 
more than one crash in the next 50,000 flying hours? 

source: Statistical Abstract, p. 663. 

6.56 Bill has to sell three more cars this month in order 
to meet his quota. Tonight he has after-dinner 
appointments with five prospective customers, each of 
whom happens to be interested in a different car. If he has 
a 30% chance of success with each customer, what is the 
probability that he will meet his quota by tomorrow 
morning? 

6.57 In Exercise 6.56, suppose that three of the 
customers are interested in the same car, and that they 
will go elsewhere if it has already been sold. Would it be 
appropriate to use the binomial distribution under these 
conditions? Why or why not? 

6.58 During the 1999 tax filing season, 13.7% of all indi- 
vidual U.S. tax returns were prepared by H&R Block. 
For a random selection of 3 tax returns, describe the 
probability distribution for x = the number in the sample 
whose returns were prepared by H&R Block, source: H&R 
Block, Inc., Annual Report 1999, p. 3. 

6.59 It has been estimated that 40% of all U.S. children 
are covered by life insurance. For a randomly selected 



group of 10 children, use the appropriate statistical table 
to describe the probability distribution for x = the num- 
ber in the sample who are covered by life insurance. 

source: 1994 Life Insurance Fact Book, p. 35. 

6.60 A company has installed five intrusion-detection 
devices in an office building. The devices are very sensi- 
tive and, on any given night, each one has a 0.1 proba- 
bility of mistakenly sounding an alarm when no 
intruder is present. If two or more of the devices are 
activated on the same night, the intrusion system auto- 
matically sends a signal to the local police. If no 
intruder is present on a given night, what is the 
probability that the system will make the mistake of 
calling the police? 

6.61 A mining company finds that daily lost-work 
injuries average 1.2. If the local union contract has a 
clause requiring that the mine be shut down as soon as 
three workers incur lost-work injuries, on what percent- 
age of the days will the mine be operational throughout 
the day? 

6.62 Unknown to a quality-control inspector, 20% of a 
very large shipment of electric switches are defective. The 
inspector has been told to reject the shipment if, in a sam- 
ple of 15 switches, 2 or more are defective. What is the 
probability that the shipment will be rejected? 

6.63 Repeat Exercise 6.29, but under the assumption that 
there are only 20 bearings in the supply bin, 2 of which 
are defective. 

6.64 A town council includes 10 Democrats and 15 
Republicans. In response to a developer's request for a 
variance that would allow her to build a convenience 
store in an area of town currently zoned as residential, 
the mayor is going to randomly select 5 council 
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members for a committee to examine the issue. What is 
the probability that there will be at least 3 Republicans 
on the committee? 

6.65 Fred and Emily are among the 50 finalists in a 
lottery in which 5 winners will be randomly selected to 
receive a free vacation in Hawaii. Using the hypergeomet- 
ric distribution, determine the probability that exactly 
one of these two individuals will win a free vacation. 

6.66 Seven of the 15 campus police officers available for 
assignment to the auditorium in which a local politician 
is to speak have received advanced training in crowd con- 
trol. If 5 officers are randomly selected for service during 
the speech, what is the probability that exactly 2 of them 
will have had advanced training in crowd control? What 
is the probability that at least 3 of them will have had ad- 
vanced training? 

6.67 A producer of copper tubing has found that, on the 
average, five flaws occur per 2000 feet of tubing. The tub- 
ing is sold in continuous rolls of 200 feet each. What is 
the probability that a randomly selected 200-foot roll will 
have no flaws? 

6.68 A large plumbing contractor buys eight 200-foot 
rolls from the producer described in Exercise 6.67. Deter- 
mine the probability that no more than three of the eight 
rolls will be flawless. 

6.69 It has been estimated that about 30% of families 
with an annual income of $100,000 or more own 
savings bonds. What is the probability that exactly 
three of a randomly selected group of five such families 
own savings bonds? source: Statistical Abstract, p. 720. 

6.70 A trucking company has found that its trucks aver- 
age 0.2 breakdowns during round trips from New York 
to Los Angeles. 

a. What is the probability that a single truck will make 
the complete trip without experiencing a breakdown? 

b. If 3 trucks are assigned to a NY/LA round trip, what 
is the probability that at least 2 of them will make the 
complete trip without experiencing a breakdown? 

6.71 The Bureau of Labor Statistics says that baggage 
handlers, trash haulers, and coal miners are among the 
occupations having the highest risk of a debilitating 
injury or illness. For coal miners working over the course 
of a year, the injury rate has been reported as 3.6 injuries 
per 100 coal miners. For a company that employs 200 
coal miners, what is the probability that at least 10 of its 
workers will experience a debilitating injury or illness 
over the next year? source: "Workers at Risk," USA Today, March 
15, 2006, p. 6D. 

6.72 According to the U.S. Centers for Disease Control 
and Prevention, Americans 85 or more years of age 
annually experience flu-associated respiratory and 
circulatory illnesses at a rate of approximately 1200 



cases per 100,000 persons. If a rural county has 500 
seniors in this age group, what is the probability that 
no more than 5 of them will experience such an illness 
during the coming year? If, in such a county, 15 seniors 
in this age group were to come down with such an 
illness during the coming year, would this be considered 
rather unusual? Explain your conclusion, source: Anita 
Manning, "Tens of Thousands More People Are Hospitalized for Flu Than 
Expected," USA Today, September 15, 2004, p. 5D. 

6.73 Although they have no commercials, public 
television stations periodically conduct campaigns dur- 
ing which programming is occasionally interrupted by 
appeals for subscriber donations. A local public 
television station has found that, during these breaks, 
the number of calls averages 3 per minute. It has 
recently upgraded the switchboard so it can handle as 
many as 5 calls per minute. The station is now in the 
middle of a program break. What is the probability that 
the switchboard will be overloaded during the next 
minute? 

6.74 A gorilla once made headlines in 400 newspapers 
and appeared on three national television shows after 
correctly "selecting" the winner of 13 out of 20 football 
games. He had made his "choice" for each game by grab- 
bing one of two pieces of paper on which the names of 
the teams were written. If a great many gorillas had the 
same opportunity, what percentage of them would be at 
least this lucky? source: "Gone Ape," Pittsburgh Press, June 8, 
1986. 

6.75 It has been estimated that over 90% of the world's 
large corporations are actively involved in data 
warehousing. Using the 90% figure, and considering a 
random sample of 10 large corporations, what is the 
probability that at least 7 of them are actively involved in 
data warehousing? 

6.76 A country club has found that the number of com- 
plete rounds played by a member during a given outing 
averages 1.3 rounds. What is the probability that a golfer 
selected at random will complete at least 2 rounds of golf 
on his or her next outing? 

6.77 A computer manufacturer that sells notebook 
computers directly to customers has been having 
problems with one of its models — it seems that about 
10% of the units are inoperable upon arrival to the 
customer. They are functional when tested at the 
factory, but there is some problem associated with 
the shipping/handling process that the company's 
engineers have not yet been able to identify and correct. 
A highly valued customer places an emergency order 
for 8 of these notebooks, and wants them delivered 
overnight and ready for use in the field by tomorrow. 
Wanting to be honest with this customer, and wishing 
to help ensure that the customer receives at least 8 
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working computers, the manufacturer offers to send 9 
units instead of the 8 that the customer needs. The cus- 
tomer accepts this gesture of goodwill and 9 units are 
sent. What is the probability that the customer will end 
up with at least 8 working computers? 

6.78 According to the National Insurance Crime Bureau, 
the metropolitan area with the highest incidence of auto 
theft is Miami, Florida, with an annual rate of 2.6 thefts 
for every 100 vehicles registered in 1997. a The University 
of Miami has approximately 1100 faculty members. b 
Under the assumption that the average faculty member 
has just one registered vehicle, and applying the vehicle 
theft rate reported for the city, what is the expected 
number of faculty vehicles that will be stolen during the 
upcoming year? Assuming that one of the academic 
departments has 50 faculty members, what is the proba- 
bility that all of them will make it through the coming 
year without having a vehicle stolen? That at least 2 of 
them will have a vehicle stolen next year? sources: "National 
Insurance Crime Bureau, in "Top Theft Areas," USA Today, March 22, 
1999, p. 13B. h World Almanac and Book of Facts 2003, p. 256. 

6.79 A tire manufacturer has advertised that only 2% of 
its tires have surface blemishes. Further, the 
manufacturer argues that tires with such blemishes are 
perfectly safe, and with only the physical appearance suf- 
fering very slightly. A consumer magazine randomly pur- 
chases 40 of the tires for an upcoming tire performance 
comparison report. Upon examining the tires, the maga- 
zine's test engineers find that 5 of them have surface 
blemishes. If it were true that just 2% of the tires 
produced have surface blemishes, what would be the 
probability of getting 5 or more with such blemishes in a 
random sample of 40 tires? Considering this probability, 
comment on the believability of the tire manufacturer's 
advertising claim. 



6.80 It has been estimated that 5% of people who are 
hospitalized pick up infections while in the hospital, and 
that 2.2 million people a year get sick and 88,000 die 
because of these infections. Experts advise patients to be 
sure that health care personnel have washed their hands, 
to select surgeons with higher levels of experience, and 
to minimize the amount of time that they are treated 
with catheterization. Grove Tree Mountain Hospital an- 
ticipates admitting 100 patients this month for surgical 
and other treatments. Suppose that 14 of the patients in 
the sample actually end up getting a hospital-borne 
infection. Treating the 100 patients as a random sample 
of all hospital patients, and applying the infection rate 
cited above, what is the probability that a typical hospi- 
tal would have 14 or more infections in a sample of this 
size? Considering this probability, comment on whether 
Grove Tree Mountain might not be a typical hospital in 
terms of its patients' likelihood of picking up an 
infection, source: Anita Manning, "Hospital Infections Jump as Out- 
patient Care Rises," USA Today, March 6, 2000, p. 8D. 

6.81 J. D. Power and Associates' Initial Quality Study 
reports that the industry-average problem rate for 
vehicles is 154 problems per 100 vehicles. The 
highest-rated marque was Acura, with a rate of just 
91 problems per 100 vehicles. On the other hand, 
Kia was reported as having 251 problems per 100 
vehicles. Allen is on his way to pick up his brand-new 
Acura. At the same time, Kevin is on his way to pick up 
his brand-new Kia. What is the probability that Allen's 
Acura will have exactly 2 problems? What is the 
probability that Kevin's Kia will have no more than 1 
problem? What is the probability that neither Allen nor 
Kevin will have a problem with his car? source: Mark 
Rechtin, "Import Brands Set Pace in Power Quality Tally," Automotive 
News, May 1 5, 2000, p. 4. 



integrated case 



m 




Thorndike Sports Equipment 



Arriving at the office a bit early, Ted finds Luke Thorndike 
in a fit of rage. Mr. Thorndike is sipping coffee and look- 
ing over Ted's notes on the defective racquets that were 
constructed with aluminum purchased from each of the 
company's three suppliers. 

In rummaging through the basement crate, Ted 
has found a total of 30 defective racquets. Of these, 5 
were made of aluminum supplied by the Snow- 
met Corporation and 5 were made of aluminum purchased 
from Barstow Aluminum, Inc. These were the two 



suppliers from whom Thorndike Sports Equipment had 
been buying aluminum for many years, and with good 
results. 

The cause of Mr. Thorndike's anger was the total of 
20 defective racquets made from aluminum purchased 
from the company's newest and lowest-priced supplier, 
Darwood Discount Metals, Inc. Luke had just placed a big 
order with Darwood, and he was counting the minutes 
until 10 A.M. so that he could call the West Coast firm, can- 
cel the order, and give Mr. Darwood a big piece of his 
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mind. The arrival of the morning mail only served to 
heighten Luke's anger, as three more complaints were re- 
ceived from disgruntled customers demanding immediate 
replacements for their broken racquets. 

Ten o'clock finally rolls around, and Luke places a 
person-to-person collect call to Mr. Darwood. After berat- 
ing Mr. Darwood and his products, Luke demands that the 
order be canceled immediately. Mr. Darwood does not ap- 
preciate the elder Thorndike's tirade, but he is quite cog- 
nizant of the fact that 10% of his company's profits come 
from Thorndike Sports Equipment. Though irritated, he 
patiently tries to reason with Mr. Thorndike. 

According to Mr. Darwood, independent metallurgists 
have conducted a lot of tests in which they found Dar- 
wood aluminum to be every bit as good as 
the product supplied by Snowmet and Barstow. He sug- 
gests that the unusually high number of defective racquets 
found by Luke was merely a fluke. Already operating on a 
short fuse, Mr. Thorndike responds that he doesn't much 
care for silly rhymes in the middle of the morning, and he 
warns Darwood not to load any more aluminum onto 
Thorndike-bound trucks until he and Ted have a chance to 
further examine the information Ted collected over the 
weekend. He promises to call Mr. Darwood with a final 
decision by 3 p.m., Pacific Time. 

Ted spends a very busy morning and skips lunch, but 
by 2 p.m. he comes up with some data that might prove use- 
ful. Most important, he has uncovered a research study in 
which it was found that 0.8% of all aluminum racquets end 
up being returned as defective. The number of Snowmet 
and Barstow racquets in the "defectives" crate is about 1% 
of those produced. However, of the 1200 racquets made 
from Darwood aluminum, 20 (about 1.7%) are 
defective.Ted decides to consider the situation as a binomial 
probability distribution in which the probability of a defect 
on a given trial is 0.008, corresponding to the research find- 
ing that 0.8% of all the aluminum racquets produced are 
defective. Using the estimate for the entire industry, the ex- 
pected number of defectives among the 1200 racquets made 



of Darwood aluminum would be mr = 1200(0.008), or 
9.6, but 20 defectives were observed. 

To determine how unusual this result really is, Ted 
uses Excel and n = 1200, tt = 0.008 to find the cumulative 
probabilities for the number of defects. The results are 
shown in Table 6.7. 

Receiving a telephone call from home and finding out 
that the water pipes have just burst and his house is flood- 
ing, Ted has to flee the office. On his way out, he asks you 
to use the information he has found, along with Table 6.7, 
in giving advice to Mr. Thorndike before he makes his call 
to Darwood. 



TABLE 6.7 Ted Thorndike has used Excel in generating these 
cumulative probabilities for a binomial distribution in which the 
number of trials is 1 200 and the probability of a defect in any 
given trial is 0.8%, or 0.008. 
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ATM Arrival Times: A Continuous Distribution 

In the opening vignette to Chapter 6, we described how the number of automatic 
teller machine (ATM) customers over a 5-minute period represents a discrete proba- 
bility distribution, with x = the number of arrivals (0, 1 , 2, etc.) during the period. 

Without getting into trouble with campus security, stake out your local ATM as 
you did for the opener to Chapter 6 — except now use the chronograph feature of 
your (or a friend's) watch to record the times between the arrivals of successive 
customers. For example, x 1 is the time between the arrival of customer A and 
customer B, x 2 is the time between customers B and C, and so on. What you're doing 
now is creating a continuous probability distribution, the topic of this chapter. 

As a matter of fact, our discussion 
here is the flip side of the vignette that i;tt|p a norm 3 ' 

opened Chapter 6. The inverse of rnrnP things look a 
arrivals-per-minute is minutes- 
between-arrivals. Where the discrete 
Poisson distribution of Chapter 6 
focused on the former, the continuous 
exponential distribution of this chapter 
is concerned with the latter. This unique 
relationship between the Poisson and 
exponential distributions will be 
examined later in this chapter. Measuring 
and predicting time between arrivals can 
be important to planners and service 
providers in many real-world situations, 
especially those dealing with emergency 
medical services. 
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learning 
objectives 

After reading this 
chapter, you should 
be able to: 



Understand the nature and the applications of the normal distribution. 

Use the standard normal distribution and z-scores to determine probabilities 
associated with the normal distribution. 

Use the normal distribution to approximate the binomial distribution. 

Understand the nature and the applications of the exponential distribution, 
including its relationship to the Poisson distribution of Chapter 6. 

Use the computer in determining probabilities associated with the normal and 
exponential distributions. 



7.1 INTRODUCTION 



Chapter 6 dealt with probability distributions for discrete random variables, 
which can take on only certain values along an interval, with the possible values 
having gaps between them. This chapter presents several continuous probability 
distributions; these describe probabilities associated with random variables that 
are able to assume any of an infinite number of values along an interval. 

Discrete probability distributions can be expressed as histograms, where the 
probabilities for the various x values are expressed by the heights of a series of 
vertical bars. In contrast, continuous probability distributions are smooth curves, 
where probabilities are expressed as areas under the curves. The curve is a func- 
tion of x, and f(x) is referred to as a probability density function. Since the con- 
tinuous random variable x can be in an infinitely small interval along a range or 
continuum, the probability that x will take on any exact value may be regarded as 
zero. Therefore, we can speak of probabilities only in terms of the probability that 
x will be within a specified interval of values. For a continuous random variable, 
the probability distribution will have the following characteristics: 

The probability distribution for a continuous random variable: 

1. The vertical coordinate is a function of x, described as f(x) and referred to as 
the probability density function. 

2. The range of possible x values is along the horizontal axis. 

3. The probability that x will take on a value between a and b will be the 
area under the curve between points a and b, as shown in Figure 7.1. The 



FIGURE 7.1 



For a continuous random 
variable, the probability dis- 
tribution is described by a 
curve called the probability 
density function, f(x). The 
total area beneath the curve 
is 1.0, and the probability 
that x will take on some 
value between a and b is 
the area beneath the curve 
between points a and b. 



Area = P(a <x<b) 











Navigating Normal Curves with Thorndike Sports Equipment 



I was flattered but kind of surprised when Dr. Weiers asked 
Thorndike Sports Equipment to do a Statistics in Action 
commentary for his textbook. Doesn't he realize we're just 
imaginary? After all, he made us up in the first place. Any- 
way, I usually don't spend a lot of time thinking about the 
normal curve, but preparing this commentary helped open 
my eyes to how often it comes up. 

In many facets of our business, we find a normal curve 
closely approximates the data we've collected. For exam- 
ple, in the "bounce test" we use in monitoring the liveli- 
ness of our golf balls, we find the bounce distances tend to 
be normally distributed. Although we don't like to talk 
about it much, we try to make sure our sponsored 
professionals get balls that are from the most lively 0.5% of 
the output. My grandson Ted tells me that corresponds to 
z = +2.58 standard deviation units from the mean. 

According to our human resources manager, a lot of 
the aptitude and performance tests we give potential and 



current employees also tend to result in scores that are 
distributed according to that classic "bell-shaped" curve. 
She's been with us for many years, and she finds it very 
interesting to compare the means and standard devia- 
tions today with the ones she got ten or twenty years 
ago. 

Ted says we also rely on the normal distribution in con- 
structing and interpreting the statistical process control 
charts that have helped Thorndike racquets gain their rep- 
utation for consistently high quality. Here at Thorndike 
Sports Equipment, we greatly appreciate the applications 
and value of the normal distribution, and we are happy to 
pass our experience on to you. 

Luke Thorndike 
President and Founder 
Thorndike Sports 
Equipment 



probability density function, f(x), for a given continuous distribution is ex- 
pressed in algebraic terms, and the areas beneath are obtained through the 
mathematics of calculus. However, tables are provided in the text for readily 
identifying or calculating such areas. 
4. The total area under the curve will be equal to 1.0. 

The first continuous probability distribution we will examine is the normal dis- 
tribution. As suggested by the commentary in Statistics in Action 7.1, this is a very 
important continuous distribution in both the study of statistics and its application 
to business. The normal distribution is a bell-shaped, symmetrical curve, the use of 
which is facilitated by a standard table listing the areas beneath. 

As demonstrated in Chapter 6, binomial probabilities can involve either 
tedious computations or extensive tables. We will describe a method by which the 
normal distribution can be used as a convenient approximation for determining 
the probabilities associated with discrete random variables that are of the binomial 
distribution. 

The next topic will be the exponential distribution. This is related to the (dis- 
crete) Poisson distribution of the previous chapter but is a continuous probability 
distribution describing probabilities associated with the (continuous) time or 
distance intervals between occurrences of the so-called rare events of a Poisson 
process. 

Finally, we will use Excel and Minitab to randomly select observations 
from a specified continuous probability distribution. This is similar to the 
simulated samples obtained from selected discrete probability distributions in 
Chapter 6. 
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7.1 What is the difference between a continuous proba- 
bility distribution and a discrete probability distribution? 

7.2 What is a probability density function and how is it 
relevant to the determination of probabilities associated 
with a continuous random variable? 



7.3 Why is the total area beneath a probability density 
function equal to 1.0? 

7.4 What is the probability that a continuous random 
variable will take on any specific value? Explain your 
answer. 



THE NORMAL DISTRIBUTION 



Description and Applications 

The normal distribution is the most important continuous distribution in statis- 
tics. It occupies this position for three key reasons: (1) many natural and eco- 
nomic phenomena tend to be approximately normally distributed; (2) it can be 
used in approximating other distributions, including the binomial; and (3) as we 
will see in Chapter 8, sample means and proportions tend to be normally dis- 
tributed whenever repeated samples are taken from a given population of any 
shape. This latter characteristic is related to the central limit theorem, a concept 
also discussed in the next chapter. 

The normal distribution is really a family of distributions, each member of 
which is bell shaped and symmetrical (the left side is a mirror image of the right). 
As Figure 7.2 shows, (1) the mean, median, and mode are all at the same position 
on the horizontal axis; (2) the curve is asymptotic, approaching the horizontal axis 
at both ends, but never intersecting with it; and (3) the total area beneath the curve 
is equal to 1.0. The specific member of the family depends on just two descriptors: 



FIGURE 7.2 



The normal distribution is 
actually a family of bell- 
shaped distributions, each 
of which has these charac- 
teristics. The specific mem- 
ber of the family depends 
on just two descriptors, the 
mean (fx) and the standard 
deviation (a). 



Symmetric: Each side is the 
mirror image of the other 




Asymptotic: 
Tail approaches 
the x axis but 
never reaches it 



Asymptotic: 
Tail approaches 
the x axis but 
never reaches it 



Mean, median, 
and mode 
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FIGURE 7.3 



The shape of a normal 
curve and its position on 
the x axis will depend on its 
mean (/i) and standard de- 
viation (<x). Although curve 
6 has a greater mean, its 
standard deviation is 
smaller. 



the mean (ft) and the standard deviation (a). The normal distribution can be de- 
scribed as follows: 



The normal distribution for the continuous random variable, x (with 

-co < x < +°o): 

f(x) = — j= e -(i/2)[(x-A)/°-] 2 w here fi = mean 

(TV 2 77 



a = standard deviation 
e = the mathematical constant, 2.71828 
7r = the mathematical constant, 3.14159 



In this equation, 77 is the geometric constant for the ratio of the circumference of 
a circle to its diameter, and is not related to the symbol used for a population 
proportion. 

Not all normal curves will look exactly like the one in Figure 7.2. The shape 
of the curve and its location along the x axis will depend on the values of the stan- 
dard deviation and mean, respectively. You can demonstrate this by using Seeing 
Statistics Applet 4 at the end of the chapter. Figure 7.3 shows two normal curves 
representing output diameters for two tubing extrusion machines: 

• The average diameter of tubing from machine B exceeds that from machine A. 
Looking along the x axis, it can be seen that fx B > fx A . 

• The variability of the output from machine A is greater than that from ma- 
chine B. In symbolic terms, this can be expressed as a A > a B . 

Areas Beneath the Normal Curve 

Regardless of the shape of a particular normal curve, the areas beneath it can be 
described for any interval of our choice. This makes it unnecessary to get involved 
with calculus and the complexities of the probability density function presented 
earlier. For any normal curve, the areas beneath the curve will be as follows (see 
also part (a) in Figure 7.4 on page 210): 

• About 68.3% of the area is in the interval fx — a to fx + a. 

• About 95.5% of the area is in the interval fx — 2a to fx + 2a. 

• Nearly all of the area (about 99.7%) is in the interval fx — 3a to fx + 3a. 



NOT 
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FIGURE 7.4 



For any normal distribution, 
the approximate areas 
shown in part (a) will lie 
beneath the curve for the 
intervals shown. In part (b), 
the areas show how 
these intervals could apply 
to the amount of time 
logged by general-aviation 
aircraft during the year. 
Assuming fj, = 130 hours 
and (t — 30 hours, about 
95.5% of the planes 
logged between 70 and 
1 90 hours of flying time. 




-95.5%- 
-99.7%- 



(a) Approximate areas beneath the normal curve 




Assuming: 
fj, = 130 hours 
o" = 30 hours 



220 x (hours) 



(b) Approximate areas applied to annual flying hours 



example 



Normal Distribution 

The General Aviation Manufacturers Association has reported the average annual 
number of hours flown by general-aviation aircraft to be approximately 130 hours. 1 
For purposes of our example, we will assume a normal distribution and a standard 
deviation of 30 hours for the flying hours of aircraft within this population. 



Source: General Aviation Manufacturers Association, General Aviation Statistical Databook, 2005 
Edition, p. 20. 
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SOLUTION 

Based on the reported mean, fx = 130 hours, and the assumed standard deviation, 
cr = 30 hours, we can use the known properties of the normal distribution to deter- 
mine that about 95.5% of these aircraft logged between (130 — 2[30]) and 
(130 + 2[30]) hours, or between 70 and 190 hours of flying time during that year. 
Likewise, we could determine that about 68.3% of the aircraft logged between 100 
and 160 (fx ± a) hours and that about 99.7% logged between 40 and 220 (/x ± 3cr) 
hours. These intervals and approximate areas are shown in part (b) of Figure 7.4. 













■ 
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exercises 















7.5 It has been stated that the normal distribution is 
really a "family" of distributions. Explain. 

7.6 In the normal distribution, the probability that x 
will exceed (/a + 2a) is the same as the probability that x 
will be less than (ju, — 2a). What characteristic of the 
normal distribution does this reflect? 

7.7 Sketch two different normal distributions along a 
single x axis so that both of the following conditions are 
satisfied: (a) fju a = [i b and (b) a a > a h . 

7.8 If x is normally distributed with /a = 20.0 and 
a = 4.0, determine the following: 

a. P(x > 20.0) b. P(16.0 < x < 24.0) 

c. P(x < 12) d. P(x = 22.0) 

e. P(12.0 < x < 28.0) f. P(x > 16) 

7.9 If x is normally distributed with /a = 25.0 and 
a = 5.0, determine the following: 

a. P(% > 25.0) b. P(20.0 < x < 30.0) 

c. P(x < 30) d. P(% = 26.2) 

e. P(15.0 < x < 25.0) f. P(* > 15) 

7.10 The Canada Urban Transit Association has reported 
that the average revenue per passenger trip during a given 
year was $1.05. If we assume a normal distribution and 

a standard deviation of a = $0.20, what proportion of 
passenger trips produced a revenue of source: American 
Public Transit Association, APTA 1999 Transit Fact Book, p. 161. 

a. less than $1.05? 

b. between $0.65 and $1.45? 

c. between $0.85 and $1.25? 

d. between $0.45 and $1.05? 

7.1 1 In 2004, the average conventional first mortgage for 
new single-family homes was $294,000. Assuming a nor- 
mal distribution and a standard deviation of a = $30,000, 
what proportion of the mortgages were source: Bureau of 
the Census, Statistical Abstract of the United States 2006, p. 768. 

a. more than $294,000? 

b. between $234,000 and $354,000? 



c. between $264,000 and $324,000? 

d. more than $204,000? 

7.12 In 1999, the average charge for tax preparation by 
H&R Block, Inc. was $84.57. Assuming a normal distri- 
bution and a standard deviation of a = $10, what 
proportion of H&R Block's tax preparation fees were 
source: H&R Block, Inc., Annual Report 1999, p. 39. 

a. more than $84.57? 

b. between $64.57 and $104.57? 

c. between $74.57 and $94.57? 

d. more than $104.57? 

7.13 It has been reported that the average hotel check-in 
time, from curbside to delivery of bags into the room, is 
12.1 minutes. An Li has just left the cab that brought her 
to her hotel. Assuming a normal distribution with a stan- 
dard deviation of 2.0 minutes, what is the probability 
that the time required for An Li and her bags to get to the 
room will be: source: Tammi Wark, "Business Travel Today," USA 
Today, October 29, 1996, p. 1B. 

a. greater than 14.1 minutes? 

b. between 10.1 and 14.1 minutes? 

c. less than 8.1 minutes? 

d. between 10.1 and 16.1 minutes? 

7.14 A study by the National Golf Foundation reports 
that the 6.2 million golfers over the age of 50 spent an av- 
erage of $939 on golf during the previous year. Assuming 
a normal distribution with a standard deviation of $200, 
what is the probability that a randomly selected golfer in 
this age group will have spent: source: Scott Boeck and Marcia 
Steiner, "Grown-Up Games," USA Today, October 29, 1996, p. 1C. 

a. more than $1539? 

b. between $939 and $1339? 

c. less than $1139? 

d. between $539 and $1139? 

7.15 On average, commuters in the Los Angeles, 
California, area require 30.0 minutes to get to work. 
Assume a normal distribution with a standard deviation 
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of 5.0 minutes and a randomly selected Los Angeles-area 
commuter named Jamal. source: The World Almanac and Book 
of Facts 2006, p. 474. 

a. What is the probability that Jamal will require more 
than 45.0 minutes to get to work on any given day? 



b. Jamal has just left home and must attend an impor- 
tant meeting with the CEO in just 25.0 minutes. If 
the CEO routinely fires employees who are tardy, 
what is the probability that Jamal will be going to 
work tomorrow? 



THE STANDARD NORMAL DISTRIBUTION 



Description and Applications 

Because there is a different normal curve for every possible pair of /jl and tr, the 
number of statistical tables would be limitless if we wished to determine the areas 
corresponding to possible intervals within all of them. Fortunately, we can solve 
this dilemma by "standardizing" the normal curve and expressing the original x 
values in terms of their number of standard deviations away from the mean. The 
result is referred to as a standard (or standardized) normal distribution, and it al- 
lows us to use a single table to describe areas beneath the curve. The key to the 
process is the z-score: 



The z-score for a standard normal distribution: 



— where z = the distance from the mean, measured in 
standard deviation units 
x = the value of x in which we are interested 
[jl = the mean of the distribution 
(j = the standard deviation of the distribution 



In the aircraft flying hours example discussed earlier, the mean and (assumed) 
standard deviation were fx, = 130 hours and a = 30 hours, respectively. Using the 
Z-score equation, we can convert this distribution into a standard normal distri- 
bution in which we have z values instead of x values. For example: 



Corresponding z Value in the 
Standard Normal Distribution 



x Value in the Original 
Normal Distribution 

x = 130 hours 
x = 170 hours 
x = 100 hours 



_ X - fJL _ 
(7 

_ X - fJL _ 
(7 

_ X - (I _ 
(7 



130 




130 




30 




170 




130 




30 




100 




130 



In Figure 7.5, there are two scales beneath the curve: an x (hours of flying time) 
scale and a z (standard deviation units) scale. Note that the mean of the z scale is 
0.00. Regardless of the mean and standard deviation of any normal distribution, we 
can use the z-score concept to express the original values in terms of standard 
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FIGURE 7.5 



For the distribution of 
annual flying hours in part 
(b) of Figure 7.4, the upper 
of the two horizontal scales 
shown here represents the 
standardized normal 
distribution, in which x 
values (hours) have been 
converted to z-scores 
(standard deviation units 
from the mean). The use 
of z-scores makes it possible 
to use a single table for all 
references to the normal 
distribution, regardless of 
the original values of /x 
and a. 

For * = 100 hours, z = ^ = 1Ut) - ljU = -1.00 



deviation multiples from the mean. It is this transformation that allows us to use the 
standard normal table in determining probabilities associated with any normal dis- 
tribution. You can use Seeing Statistics Applet 5, at the end of the chapter, to see 
how the normal distribution areas respond to changes in the beginning and ending 
z values associated with Figure 7.5. 



Using the Standard Normal Distribution Table 

Regardless of the units involved (e.g., pounds, hours, dollars, and so on) in the 
original normal distribution, the standard normal distribution converts them into 
standard deviation distances from the mean. As we proceed through the next 
example, our original units will be grams. 

In demonstrating the use of the standard normal distribution table, we will rely 
on a situation involving the amount of weight necessary to balance a generator 
shaft. In answering each of the related questions, we will go through the following 
steps: 

1. Convert the information provided into one or more z-scores. 

2. Use the standard normal table to identify the area(s) corresponding to the 
Z-score(s). A portion of the standard normal distribution table is shown in 
Table 7.1 (page 214). The table provides cumulative areas from the midpoint 
(the mean, z = 0.00) to the z value of interest. 

3. Interpret the result in such a way as to answer the original question. 

In using the standard normal table, areas of interest may lie to the left of the 
mean or to the right of the mean or may even require adding two areas or sub- 
tracting one from the other. Because of (1) the variety of possibilities, and (2) the 
importance of knowing how to use the standard normal table, we will examine a 
number of applications that relate to the following example. 
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TABLE 7.1 



A portion of the standard 
normal distribution table 
from the back endsheet 



For z = 1.34, refer to the 
1.3 row and the 0.04 column 
to find the area, 0.4099. 



0 z 

z 0.00 0.01 0.02 0.03 0.04 0.05 0.06 0.07 0.08 0.09 

0.0 0.0000 0.0040 0.0080 0.0120 0.0160 0.0199 0.0239 0.0279 0.0319 0.0359 

0.1 0.0398 0.0438 0.0478 0.0517 0.0557 0.0596 0.0636 0.0675 0.0714 0.0753 

0.2 0.0793 0.0832 0.0871 0.0910 0.0948 0.0987 0.1026 0.1064 0.1103 0.1141 

0.3 0.1179 0.1217 0.1255 0.1293 0.1331 0.1368 0.1406 0.1443 0.1480 0.1517 

0.4 0.1554 0.1591 0.1628 0.1664 0.1700 0.1736 0.1772 0.1808 0.1844 0.1879 

0.5 0.1915 0.1950 0.1985 0.2019 0.2054 0.2088 0.2123 0.2157 0.2190 0.2224 

0.6 0.2257 0.2291 0.2324 0.2357 0.2389 0.2422 0.2454 0.2486 0.2517 0.2549 

0.7 0.2580 0.2611 0.2642 0.2673 0.2704 0.2734 0.2764 0.2794 0.2823 0.2852 

0.8 0.2881 0.2910 0.2939 0.2967 0.2995 0.3023 0.3051 0.3078 0.3106 0.3133 

0.9 0.3159 0.3186 0.3212 0.3238 0.3264 0.3289 0.3315 0.3340 0.3365 0.3389 

1.0 0.3413 0.3438 0.3461 0.3485 0.3508 0.3531 0.3554 0.3577 0.3599 0.3621 

1.1 0.3643 0.3665 0.3686 0.3708 0.3729 0.3749 0.3770 0.3790 0.3810 0.3830 

1.2 0.3849 0.3869 0.3888 0.3907 0.3925 0.3944 0.3962 0.3980 0.3997 0.4015 

1.3 0.4032 0.4049 0.4066 0.4082 0.4099 0.4115 0.4131 0.4147 0.4162 0.4177 

1.4 0.4192 0.4207 0.4222 0.4236 0.4251 0.4265 0.4279 0.4292 0.4306 0.4319 

1.5 0.4332 0.4345 0.4357 0.4370 0.4382 0.4394 0.4406 0.4418 0.4429 0.4441 

1.6 0.4452 0.4463 0.4474 0.4484 0.4495 0.4505 0.4515 0.4525 0.4535 0.4545 

1.7 0.4554 0.4564 0.4573 0.4582 0.4591 0.4599 0.4608 0.4616 0.4625 0.4633 

1.8 0.4641 0.4649 0.4656 0.4664 0.4671 0.4678 0.4686 0.4693 0.4699 0.4706 

1.9 0.4713 0.4719 0.4726 0.4732 0.4738 0.4744 0.4750 0.4756 0.4761 0.4767 
2.0 0.4772 0.4778 0.4783 0.4788 0.4793 0.4798 0.4803 0.4808 0.4812 0.4817 



Example 



Normal Probabilities 

Following their production, industrial generator shafts are tested for static and 
dynamic balance, and the necessary weight is added to predrilled holes in order to 
bring each shaft within balance specifications. From past experience, the amount 
of weight added to a shaft has been normally distributed, with an average of 
35 grams and a standard deviation of 9 grams. 
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SOLUTIONS 

What Is the Probability That a Randomly Selected Shaft Will Require Be- 
tween 35 and 40 Grams of Weight for Proper Balance? [P(35 < x < 40)] 

1. Conversion of x = 40 grams to z-score: 

x- ft- 40 - 35 

z = = = 0.56 

cr 9 

2. Reference to the standard normal table: Referring to the 0.5 row and the 0.06 
column of the table, we find an area of 0.2123. This is the area between the 
midpoint and z = 0.56. 

3. Solution: The probability that the randomly selected shaft will require be- 
tween 35 and 40 grams is 0.2123, as illustrated in Figure 7.6. 

What Is the Probability That a Randomly Selected Shaft Will Require at 
Least 50 Grams of Weight for Proper Balance? [P(x > 50)] 

1. Conversion of x = 50 grams to z-score: 

x - /i 50-35 

z = = = 1.67 

cr 9 

2. Reference to the standard normal table: Referring to the 1.6 row and the 0.07 
column of the table, we find an area of 0.4525. Since the total area to the right 
of the mean is 0.5000, the area in the right tail of the distribution is 
0.5000 - 0.4525, or 0.0475. 

3. Solution: The probability that a randomly selected shaft will require at least 
50 grams is 0.0475, as illustrated in Figure 7.7 (page 216). 

What Is the Probability That a Randomly Selected Shaft Will Require Be- 
tween 41 and 49 Grams of Weight for Proper Balance? [P(41 < x < 49)] 

1. Conversion of x = 49 grams to z-score: 

x-fi 49 - 35 

z = = = 1.56 

cr y 



P(35 < x < 40) = f(0.00 < z < 0.56) 

or 0.2123 (Area between z 



0.00 and z = 0.56) 




FIGURE 7.6 



The probability is 0.2123 
that a randomly selected 
generator shaft will require 
between 35 and 40 grams 
of weight in order to 
achieve proper balance. 




/u. = 35 grams 
<r = 9 grams 



z: 
x: 



0.00 0.56 
35 40 



(cr units) 
(grams) 



For x = 40 grams, z - 



40-35 
9 



= 0.56 
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FIGURE 7.7 



The probability is 0.0475 
that a randomly selected 
generator shaft will require 
at least 50 grams for proper 
balance. 



P(x > 50) = P(z> 1.67) 

or 0.5000 (Area from z = 0.00 to +~) 
-0.4525 (Area from z = 0.00 to z= 1.67) 



= 0.0475 - 



z: 
x: 




■■ 35 grams 
: 9 grams 



(a units) 
(grams) 



FIGURE 7.8 



The probability is 0.1920 
that a randomly selected 
generator shaft will require 
between 41 and 49 grams 
for proper balance. 



P{ 41 < x < 49) = P(0.67 < z < 1.56) 

or 0.4406 (Area from z = 0.00 to z = 1.56) 
-0.2486 (Area from z = 0.00 to z = 0.67) 

= 0.1920 




/j. = 35 grams 
a = 9 grams 



z: 
x: 



0.00 0.67 1.56 
35 41 49 



(a units) 
(grams) 



Conversion of x = 41 grams to z-score: 



41 - 35 



0.67 



(7 



2. Reference to the standard normal table: In this case, we must look up two 
areas in the table. Referring to the 1.5 row and the 0.06 column of 
the table, we find the area from the midpoint to z is 0.4406. Referring to 
the 0.6 row and the 0.07 column, we find the area from the midpoint to z 
is 0.2486. 

3. Solution: To find the area between z = 0.67 and z = 1.56, we must subtract 
the area from the midpoint to z = 0.67 from the area from the midpoint to 
z = 1.56, which results in 0.4406 - 0.2486, or 0.1920. As Figure 7.8 shows, 
there is a 0.1920 chance that a randomly selected shaft will require between 41 
and 49 grams. 
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P(x < 26) = P(z< -1.00) 

or 0.5000 (Area from z - 
-0.3413 (Area from z = 

- = 0.1587 



: 0.00 tO -<*>) 

0.00 to z = -1.00) 




fj, = 35 grams 
(7 = 9 grams 



(cr units) 
(grams) 



FIGURE 7.9 



The probability is 0.1587 
that a randomly selected 
generator shaft will require 
no more than 26 grams for 
proper balance. 



What Is the Probability That a Randomly Selected Shaft Will Require No 
More Than 26 Grams of Weight for Proper Balance? [P(x < 26)] 

1. Conversion of x = 26 grams to z-score: 

, = ^ = ^^=-1.00 
a 9 

2. Reference to the standard normal table: The distribution is symmetrical, so 
the area from the mean to z = —1.00 is the same as the area from the mean 
to z = +1.00. Referring to the 1.0 row and the 0.00 column of the table, we 
find the area between the midpoint and z = —1.00 to be 0.3413. 

3. Solution: As Figure 7.9 illustrates, the probability that a randomly selected 
shaft will require no more than 26 grams is 0.5000 — 0.3413, or 0.1587. 



In the preceding cases, we proceeded from x values to the determination of a 
corresponding area beneath the standard normal curve. However, it is also 
possible to specify an area, then identify a range of x values that corresponds to 
this area. 



example 



x Value for a Cumulative Probability 

In the setting just described, the average amount of weight added to the generator 
shafts was 35 grams, with a standard deviation of 9 grams. Management has just 
directed that the best 5% of the output be reserved for shipment to aerospace 
customers. Translating "the best 5%" into an amount of balancing weight, what 
weight cutoff should be used in deciding which generator shafts to reserve for 
aerospace customers? 
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FIGURE 7.10 



If the best 5% of the out- 
put is to be reserved for 
aerospace customers, shafts 
requiring no more than 
20.195 grams for balancing 
would be allocated to this 
industry. 



Area = 0.05 




35 grams 
9 grams 



(it units) 
(grams) 



For a left-tail area of 0.05, z must be -1 .645. 
If w = the maximum number of grams for generator 
shafts being sold to the aerospace industry, 



W — /uL 



or -1.645: 



■35 



and w = 20.195 



SOLUTION 

The quantity to be determined is the amount of weight (w) such that only 5% of 
the shafts require no more than w grams. Expressed in probability terms, the 
value of w must be such that P(x < w) = 0.05. 

The solution is to find out what value of z corresponds to a left-tail area of 

0. 05. then convert this z value into w grams. Reversing the steps of the previous 
examples: 

1. Find the value of z that corresponds to a left-tail area of 0.05. Referring to the 
standard normal table, we are seeking a listed value of 0.4500, since this will 
provide a tail area of 0.0500. Of the values listed, we see that the area from 
the mean to z = —1.64 includes an area of 0.4495, while the area from the 
mean to z = —1.65 includes an area of 0.4505. Interpolating between the 
two, the number of grams at the cutoff corresponds to z = —1.645. 

2. Continuing to work backwards, and knowing that fx, = 35 grams and a = 9 
grams, the next step is to substitute w into the equation for the previously 
determined z-score, z = —1.645: 

z= W ~ M or -1.645 = W ~ 35 and w = 35 - 9(1.645) = 20.195 grams 
a 9 

As Figure 7.10 shows, the manufacturer should set the cutoff for aerospace gen- 
erator shafts at 20.195 grams, since 95% of the shafts produced will require this 
amount or more in order to achieve proper balance. 

Excel and Minitab can be used to obtain cumulative normal probabilities 
associated with specified values for x — the procedures and results are shown in 
Computer Solutions 7.1. These software packages can also be used in doing 
the reverse — that is, specifying a cumulative probability, then finding the x 
value associated with it. These procedures and results are shown in Computer 
Solutions 7.2. 
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Normal Probabilities 

These procedures show how to get cumulative probabilities. 
For each software package: 

Procedure I: Getting the cumulative probability associated with one x value. 
Procedure II: Getting cumulative probabilities associated with a set of x values. 



EXCEL 





A 


B 


1 


Normal Distribution 


2 


with mean = 35, 


3 


std dev. = 9. 


4 


X 


Area to L. 


5 


45 


0.8667 


6 


35 


0.5000 


7 


25 


0.1333 



I. Excel procedure for the cumulative probability associated with one x value 

1. Click on the cell into which Excel will place the probability. Click f Click Statistical in the Category box. Click 
NORMDIST in the Function box. Click OK. 

2. Enter the x value of interest (e.g., 45) into the X box. Enter the mean of the distribution (e.g., 35) into the 
Mean box. Enter the standard deviation (e.g., 9) into the Standard_dev box. Enter True into the Cumulative box. 
Click OK. 

II. Excel procedure for cumulative probabilities associated with a set of x values 

1 . Enter the labels into A4:B4 as shown in the printout above. Enter the x values of interest (e.g., 45, 35, 25) starting with 
cell A5. Click on cell B5. 

2. Click f x . Click Statistical in the Category box. Click NORMDIST in the Function box. Click OK. 

3. Enter A5 into the X box. Enter the mean of the distribution (e.g., 35) into the Mean box. Enter the standard deviation 
(e.g., 9) into the Standard_dev box. Enter True into the Cumulative box. Click OK. The first cumulative probability 
will appear in cell B5. 

4. Click at the lower right corner of B5 (the cursor will turn to a " + " sign) and drag downward so cells B5:B7 will con- 
tain the set of cumulative probabilities. 

MINITAB 

Cumulative Distribution Function 

Normal with mean = 35 and standard deviation = 9 

x P( X <= x ) 
45 0.866740 
35 0.500000 
25 0.133260 

I. Minitab procedure for cumulative probability associated with one x value 
1 . Click Calc. Select Probability Distributions. Click Normal. 



(continued) 
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2. In the Normal Distribution menu, enter the mean of the distribution (e.g., 35) into the Mean box. Enter the stan- 
dard deviation (e.g., 9) into the Standard deviation box. Select Input constant and enter the x value of interest 
(e.g., 45) into the box. Select Cumulative. Click OK. 

II. Minitab procedure for cumulative probabilities associated with a set of x values 

Enter the desired x values (e.g., 45, 35, 25) into column C1 . Follow steps 1 and 2 above, except select Input column 
and enter C1 into the box. 




Inverse Normal Probabilities 

These procedures show how to get x values associated with cumulative normal probabilities. 
For each software package: 

Procedure I: Getting the x value associated with one cumulative probability. 
Procedure II: Getting the x values associated with a set of cumulative probabilities. 



EXCEL 





A 


B 


1 


Normal Distribution 


2 


with mean = 35, 


3 


std. Dev. = 9. 


4 


Area to L. 


X 


5 


0.9750 


52.64 


6 


0.5000 


35.00 


7 


0.2500 


28.93 



I. Excel procedure for the x value associated with one cumulative probability 

1. Click on the cell into which Excel will place the x value. Click f x . Click Statistical in the Category box. Click NORMINV 
in the Function box. Click OK. 

2. Enter the cumulative probability of interest (e.g., 0.9750) into the Probability box. Enter the mean of the distribution 
(e.g., 35) into the Mean box. Enter the standard deviation (e.g., 9) into the Standard_dev box. Click OK. 

II. Excel procedure for x values associated with a set of cumulative probabilities 

1 . Enter the labels into A4:B4 as shown in the printout above. Enter the cumulative probabilities of interest (e.g., 0.9750, 
0.5000, 0.2500) starting with cell A5. Click on cell B5. 

2. Click f x . Click Statistical in the Category box. Click NORMINV in the Function box. Click OK. 

3. Enter A5 into the Probability box. Enter the mean of the distribution (e.g., 35) into the Mean box. Enter the 
standard deviation (e.g., 9) into the Standard_dev box. Click OK. The first cumulative probability will appear in 
cell B5. 

4. Click at the lower right corner of B5 (the cursor will turn to a " + " sign) and drag downward so cells B5:B7 will con- 
tain the set of cumulative probabilities. 
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MINITAB 

Inverse Cumulative Distribution Function 

Normal with mean = 35 and standard deviation = 9 
P( X <= x ) x 

0.975 52.6397 

0.500 35.0000 

0.250 28.9296 

I. Minitab procedure for the x value associated with one cumulative probability 

1 . Click Calc. Select Probability Distributions. Click Normal. 

2. In the Normal Distribution menu, enter the mean of the distribution (e.g., 35) into the Mean box. Enter the stan- 
dard deviation (e.g., 9) into the Standard deviation box. Select Input constant and enter the cumulative probabil- 
ity of interest (e.g., 0.9750) into the box. Select Inverse cumulative probability. Click OK. 

II. Minitab procedure for the x values associated with a set of cumulative probabilities 

Enter the cumulative probabilities of interest (e.g., 0.9750, 0.5000, 0.2500) into column C1. Follow steps 1 and 2 
above, except select Input column and enter C1 into the box. 




7.16 The normal distribution is really a family of 
distributions. Is the standard normal distribution also a 
family of distributions? Explain. 

7.17 In the standard normal distribution, approximately 
what values of z correspond to the 

a. first and third quartiles? 

b. first and ninth deciles? 

c. 23rd and 77th percentiles? 

7.18 A continuous random variable, x, is normally 
distributed with a mean of $1000 and a standard devia- 
tion of $100. Convert each of the following x values into 
its corresponding z-score: 

a. x = $1000 b. x = $750 

c. x = $1100 d. x = $950 

e. x = $1225 

7.19 A continuous random variable, x, is normally 
distributed with a mean of 200 grams and a standard 
deviation of 25 grams. Convert each of the following 
x values into its corresponding z-score: 

a. x = 150 b. x = 180 

c. x = 200 d. x = 285 

e. x = 315 

7.20 Using the standard normal table, find the following 
probabilities associated with z: 

a. P(0.00 < z < 1.25) 

b. P(-1.25 < z < 0.00) 

c. P(-1.25 < z < 1.25) 



7.21 Using the standard normal table, find the following 
probabilities associated with z: 

a. P(0.00 < z < 1.10) 

b. P(z> 1.10) 

c. P(z< 1.35) 

7.22 Using the standard normal table, find the following 
probabilities associated with z: 

a. P(-0.36 < z < 0.00) 

b. P(z < -0.36) c. P(z > -0.43) 

7.23 Using the standard normal table, find the following 
probabilities associated with z: 

a. P(-1.96 < z < 1.27) 

b. P(0.29 < z < 1.00) 

c. P(-2.87 < z < -1.22) 

7.24 Using the standard normal table, determine a 
z value (to the nearest two decimal places) such that 
the area 

a. from the midpoint to z is 0.2486. 

b. from the midpoint to z is 0.3554. 

c. between z and negative infinity is 0.0694. 

d. between z and positive infinity is 0.0212. 

7.25 Using the standard normal table, determine a z value 
(to the nearest two decimal places) such that the area 

a. from the midpoint to z is 0.20. 

b. from the midpoint to z is 0.48. 

c. between z and negative infinity is 0.54. 

d. between z and positive infinity is 0.30. 
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7.26 For the normal distribution described in Exercise 

7.10, what is the probability that a randomly selected 
passenger trip would have produced a revenue 

a. between $1.05 and $1.30? 

b. over $1.40? c. under $0.75? 

7.27 For the normal distribution described in Exercise 

7.11, what is the probability that a randomly selected 
first mortgage would have been for an amount 

a. between $239,000 and $259,000? 

b. over $259,000? c. under $324,000? 

7.28 For the normal distribution described in Exercise 

7.12, what is the probability that a randomly selected 
tax preparation customer would have paid a fee 

a. between $70 and $80? 

b. under $60? c. over $90? 

7.29 For the normal distribution described in Exercise 
7.15, what is the probability that Jamal's commuting time 
to work today will be 

a. less than 29 minutes? 

b. between 29 and 33 minutes? 

c. more than 32 minutes? 

7.30 For the normal distribution described in Exercise 
7.15, what commuting time will be exceeded on only 10% 
of Jamal's commuting days? 

7.31 For the normal distribution described in Exercise 

7.11, what first-mortgage amount would have been 
exceeded by only 5% of the mortgage customers? 

7.32 For the normal distribution described in Exercise 

7.12, what tax preparation fee would have been exceeded 
by 90% of the tax preparation customers? 

7.33 It has been reported that households in the West 
spend an annual average of $6050 for groceries. Assume 
a normal distribution with a standard deviation of 
$1500. source: David Stuckey and Sam Ward, "Home Cooked," USA 
Today, July 25, 2006, p. 1A. 

a. What is the probability that a randomly selected West- 
ern household spends more than $6350 for groceries? 

b. How much money would a Western household have 
to spend on groceries per year in order to be at the 
99th percentile (i.e., only 1% of Western households 
would spend more on groceries)? 

7.34 Andre is a fearless circus performer who gets shot 
from a special cannon during the grand finale of the show 
and is supposed to land on a safety net at the other side 
of the arena. The distance he travels varies, but is 
normally distributed with a mean of 150 feet and a stan- 
dard deviation of 10 feet. The landing net is 30 feet long. 

a. To maximize Andre's probability of landing on the 
net, how far away from the cannon should he 
position the nearest edge of the net? 

b. Given the net position in part (a), what is the proba- 
bility that Andre will be able to return for tomorrow 
night's show? 



7.35 Drying times for newly painted microwave oven 
cabinets are normally distributed with a mean of 

2.5 minutes and a standard deviation of 0.25 minutes. 
After painting, each cabinet is mated with its electronic 
modules and mechanical components. The production 
manager must decide how much time to allow after 
painting before these other components are installed. 
If the time is too short, the paint will smudge and the 
unit will have to be refinished. If the time is too long, 
production efficiency will suffer. A consultant has 
concluded that the time delay should be just enough 
to allow 99.8% of the cabinets to dry completely, 
with just 0.2% ending up being smudged and sent 
back for refinishing. Given this information, for what 
time setting should the production manager set the 
automatic timer that pauses the production line while 
each cabinet dries? 

7.36 KleerCo supplies an under-hood, emissions- 
control air pump to the automotive industry. The pump 
is vacuum powered and works while the engine is oper- 
ating, cleaning the exhaust by pumping extra oxygen 
into the exhaust system. If a pump fails before the 
vehicle in which it is installed has covered 50,000 miles, 
federal emissions regulations require that it be replaced 
at no cost to the vehicle owner. The company's current 
air pump lasts an average of 63,000 miles, with a stan- 
dard deviation of 10,000 miles. The number of miles a 
pump operates before becoming ineffective has been 
found to be normally distributed. 

a. For the current pump design, what percentage of the 
company's pumps will have to be replaced at no 
charge to the consumer? 

b. What percentage of the company's pumps will fail at 
exactly 50,000 miles? 

c. What percentage of the company's pumps will fail 
between 40,000 and 55,000 miles? 

d. For what number of miles does the probability 
become 80% that a randomly selected pump will no 
longer be effective? 

7.37 The KleerCo company in Exercise 7.36 would like 
to design a more durable pump so that no more than 2% 
of original-equipment pumps are returned under free 
warranty, and the standard deviation continues to be 
10,000 miles. What will the new average service life have 
to be in order for the new pump design to comply with 
this requirement? 

7.38 A battery manufacturer has just begun distribution 
of its SuperVolt, a 6-volt lantern battery for outdoor en- 
thusiasts. Prototype tests of the new battery found the 
average lifetime in continuous use to be 8 hours, with a 
standard deviation of 2 hours. Battery lifetimes in such 
applications have been approximately normally 
distributed. Edgar Evans, the consumer reporter for a 
large metropolitan newspaper, has purchased one of the 
batteries to take along on a camping trip. On the first 
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day of his trip, Edgar goes fishing in his rowboat, 
becomes disoriented, and gets lost among the many small 
islands in the large lake near his camp. Fortunately, 
Edgar has brought along his trusty flashlight and its new 
SuperVolt battery. At 9 p.m., Edgar turns on his flashlight 
and shines it into the air, hoping that someone will see 
his signal and rescue him. Unfortunately, it is not until 
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3 a.m. that searchers begin their flight over the lake. If 
Edgar's light is still shining as they become airborne, they 
will easily spot it from the air. When Edgar finally gets 
back to the city, what is the probability that he will 
have an exciting story to tell about how his flashlight 
and its SuperVolt battery were two of the heroes in his 
rescue? 



THE NORMAL APPROXIMATION TO THE 7.4 
BINOMIAL DISTRIBUTION 



The binomial distribution, discussed in Chapter 6, is symmetrical whenever the 
population proportion, tt, is 0.5 and approaches symmetry for values that are 
close to 0.5. In addition, whenever the number of trials, n, becomes larger, the bi- 
nomial distribution takes on a bell-shaped appearance that resembles very closely 
the normal distribution discussed in this chapter. 

Although the binomial distribution is discrete and the normal distribution is 
continuous, the normal distribution is a very good approximation to the binomial 
whenever both nir and n(l — tt) are s 5. This can be quite useful when n and tt 
are not among the values listed in the binomial probability tables. As we have 
seen, such tables are already quite extensive for even relatively small upper limits 
for n. In addition, calculating binomial probabilities in the absence of such tables 
is a tedious proposition. 

The Binomial Distribution: A Brief Review 
The Bernoulli Process 

The binomial distribution assumes a Bernoulli process, which has the following 
characteristics: 

1. There are two or more consecutive trials. 

2. On each trial, there are just two possible outcomes — denoted as "success" or 
"failure." 

3. The trials are statistically independent; that is, the outcome in any trial is not 
affected by the outcomes of earlier trials, and it does not affect the outcomes 
of later trials. 

4. The probability of a success remains the same from one trial to the next. This is 
satisfied if either we sample with replacement or the sample is small compared 
to the size of the population. As a rule of thumb, if we are sampling without 
replacement and the population is at least 20 times as large as the number of 
trials, the constant tt assumption can be considered to have been met. 

The Mean and Standard Deviation 

The mean and standard deviation of the binomial distribution are the basis for the 
normal distribution that will be used in making the approximation. As described 
in Chapter 6, these are 

• Mean: 



/JL = HTT 



224 



Part 2: Probability 



• Standard deviation: 

a = V«7r(l — 77) where n = the number of trials 

77 = the probability of success 

on any given trial 
x = the number of successes 

in n trials 



Correction for Continuity 

Because the binomial distribution has gaps between possible values of x, while the 
normal distribution is continuous, the normal approximation to the binomial in- 
volves a correction for continuity. The correction consists of expanding each 
possible value of the discrete variable, x, by 0.5 in each direction. In continuous 
distributions like the normal, the probability of x taking on any exact value (e.g., 
x = 12.000) is zero. This is why we must consider each of the discrete values of x 
as being an interval. 

The continuity correction process where the discrete random variable, x, can take 
on integer values from 0 to 15: 

The discrete value of x is converted to the interval. 



x = 0 




-0.5 < x < 0.5 


>■ 



x - 1 




0.5 < x < 1.5 


► 



x — 2 


>- 


1.5 < x < 2.5 





x - 14 




13.5 < x < 14.5 


>- 



x = 15 




14.5 < x < 15.5 


>- 



The Approximation Procedure 

In using the normal distribution to approximate the binomial, we first determine 
the mean and standard deviation for the binomial distribution, then use these val- 
ues as ft and a in finding the area(s) of interest beneath the standard normal 
curve. In particular, we can find 

1. the probability for an individual value for x, now expressed in terms of the 
interval from [x — 0.5] to [x + 0.5], or 

2. the probability that x will lie within or beyond a given range of values. (Note: 
In this application, the correction for continuity becomes less important when- 
ever x can take on a large number of possible values.) 
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Normal Approximation 

This example demonstrates the use of the normal distribution to approximate the 
binomial. It represents a situation in which both nir and n(l — 77) are & 5, which 
is requisite for using the approximation. For purposes of our example, we will as- 
sume that 60% of the Americans who exercise each week by walking are women. 
A randomly selected group comprises 15 persons who exercise each week by 
walking, with x = the number of women in the group. 

SOLUTION 

Determine the Mean and Standard Deviation of x. 

The expected number of females in the group is 9.0. 

fi = mr or 15(0.60) = 9 



a = VW(1 - 77) = Vl5(0.60)(l - 0.60) = 1.897 

What Is the Probability That There Will Be Exactly 11 Females in the Group? 

Using the normal distribution to approximate the binomial, with fx = 9.0 and 
cr = 1.897, this probability is expressed as P(10.5 s x ^ 11.5). The z-score asso- 
ciated with x = 11.5 is 



fx _ 11.5 - 9.0 



(T 



The z-score associated with x 



1.897 
10.5 is 



1.32 



/X 



10.5 - 9.0 
1.897 



0.79 



Referring to the standard normal table, the area from the mean to z = 1.32 
is 0.4066 and the area from the mean to z = 0.79 is 0.2852. As Figure 7.11 
(page 226) shows, P(10.5 ^ x ^ 11.5) is the difference between these two areas, 
or 0.4066 - 0.2852 = 0.1214. 

Note that this approximation is quite close to the actual binomial probability 
of 0.1268, obtained from the binomial probability tables in Appendix A. Keep in 
mind, however, that this example was one in which both tt and n happened to be 
available in these tables. This will not always be the case, hence the usefulness of 
being able to use the normal distribution as an approximation to the binomial. 

What Is the Probability That There Will Be at Least 10 Females in the Group 
of 15? 

Again using the normal distribution to approximate the binomial, the probability 
of interest is now expressed as P(x s 9.5). (The continuity correction has 
replaced the discrete value of 10 with the interval from 9.5 to 10.5.) The z-score 
associated with x = 9.5 is 



jx _ 9.5 — 9.0 



1.897 



0.26 



Referring to the standard normal table, the area from the mean to z = 0.26 is 
0.1026. Since the total area to the right of the mean is 0.5000, the area to the right 
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FIGURE 7.11 



Normal approximation to a 
binomial distribution with 
n = 15 and it = 0.60. The 
continuity correction repre- 
sents x = 1 1 as the interval 
from 10.5 to 11.5, then the 
standard normal table is 
used in obtaining the 
approximate value of 
0.1214forP(x= 11). 
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Using the normal approximation with /j, = 9.0 and a= 1.897, 
P(x = 11) is estimated as P(10.5 <*< 11.5), orP(0.79<z< 1.32), 
Shaded area = 

0.4066 (area from z = 0.00 to z = 1 .32) 
-0.2852 (area from z = 0.00 to z = 0.79) 

= 0.1214 
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of z = 0.26 is (0.5000 - 0.1026) = P(x > 9.5) = 0.3974. The normal approxi- 
mation is again very close to the actual probability (0.4032) obtained from the 
binomial tables. 



The normal approximation to the binomial distribution is quite close when- 
ever both mr and n(l — tt) are s 5. However, the approximation becomes even 
better as the value of n increases and whenever it is closer to 0.5. Also, when we 
are determining probabilities for x being within or beyond a given interval, the 
correction for continuity becomes less important when x can take on a great many 
different values. For example, if n = 1000 and tt = 0.3, adding and subtracting 
0.5 from each x value will make virtually no difference in the results. Refer to 
Seeing Statistics Applet 6, at the end of the chapter, for a visual demonstration of 
the normal approximation to the binomial distribution. 



exercisEs 




7.39 What is the correction for continuity and why is it 
employed when the normal distribution is used in approx- 
imating the binomial distribution? 

7.40 Under what circumstances is it permissible to use 
the normal distribution in approximating the binomial 
distribution? 



7.41 In a certain binomial distribution, tt = 0.25 and 
n = 40. In using the normal approximation, 

a. What are the mean and standard deviation of the cor- 
responding normal distribution? 

b. If x = the number of "successes" among the 40 
observations, determine the following: P(x = 8), 
P(12 < x < 16), P(10 < x < 12), P(x > 14). 
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7.42 In a certain binomial distribution, tt = 0.20 and 
n = 30. In using the normal approximation, 

a. What are the mean and standard deviation of the 
corresponding normal distribution? 

b. If x = the number of "successes" among the 30 
observations, determine the following: P(x = 5), 
P{4 < x < 7), P(l < jc < 5), P(x > 7). 

7.43 Approximately 60% of Americans have some form 
of life insurance coverage. For a randomly selected sample 
of « = 20 individuals, and the discrete random variable 

x = the number of persons in the sample who are covered 
by life insurance: source: American Council of Life Insurance, 7994 
Life Insurance Fact Book, p. 35. 

a. Calculate the mean and the standard deviation of the 
binomial distribution. 

b. Using the binomial distribution, calculate the prob- 
ability that x will be exactly 12 persons. 

c. Repeat part (b), using the normal approximation to the 
binomial. (Hint: The area of interest under the normal 
curve will be between x = 11.5 and x = 12.5.) 

d. Using the normal approximation to the binomial, 
find the probability that at least 10 persons in 
the sample will be covered by some form of life 
insurance. 

7.44 About 30% of new single-family homes completed in 
the United States during 2004 were heated with electricity. 
For a randomly selected sample of 25 new single-family 
homes completed during that year and the discrete random 
variable, x = the number of homes in this group heated 
with electricity: source: Statistical Abstract of the United States 
2006, p. 620. 

a. Calculate the mean and standard deviation of the 
binomial distribution. 

b. Using the binomial distribution tables, determine the 
probability that x will be exactly 8 homes heated 
with electricity. 



c. Using the binomial distribution tables, determine the 
probability that x will be at least 6 but no more than 
9 homes. 

d. Repeat part (b), using the normal approximation to 
the binomial distribution. 

e. Repeat part (c), using the normal approximation to 
the binomial distribution. 

7.45 Of all individual tax returns filed in the United States 
during the 1999 tax filing season, 13.7% were prepared 
by H&R Block. For a randomly selected sample of 1000 
tax returns filed during this period, use the normal 
approximation to the binomial distribution in determining 
the probability that between 110 and 140 of these returns 
were prepared by H&R Block, source: h&r Block, inc., Annual 
Report 1999, p. 3. 

7.46 The Electronic Industries Association reports that 
about 54% of U.S. households have a camcorder. For a 
randomly selected sample of 800 U.S. households, use 
the normal approximation to the binomial distribution 
in determining the probability that at least 420 of these 
households have a camcorder, source: New York Times 
Almanac 2006, p. 401. 

7.47 Repeat Exercise 7.45, but without using the 
correction for continuity (i.e., the discrete random 
variable, x, is not replaced by the interval x — 0.5 to 

x + 0.5). Does your answer differ appreciably from that 
obtained in Exercise 7.45? If not, comment on the need 
for using the correction for continuity when x can take 
on a large number of possible values. 

7.48 Repeat Exercise 7.46, but without using the correc- 
tion for continuity (i.e., the discrete random variable, x, is 
not replaced by the interval x — 0.5 to x + 0.5). Does 
your answer differ appreciably from that obtained in 
Exercise 7.46? If not, comment on the need for using the 
correction for continuity when x can take on a large 
number of possible values. 



THE EXPONENTIAL DISTRIBUTION 



Description and Applications 

Chapter 6 discussed the Poisson distribution, in which the discrete random 
variable was the number of rare events occurring during a given interval of time, 
space, or distance. For a Poisson process, a distribution called the exponential 
distribution describes the continuous random variable, x = the amount of time, 
space, or distance between occurrences of these rare events. For example, in the 
context of arrivals at a service counter, x will be a continuous random variable 
describing the time between successive arrivals. 

As with the Poisson distribution, the exponential distribution has applica- 
tion to the management science topic of queuing, or waiting-line theory. Like the 
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Poisson distribution, it also assumes that the arrivals are independent from one 
another. The distribution can be summarized as follows: 

The exponential distribution: 

• For x = the length of the interval between occurrences: 

f(x) = Xe~ Xx for x > 0 and A > 0 where A = the mean and variance 

of a Poisson distribution 

1/A = the mean and standard 

deviation of the corresponding 
exponential distribution 

e = the mathematical constant, 
2.71828, the base of the nat- 
ural logarithm system 

• Areas beneath the curve: 

P(x > k) = e~ A ^ where k = the time, space, or distance until the next 

occurrence 

Like the Poisson distribution, the exponential distribution is really a family of 
distributions, and the particular member of the family is determined by just one 
descriptor: its mean. Actually, the mean of the exponential distribution is the in- 
verse of that of the Poisson. For example, if the average number of arrivals at a 
tollbooth is A = 5 persons per minute, the mean of the corresponding exponential 
distribution will be 1/A = 1/5, or 0.20 minutes between persons. Note that the 
Poisson random variable is discrete (number of persons), whereas the random 
variable for the exponential distribution is continuous, in this case, time. 

It is important that we be consistent in our use of physical-measurement units 
when we invert A to arrive at the mean of the corresponding exponential distribu- 
^™ tion. For example, if arrivals tend to occur at the rate of 15 vehicles per hour, we 
can express this as 15 arrivals divided by 60 minutes, or 0.25 arrivals per minute. 
If we then consider A = 0.25 arrivals per minute as the Poisson mean, the mean of 
the corresponding exponential distribution must also be based on minutes, and 
1/A will be 1/0.25, or 4 minutes between arrivals. 



Example 



Exponential Distribution 

Calls to the "911" emergency number of a large community have been found to 
be Poisson distributed with an average of 10 calls per hour. 

SOLUTIONS 

Determine the Mean and Standard Deviation of the Corresponding 
Exponential Distribution 

The mean of the Poisson distribution is A = 10 calls per hour, and the mean of the 
corresponding exponential distribution is the inverse of this, 1/A = 1/10, or 0.10 
hours between calls. The standard deviation is numerically equal to the mean. 
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FIGURE 7.12 




An exponential distribution 
in which the mean time 
between occurrences is 
6 minutes. The probability 
is 0.4347 that at least 
5 minutes will elapse before 
the next call to the "91 1 " 
switchboard. 



I I I I I I I I I I I I I I I I I L_ 

0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 

x = Minutes between calls to "9 11 " 



For greater clarity in answering the questions that follow, we will express the 
means of the Poisson and exponential distributions using minutes instead of 
hours. Accordingly, the mean of the Poisson distribution is 10 calls per 60 min- 
utes, or A = 1/6 calls per minute. The mean of the corresponding exponential 
distribution is 1/A = 6 minutes between calls. 

What Is the Probability That the Next Call Will Occur at Least 5 Minutes from Now? 

Referring to the continuous probability distribution of Figure 7.12, this probabil- 
ity corresponds to the area to the right of k = 5 minutes on the x axis. Applying 
the formula for determining areas beneath the curve: 

P(x & k) = e~ xk where k = 5 minutes 

or 

P{x > 5) = e-WVW = 2.71828" 0 - 833 or 0.4347 

There is a 0.4347 chance that at least 5 minutes will elapse before the next call 
is received. Alternatively, the probability that the next call will occur within the 
next 5 minutes is (1.000 — 0.4347), or 0.5653. This is because (1) the total area 
beneath the curve is 1.0, and (2) the exponential distribution is continuous and 
the probability is zero that a continuous random variable will take on any exact 
value [i.e., P(x = 5) is 0.0]. 

What Is the Probability That the Next Call Will Occur Between 3 and 
8 Minutes from Now? 

The area beneath the curve in the interval 3 ^ x ^ 8 can be determined by sub- 
tracting the area representing x ^ 8 from the area representing i>3. This is 
shown in Figure 7.13 (page 230) and is calculated as 

P(3 < x < 8) = P(x > 3) - P(x > 8) = e -<V6)(3) _ e -(l/6)(8) 

= 2.71 828 - 0 - 500 - 2.71 828" 1 - 333 or 0.6065 - 0.2637 = 0.3428 



Excel and Minitab can be used to obtain cumulative exponential probabilities 
associated with specified values for x — the procedures and results are shown in 
Computer Solutions 7.3. These software packages can also be used in doing the 
reverse — that is, specifying a cumulative probability, then finding the x value asso- 
ciated with it. These procedures and results are shown in Computer Solutions 7.4. 
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FIGURE 7.13 



For the exponential distribu- 
tion in which the mean 
time between calls is 
6 minutes, the probability is 
0.3428 that between 3 and 
8 minutes will elapse before 
the next call to the "91 1 " 
switchboard. 
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x = Minutes between calls to "91 1 " 




Exponential Probabilities 

These procedures show how to get cumulative exponential probabilities. 
For each software package: 



Procedure I: Getting the cumulative probability associated with one x value. 
Procedure U: Getting cumulative probabilities associated with a set of x values. 

EXCEL 





A 


B 


C 


1 


Exponential Distribution 


2 


with mean = 6.0. 




3 


X 


Area to L. 




4 


7.0 


0.6886 




5 


6.0 


0.6321 




6 


5.0 


0.5654 





I. Excel procedure for the cumulative probability associated with one x value 

1. Click on the cell into which Excel will place the probability. Click f x . Click Statistical in the Category box. Click 
EXPONDIST in the Function box. Click OK. 

2. Enter the x value of interest (e.g., 5) into the X box. Enter the inverse of the exponential mean (e.g., if the mean is 6 
minutes between calls, enter 1 /6, or 0.16667) into the Lambda box. Enter True into the Cumulative box. Click OK. 

II. Excel procedure for cumulative probabilities associated with a set of x values 

1 . Enter the labels into columns A3:B3 as shown in the printout above. Enter the x values of interest (e.g., 7, 6, 5) start- 
ing with cell A4. Click on cell B4. 

2. Click f x . Click Statistical in the Category box. Click EXPONDIST in the Function box. Click OK. 
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3. Enter A4 into the X box. Enter the inverse of the exponential mean (e.g., 1/6, or 0.16667) into the Lambda box. Enter 
True into the Cumulative box. Click OK. The first cumulative probability will appear in cell B4. 

4. Click at the lower right corner of B4 (the cursor will turn to a " + " sign) and drag downward so cells B4:B6 will con- 
tain the set of cumulative probabilities. 



MINI TAB 

Cumulative Distribution Function 

Exponential with mean = 6 
x P( X <= x ) 
7 0.688597 
6 0.632121 
5 0.565402 

I. Minitab procedure for cumulative probability associated with one x value 

1 . Click Calc. Select Probability Distributions. Click Exponential. 

2. In the Exponential Distribution menu, enter the mean of the distribution (e.g., 6) into the Mean or Scale box. Se- 
lect Input constant and enter the x value of interest (e.g., 5) into the box. Select Cumulative probability. Click OK. 

II. Minitab procedure for cumulative probabilities associated with a set of x values 

Enter the desired x values (e.g., 7, 6, 5) into column C1 . Follow steps 1 and 2 above, except select Input column and 
enter C1 into the box. 





computer solutions 7.4 












34 , 





Inverse Exponential Probabilities 



For Minitab only. Excel does not currently offer inverse exponential probabilities. 



MINITAB 

Inverse Cumulative Distribution Function 

Exponential with mean = 6 
P( X <= x ) x 

0.975 22.1333 

0.500 4.1589 

0.250 1.7261 

I. Minitab procedure for the x value associated with one cumulative probability 

1 . Click Calc. Select Probability Distributions. Click Exponential. 

2. In the Exponential Distribution menu, enter the mean of the distribution (e.g., 6) into the Mean or Scale box. 
Select Input constant and enter the cumulative probability of interest (e.g., 0.8000) into the box. Select Inverse 
cumulative probability. Click OK. 

II. Minitab procedure for the x values associated with a set of cumulative probabilities 

Enter the cumulative probabilities of interest (e.g., 0.9750, 0.5000, 0.2500) into column C1. Follow steps 1 and 2 
above, except select Input column and enter C1 into the box. 
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7.49 What is the relationship between the Poisson distri- 
bution and the exponential distribution? 

7.50 Every day, drivers arrive at a tollbooth. If the 
Poisson distribution were applied to this process, what 
would be an appropriate random variable? What would 
be the exponential-distribution counterpart to this 
random variable? 

7.51 The main switchboard at the Home Shopping Net- 
work receives calls from customers. If the Poisson distri- 
bution were applied to this process, what would be an 
appropriate random variable? What would be the expo- 
nential-distribution counterpart to this random variable? 

7.52 A random variable is Poisson distributed with A = 1.5 
occurrences per hour. For the corresponding exponential 
distribution, and x = hours until the next occurrence, 
identify the mean of x and determine the following: 

a. P(x > 0.5) b. P(x > 1.0) 
c. P(x > 1.5) d. P(x > 2.0) 

7.53 A random variable is Poisson distributed with 

A = 0.02 occurrences per minute. For the corresponding 
exponential distribution, and x = minutes until the next 
occurrence, identify the mean of x and determine the 
following: 

a. P(x > 30.0) b. P(x > 40.0) 
c. P(x > 50.0) d. P(x > 60.0) 

7.54 A random variable is Poisson distributed with 

A = 0.50 arrivals per minute. For the corresponding expo- 
nential distribution, and x = minutes until the next arrival, 
identify the mean of x and determine the following: 
a. P(x < 0.5) b. P(x < 1.5) 
c. P(x > 2.5) d. P(x > 3.0) 

7.55 The owner of a self-service carwash has found that 
customers take an average of 8 minutes to wash and dry 
their cars. Assuming that the self-service times tend to be 
exponentially distributed, what is the probability that a 
customer will require more than 10 minutes to complete 
the job? 

7.56 A taxi dispatcher has found that successive calls for 
taxi service are exponentially distributed, with a mean time 



between calls of 5.30 minutes. The dispatcher must discon- 
nect the telephone system for 3 minutes in order to have the 
push-button mechanism repaired. What is the probability 
that a call will be received while the system is out of service? 

7.57 During 2004, U.S. general aviation pilots had 1.20 
fatal crashes per 100,000 flying hours. Harriet Arnold is 
president of Arnold's Flying Service, a company that oper- 
ates a total of 50 sightseeing planes based in 20 regions of 
the United States. Altogether, the planes in this fleet are in 
the air about 40,000 hours per year. Ever since a fatal 
crash last year involving one of the company's planes that 
took off under questionable weather conditions, the 
company has been getting close scrutiny from both 
government safety officials and the national media. 
Harriet believes her planes are just as safe as anyone else's, 
but is concerned that even one more fatal accident within 
the coming year might be too much for the now-struggling 
company to overcome. Assuming an exponential distribu- 
tion for x = thousands of flying hours between fatal 
crashes, what is the probability that Arnold's Flying 
Service will not experience a fatal crash until at least a 
year from today? Until at least 2 years from today? What 
number of flying hours would be associated with a 90% 
probability of experiencing no crashes within that number 
of flying hours from today? source: General Aviation Manufactur- 
ers Association, General Aviation Statistical Databook, 2005 Edition, p. 39. 

7.58 The American Iron and Steel Institute reports the fa- 
tality rate for workers in this industry as 0.007 fatalities 
per 200,000 worker-hours of production. During 1998, the 
aggregate number of worker-hours spent on producing iron 
and steel products amounted to 243 million worker-hours. 
Assuming that (1) an exponential distribution applies for 
the number of worker-hours between fatalities in this indus- 
try, and (2) the 243 million worker-hours level of activity 
continues in the industry, what is the expected number of 
worker-hours until the next fatality occurs in this industry? 
If a given company in this industry has 1,000,000 worker- 
hours devoted to production each year, what is the proba- 
bility that the next fatal injury in this company will not 
occur until at least 30 years from now? source: American Iron 
and Steel Institute, Annual Statistical Report 1998, pp. 16-17. 



7.6 SIMULATING OBSERVATIONS FROM A CONTINUOUS 

PROBABILITY DISTRIBUTION 



Although practically all statistical packages can provide probabilities like those in 
the Computer Solutions earlier in this chapter, some can also randomly select 
observations from the continuous distribution itself. For example, Computer 
Solutions 7.5 lists the procedures and shows the results when Excel and Minitab 
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computer solutions 7. 
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Simulating Observations from a Continuous Probability Distribution 

These procedures simulate 10,000 observations from a normal distribution with mean = 35 and standard 
deviation = 9. The procedures for simulating observations from an exponential distribution will differ only 
slightly from those described here. 

EXCEL 



16 



10,000 Simulated Observations 



1200 
900 
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Grams 



Mean 



Standard Error 

Median 

Mode 



Standard Deviation 
Sample Variance 



Kurtosis 



Skewness 



Range 



Minimum 



Maximum 



Sum 



Count 



34.965 



0.090 
34.987 
29.216 



8.994 
80.897 



-0.002 



63.870 



1.326 



65.196 



349645.570 



10000 



Start with a blank worksheet. Click Tools. Click Data Analysis. Click Random Number Generation. Click OK. 
Enter 1 into the Number of Variables box. Enter 10000 into the Number of Random Numbers box. Select 
Normal within the Distribution box. Enter 35 into the Mean box. Enter 9 into the Standard Deviation box. Select 
Output Range and enter A1 into the box. Click OK. The 1 0,000 simulated observations will be located in A1 :A1 0000. 
The Excel descriptive statistics and histogram above can be generated using the procedures in Computer Solutions 2.1 
and 3.1. 



MINITAB 
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(continued) 
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Descriptive Statistics: Grams 

Variable N N* Mean SE Mean StDev Minimum Ql Median Q3 

Crams 10000 0 34.995 0.091 9.078 2.055 28.835 35.020 41.079 

Variable Maximum 
Crams 70.164 

1 . Click Calc. Select Random Data. Click Normal. Enter 10000 into the Number of rows of data to generate box. 
Enter C1 (the destination column) into the Store in columns box. Enter 35 into the Mean box. Enter 9 into the Stan- 
dard deviation box. Click OK. The 10,000 simulated observations will be located in column C1. 

2. The Minitab descriptive statistics and histogram shown above can be generated using the procedures in Computer 
Solutions 2.1 and 3.1 . 

Note: Users of the Minitab Student Edition are limited to a worksheet size of 5000 data values, so the number of 
simulated observations must be no more than 5000 into an otherwise empty worksheet. 



are used in randomly selecting 10,000 observations from a normal distribution. 
Both sets of simulated observations are from a normal distribution examined ear- 
lier in the chapter — in this situation, the continuous random variable is x = the 
amount of weight added so that a generator shaft would have proper static and 
dynamic balance. In the probability distribution, the average amount of weight 
added is = 35 grams and the standard deviation is cr = 9 grams. 

In each of the printouts in Computer Solutions 7.5, the mean and standard de- 
viation of the simulated observations are very close to the actual mean and stan- 
dard deviation of the theoretical distribution from which the 10,000 observations 
were drawn. In addition to differing slightly from the theoretical values, the Excel 
and Minitab sample means and standard deviations also differ slightly from each 
other. This is because each package generated its own sample of 10,000 observa- 
tions, and each sample had a different set of observations from the other. 

According to the law of large numbers, histograms like those in Computer 
Solutions 7.5 will more closely approach the theoretical distribution from which 
they were drawn whenever the number of observations becomes larger. The topic 
of how samples can differ from each other, and from the population from which 
they were drawn, will be an important topic in the next two chapters. 



exercises 




ex 




7.59 A computer statistical package has simulated 1000 
random observations from a normal distribution with 

H = SO and cr = 10. Sketch the approximate box-and- 
whisker display for the resulting data. 

7.60 A computer statistical package has simulated 2000 
random observations from a normal distribution with 

fi = 80 and <j = 20. Sketch the approximate box-and- 
whisker display for the resulting data. 

7.61 If a computer statistical package were to simulate 
2000 random observations from a normal distribution with 
fi = 150 and cr = 25, what percentage of these 



observations would you expect to have a value of 140 or 
less? Do you think the actual number in the 140" range 
would equal the expected number in this range? If so, why? 
If not, why not? 

7.62 If a computer statistical package were to simulate 
1000 random observations from a normal distribution with 
/a = 200 and cr = 50, what percentage of these 
observations would you expect to have a value of 300 or 
more? Do you think the actual number in the "S 300" 
range would equal the expected number in this range? If 
so, why? If not, why not? 
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• Nature and applications of the normal distribution 

Continuous probability distributions describe probabilities associated with ran- 
dom variables that can take on any value along a given range or continuum and 
for which there are no gaps between these possible values. The curve f(x) is a 
function of the continuous random variable, and f(x) is called the probability den- 
sity function. For any specified interval of values, the probability that x will as- 
sume a value within the interval is the area beneath the curve between the two 
points describing the interval. The total area beneath the curve is 1.0, and the 
probability that x will take on any specific value is 0.0. 

The most important continuous distribution is the normal distribution. This 
is because many natural and economic phenomena tend to be approximately nor- 
mally distributed; it can be used to approximate other distributions, such as the 
binomial; and the distribution of sample means and proportions tends to be nor- 
mally distributed whenever repeated samples are taken from a given population 
of any shape. 

The normal distribution is a symmetrical, bell-shaped curve, and the mean, 
median, and mode are located at the same position. The curve is asymptotic in 
that each end approaches the horizontal (x) axis, but never reaches it. The normal 
distribution is actually a family of such distributions, and the specific member of 
the family is determined by the mean (/x) and standard deviation (cr). Regardless 
of the shape of a particular normal curve, about 68.3% of the area is within the 
interval fx ± cr, 95.5% within fx ± 2cr, and 99.7% within fx ± 3o\ 

• Standard normal distribution and z-scores 

When the original x values are expressed in terms of the number of standard 
deviations from the mean, the result is the standard (or standardized) normal dis- 
tribution in which the z value (or z-score) replaces x. The standard normal distri- 
bution has a mean of 0 and a standard deviation of 1, and areas beneath its curve 
are readily available from the standard normal table. 

• Normal distribution to approximate the binomial distribution 

Although the binomial distribution is discrete and the normal distribution is 
continuous, the normal distribution is a good approximation to the binomial 
whenever both mr and n(l — tt) are & 5. The normal approximation is useful 
whenever suitable binomial tables are not available, and makes it unnecessary to 
engage in the tedious computations that the binomial distribution would other- 
wise require. 

Because the binomial distribution has gaps between possible values of x, the 
normal approximation to the binomial involves a correction for continuity, which 
consists of "expanding" each possible value of x by 0.5 in each direction. This 
correction becomes less important whenever the number of trials is large and we 
are finding the probability that x lies within or beyond a given range of values. 

• Poisson distribution 

For a Poisson process, the exponential distribution describes the continuous ran- 
dom variable, x = the amount of time, space, or distance between occurrences of 
the rare events of interest. The exponential distribution is a family of distribu- 
tions, all of which are positively skewed, and the specific member is determined 
by the value of the mean. 
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• Computer-generated probabilities and simulations 

As with discrete probability distributions, computer statistical packages can 
determine the probabilities associated with possible values of x. In the case of 
continuous distributions, this includes finding the exact area beneath a specified 
portion of the curve, or probability density function, and eliminates the need for 
calculations or table references. Some packages, such as Excel and Minitab, can 
also simulate the random selection of observations from the distribution itself. 



The Normal Distribution 

1 MM1 „_ ..w_,a 



f(x) 



(tV2tt 



where /a = mean 

a = standard deviation 

e = the mathematical constant, 2.71828 

7r = the mathematical constant, 3.14159 



The z-Score for a Standard Normal Distribution 

_ X — /JL 



a 



where z = the distance from the mean, measured in standard 
deviation units 
x = the value of x in which we are interested 
fi = the mean of the distribution 
a = the standard deviation of the distribution 



The Normal Approximation to the Binomial Distribution 

• Mean: 

/JL = HIT 

• Standard deviation: 

cr = V«7r(l — it) where n = the number of trials 

77 = the probability of success 

on any given trial 
x = the number of successes in 

n trials 

(Note: Used when nir and «(1 — it) are both > 5.) 
The Exponential Distribution 

• For x = the length of the interval between occurrences: 



f{x) = \e~ Xx for x > 0 and A > 0 



where A = the mean and variance 
of a Poisson distribution 
1/A = the mean and standard 

deviation of the corresponding 
exponential distribution 
e = the mathematical constant, 2.71828, 
the base of the 
natural logarithm system 



Areas beneath the curve: 



P(x > k) = e~ xk 



where k = the time, space, or distance until the 
next occurrence 
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7.63 According to a recent survey, 35% of the adult 
citizens of Mayville have a charge account at Marcy's De- 
partment Store. Mr. Marcy's nephew is going to trial for 
embezzlement, and nine jurors are to be randomly selected 
from the adult citizenry. What is the probability that at 
least four of the jurors will be charge-account holders? 

7.64 A food expert claims that there is no difference be- 
tween the taste of two leading soft drinks. In a taste test 
involving 200 persons, however, 55% of the subjects say 
they like soft drink A better than B. If the food expert is 
correct, we would expect that 50% of the persons would 
have preferred soft drink A. Under the assumption that 
the food expert is correct, what would be the probability 
of a "taste-off" in which 110 or more of the subjects 
preferred soft drink A? 

7.65 In Exercise 7.64, the maker of soft drink A claims its 
product to be "superior" to soft drink B. Given the results 
obtained in Exercise 7.64, do they appear to have a solid 
basis for their claim? 

7.66 During an annual heating season, the average gas 
bill for customers in a New England community heating 
their homes with gas was $457. Assuming a normal dis- 
tribution and a standard deviation of $80: 

a. What proportion of homes heating with gas had a gas 
bill over $382? 

b. What proportion of homes heating with gas had a gas 
bill between $497 and $537? 

c. What amount was exceeded by only 2.5% of the 
homes heating with gas? 

d. What amount was exceeded by 95% of the homes 
heating with gas? 

7.67 The American Council of Life Insurance and the 
Life Insurance Marketing and Research Association have 
reported that insured households with heads 35 to 44 
years old had an average of $186,100 of life insurance 
coverage. Assuming a normal distribution and a standard 
deviation of $40,000, what is the probability that a ran- 
domly selected household with a head in this age group 
had less than $130,000 in life insurance coverage? source: 
7994 Life Insurance Fact Book, p. 36. 

7.68 During 2004, the average number of flying hours for 
aircraft operated by regional airlines in the United States 
was 2389. Assume a normal distribution and a standard 
deviation of 300 hours, source: Statistical Abstract of the United 
States 2006, p. 701. 

a. What proportion of aircraft flew more than 2200 
hours during the year? 

b. What proportion of aircraft flew between 2000 and 
2400 hours during the year? 



c. What number of annual flying hours was exceeded by 
only 15% of the aircraft? 

d. What number of annual flying hours was exceeded by 
75% of the aircraft? 

7.69 During fiscal 2002, the average daily volume for 
FedEx Corporation was 3,077,000 packages per day. 
Assuming a normal distribution and a standard deviation 
of 400,000 packages per day, on what proportion of the 
days was the volume between 3,000,000 and 3,200,000 
packages? source: FedEx Corporation, 2002 Annual Report, p. 56. 

7.70 For itemized tax returns in the $60,000-$75,000 
income group for the most recent year reported, the aver- 
age charitable contribution was $1935. Assume a normal 
distribution and a standard deviation of $400. source: 
Statistical Abstract of the United States 2002, p. 360. 

a. For a randomly selected household from this income 
group, what is the probability that the household 
made charitable contributions of at least $1600? 

b. For a randomly selected household from this income 
group, what is the probability that the household made 
charitable contributions of between $2200 and $2400? 

c. What level of contributions was exceeded by only 
20% of the itemized returns in this income group? 

d. What level of contributions was exceeded by 40% of 
the itemized returns in this income group? 

7.71 Of the 1.0 million persons of voting age in Maine, 
72% voted in the 2004 presidential election. For a 
randomly selected group of 30 Maine residents who were 
of voting age at that time, what is the probability that at 
least 20 of these persons voted in the presidential 
election? source: Statistical Abstract of the United States 2006, p. 264. 

7.72 A public relations agency tells its client that 80% of 
the residents in a 50-mile radius believe the company is 
"an industry leader." Skeptical, the company commissions 
a survey in which just 320 of the 500 persons interviewed 
felt the company was an industry leader. If the public rela- 
tions agency's claim is correct, what is the probability that 
interviews with 500 people would yield 320 or fewer who 
felt the company was an industry leader? 

7.73 Given the results of Exercise 7.72, evaluate the 
claim made by the public relations agency. 

7.74 A researcher is studying a particular process and 
has identified the following times, in minutes, between 
consecutive occurrences: 1.8, 4.3, 2.3, 2.1, 5.6, 1.5, 
4.5, 1.8, 2.4, 2.6, 3.2, 3.5, 1.6, 2.1, and 1.9. Given this 
information, what would be the researcher's best 
estimate for the mean of the exponential distribution? 
What would be her best estimate for the mean of the 
corresponding Poisson distribution? 
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7.75 Based on data covering two consecutive school 
years, the National Safety Council found that fourth to 
sixth graders engaging in unorganized activities on school 
grounds experienced injuries at the rate of 2.4 injuries per 
100 thousand student-days. On the average, how many 
student-days elapse between injuries to fourth to sixth 
graders in unorganized activities at school, and what is 
the probability that the next such injury will occur before 
45 thousand student-days have passed? source: National 
Safety Council, Accident Facts 1987 Edition, p. 100. 

7.76 The number of defects in rolls of aluminum sheet 
average 3.5 defects per 100 feet. The company president is 
accompanying overseas visitors on a tour of the plant, and 
they have just arrived at the aluminum-sheet machine. If 
they remain at the machine long enough to observe 32 feet 
of aluminum being produced, what is the probability that 
they will have left the area before the next defect occurs? 

7.77 The mileage death rate for motorcycle riders has been 
estimated to be about 39 deaths per 100 million miles of 
motorcycle travel. A national motorcycling association has 
1200 members who travel a total of 2 million miles each 
year on their motorcycles. What is the probability that the 
next highway fatality experienced by the association will 
not occur until more than a year from now? Until more 
than 2 years from now? source: USAToday.com, July 30, 2006. 

7.78 The "20-ounce" package of mozzarella cheese dis- 
tributed by a food company actually has a weight that is 
normally distributed, with /jl = 21 ounces and a = 0.2 
ounces. 

a. What is the probability that a randomly selected 
package of this cheese will actually contain at least 
20.5 ounces? 

b. What is the probability that a randomly selected 
package of this cheese will actually contain between 
20.5 and 21.3 ounces? 

c. A shopper randomly selects 8 of these packages. 
What is the probability that 3 or more of them will 
contain at least 21.2 ounces of cheese? 

7.79 State Police Academy graduates from a large gradu- 
ating class have scored an average of 81.0 on a 
proficiency test, and the scores have been normally 
distributed, with a = 8.5. Graduates are randomly paired 
and assigned to barracks throughout the state. Madan 
has scored 83.0 on the test. What is the probability that 
he will be paired with someone whose score was within 
5.0 points of his own? 

7.80 The trainer for a professional football team has 
found that, on the average, his team experiences 2.3 
sprained ankles per 1000 game minutes. If a game lasts 60 
minutes, and the number of game minutes between ankle 
sprains is exponentially distributed, determine the proba- 
bility that the next ankle sprain will occur 

a. during the team's next game. 

b. sometime during the team's next ten games. 



c. at the moment the 2-minute warning is sounded in 
the next game. 

7.81 The precooked weight of hamburgers at a gourmet 
hamburger restaurant has been normally distributed with 
a mean of 5.5 ounces and a standard deviation of 0.15 
ounces. A dining-column journalist from the local news- 
paper enters the restaurant and orders a hamburger. 
What is the probability that he will receive a hamburger 
that had a precooked weight less than 5.3 ounces? If, in a 
separate visit to the restaurant, the journalist is accompa- 
nied by three colleagues who also order a hamburger, 
what is the probability that at least 2 of these 4 customers 
will receive a hamburger that had a precooked weight 
greater than 5.7 ounces? 

7.82 Boxes are filled with sugar by a machine that is 
set to deliver an average of 20.3 ounces. The weights 
are normally distributed with a standard deviation of 
0.3 ounces. If a consumer advocate buys a package of the 
product and then proceeds to weigh the contents, what is 
the probability that the package she has purchased will 
have a content weight between 20 and 21 ounces? If the 
consumer advocate plans to purchase 100 boxes of the 
product, then file suit against the company if more than 

5 of the boxes have content weights below the 20-ounce 
stated weight on the package, what is the probability that 
she will end up filing suit against the company? 

7.83 In Exercise 7.82, the company would like to put in 
a greater weight "cushion" to help protect itself from 
consumer advocates. If the company wants to have just 
2% of the packages contain less than 20 ounces, to what 
average weight must the filling machine be set? 

7.84 Discount Micros, a computer mail order company, 
has received 2000 desktop computers from a major man- 
ufacturer. Unknown to the mail order company, 250 of 
the computers have a hard disk that was improperly 
installed at the factory. A consulting firm purchases 40 
computers from Discount Micros. If the order is filled by 
random selection from the 2000 computers just received, 
what is the probability that the shipment to the consult- 
ing firm will include no more than 2 defective computers? 
At least 1, but no more than 3 defective computers? 

7.85 The U-Drive car rental corporation has found that 
the cars in the company's rental fleet experience 
punctures at the rate of 1.25 punctures per 10,000 miles, 
and the number of punctures per distance traveled is Pois- 
son distributed. Ed and Harriet are taking advantage of a 
U-Drive promotion in which there is no mileage charge. 
During their vacation, they plan to drive from Boston to 
Los Angeles and back, a total distance of 6164 miles. 

a. What is the probability that Ed and Harriet will not 
have to change any tires during their vacation? 

b. While traveling west, what is the probability that Ed 
and Harriet will experience a punctured tire before 
they make it to Denver, 2016 miles away? 
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c. For what mileage is the probability 0.80 that Ed and 
Harriet will experience a puncture before they reach 
this distance? 

7.86 Bonnie Rogers, the customer relations manager for a 
large computer manufacturer, has recently studied the du- 
ration of telephone calls received by the company's techni- 
cal support department. The call lengths are exponentially 
distributed, with a mean duration of 8 minutes. Experience 
has taught Bonnie that lengthy calls usually can be more 
efficiently handled by a technical support manager, rather 
than by the less experienced personnel who receive the 
initial technical inquiry. To avoid having the higher-paid 
managers virtually take over the task of handling technical 
support, Bonnie would like to have only the longest 10% 
of the calls redirected to a technical support manager. How 
long must a technical support call last before it qualifies for 
redirection to a technical support manager? 



7.87 A storage warehouse in a relatively remote part of a 
county is protected by a homemade burglary detection 
system. Once actuated by an intruder, the system sounds 
a horn and flashes a light for 15 minutes, then saves bat- 
tery power by shutting down and resetting itself until the 
next intrusion is detected. Police in three patrol cars have 
this part of the county as one of their responsibilities, and 
they routinely drive by the warehouse facility during the 
hours from dark to dawn. On average, a patrol car passes 
the warehouse every 20 minutes, and the time between 
patrol car arrivals is exponentially distributed. It's the 
middle of the night and a burglar has just broken into the 
storage warehouse and set off the alarm. What is the 
probability that the alarm will shut off before the next 
police patrol car drives by? 



integrated cases 




Thorndike Sports Equipment 

(Corresponds to Thorndike Video Unit Three) 



Since Thorndike "Graph-Pro" racquetball racquets were 
introduced several years ago, no attempt has ever been 
made to offer a lightweight version. Ted has done some 
research and found that some of the other manufacturers 
offer a lighter version of their standard racquet. In addi- 
tion, some of the racquetball players he has talked to 
have said a lighter racquet would probably improve their 
game. 

Talking with production engineers, Ted learns that the 
racquets coming off the line don't all weigh the same any- 
way. According to the chief engineer, the racquets weigh an 
average of 240 grams, but the weights "vary all over the 
lot," and he takes advantage of the situation to press for a 
high-tech $700,000 machine that would produce racquets of 
equal quality while holding this variation within narrower 
limits. 

Old Luke isn't too keen on buying a $700,000 machine, 
especially since Ted might have a better solution. Ted has 
dusted off the literature that accompanied the racquet- 
production machine and found that the machine generates 
output that is approximately normally distributed, with a 
standard deviation of 10 grams. This is a little sloppy these 
days, but maybe Thorndike Sports Equipment can turn this 
weakness into a strength. 



Conversing further with Luke, talking to three sporting- 
goods retailers, and interviewing players at five different 
racquetball facilities, Ted comes back to the office with a 
terrific idea. Asking Luke to sit down, and bringing him 
some coffee, Ted begins his pitch: "Say, grandfather, why 
don't we take the racquets at the lighter end of the range 
and label them 'Graph-Pro Light,' with the ones more in 
the middle being 'Graph-Pro Regular,' and the ones at the 
heavier end can be 'Graph-Pro Stout'?" 

Luke responds that just because this works for the beer 
companies, this doesn't mean it will work for Thorndike 
Sports Equipment. However, he's willing to give it a try. 
After accompanying Ted on another round of conversations 
with retailers and players, the elder Thorndike tells Ted that 
for now, he'd like to see 15% of the production consist of 
"Graph-Pro Light," with 80% "Graph-Pro Regular," and 
5% "Graph-Pro Stout." 

Ted's next task is to tell the production people how to 
go about selecting which racquets are to be put into these 
three different categories. He's also trying to follow his 
grandfather's directive to come up with a better name for 
the "Stout" model, but he figures he can do that after 
lunch. 



■'0 . . . V 



in ' mi 



eeing statistics: applet 4 





Size and Shape of Normal Distribution 



This applet has two normal curves, a blue one and a red 
one. The blue one stays fixed with a mean of 0 and a stan- 
dard deviation of 1 . Using the sliders at the bottom, we can 
change the mean and standard deviation of the red curve 




and observe how these changes affect its position and its 
shape. Moving the top slider to the left or right will decrease 
or increase the mean of the red normal curve, and moving 
the bottom slider to the left or right will decrease or increase 
its standard deviation. 

Applet Exercises 

4.1 Position the top slider at the far left, then gradually 
move it to the far right. What effect does this have on the 
mean of the distribution? 

4.2 Using the bottom slider, change the standard deviation 
so that it is greater than 1 . How does the shape of the red 
curve compare to that of the blue curve? 

4.3 Using the two sliders, position the red curve so that its 
mean is as small as possible and its shape is as narrow as 
possible. Compare the numerical values of the mean and 
standard deviation to those of the blue curve (mean = 0, 
standard deviation = 1). 



Thorndike Golf Products Division 



During an 8-hour shift of golf-ball production, one golf ball is 
randomly selected from each 2 minutes' worth of output. The 
ball is then tested for "liveliness" by rolling it down a grooved, 
stainless-steel surface. At the bottom, it strikes a smooth iron 
block and bounces backward, up a gradually sloped, grooved 
surface inscribed with distance markings. The higher the ball 
gets on the rebound surface, the more lively it is. 

The production machine is set so that the average 
rebound distance will be /x = 30.00 inches, and 
the standard deviation is cr = 2.00 inches. The 
scores are normally distributed. 

1. A driving range has just purchased 100 golf balls. Use 
the computer and the binomial distribution in deter- 
mining the individual and cumulative probabilities 
for x = the number of balls that scored at least 31.00 
inches on the "bounce" test. 

2. Repeat part (1), but use the normal approximation to 
the binomial distribution. Do the respective probabili- 
ties appear to be very similar? 

3. Given the distribution of bounce test scores described 
above, what score value should be exceeded only 5% 



of the time? For what score value should only 5% of 
the balls do more poorly? 

4. What is the probability that, for three consecutive 
balls, all three will happen to score below the lower of 
the two values determined in part (3)? 

Two hundred forty golf balls have been subjected 
to the bounce test during the most recent 8-hour 
shift. From the 1st through the 240th, their scores 
are provided in computer data file CDB07 as rep- 
resenting the continuous random variable, 
BOUNCE. The variable BALL is a sequence from 
1 to 240. 

5. Using the computer, generate a line graph in which 
BOUNCE is on the vertical axis and BALL is on the 
horizontal axis. 

6. On the graph obtained in part (5), draw two horizontal 
lines — one at each of the BOUNCE scores determined 
in part (3). 

7. Examining the graph and the horizontal lines drawn in 
part (6), does it appear that about 90% of the balls 
have BOUNCE scores between the two horizontal 
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Normal Distribution Areas 

This applet allows us to conveniently find the area corre- 
sponding to a selected interval beneath a normal curve that 
has been specified. We can specify the mean by typing it 
into the box at the lower left (be sure to press the Return or 
Enter key while the cursor is still in the text box). Likewise, 
we can specify the standard deviation of the normal curve 
by using the text box at the lower right. 

With the distribution described by its mean and stan- 
dard deviation, we can click on the shaded portion of the 
curve and change the start and end points for the interval. 
The resulting z values and the probability corresponding to 
the shaded area are shown at the top of the applet. 

Applet Exercises 

5.1 Using the text boxes, ensure that the mean is 130 and 
the standard deviation is 30, as in the distribution of flying 
hours in Figure 7.5. (Don't forget to hit the Enter or Return 
key while the cursor is still in the text box.) Next, ensure that 
the left and right boundaries of the shaded area are 70 and 
190, respectively; if necessary, click at the left and right 
edges of the shaded area and drag them. Observing the 
display at the top, what is the probability that a randomly 
selected plane will have flown between 70 and 190 hours 
during the year? What values of z correspond to 70 hours 
and 190 hours, respectively? 

5.2 With the mean and standard deviation set at 130 and 
30, respectively, drag the edges of the shaded area so that 
its left boundary is at 130 and its right boundary is at 190. 




Raw: P[70.0 <= X <= 190.0] = 0.95 
Z P[-2.0 <=Z<= 2.0| = 0.95 




Mean 



What is the probability that a randomly selected plane will 
have flown between 130 and 190 hours during the year? 
What values of z correspond to 1 30 hours and 1 90 hours, 
respectively? 

5.3 By dragging the left and right edges of the shaded area, 
determine the probability that a randomly selected plane 
will have flown between 140 and 170 hours during the 
year. What values of z correspond to 140 hours and 170 
hours, respectively? 

5.4 Drag the left and right edges of the shaded area so that 
the left boundary corresponds to z = 0 and the right 
boundary corresponds to z = +1.0. What probability is 
now associated with the shaded area? 



lines you've drawn, as the normal distribution would 
suggest when /x = 30.00 and a = 2.00? 
8. Comparing the BOUNCE scores when BALL = 1 
through 200 to those when BALL = 201 through 240, 
does it appear that the process may have changed in 
some way toward the end of the work shift? In what 
way? Does it appear that the machine might be in need 
of repair or adjustment? If so, in what way should the 



adjustment alter the process as it appeared to exist at 
the end of the work shift? 

NOTE: Items 5 through 8 contain elements of the topic 
known as statistical process control. It is an important 
technique that will be examined more completely in 
Chapter 20. 



Normal Approximation to Binomial Distribution 



This applet demonstrates the normal approximation to the 
binomial distribution. We specify the desired binomial distri- 
bution by entering the sample size (n) and the population 
proportion {ir, shown in the applet as pi). We enter the k 
value of interest into the text box at the lower left. Once 
again, when using the text boxes, remember to press Enter 
or Return while the cursor is still within the box. Once we 
have specified n, n, and k, we see two results: (1) the bino- 
mial probability P(x < k), shown in the Prob text box, and (2) 
the corresponding approximation using the normal distribu- 
tion. Please note that probabilities shown in the applet may 
sometimes differ very slightly from those we might calculate 
using the pocket calculator and printed tables. This is because 
the applet is using more exact values than we are able to 
show within printed tables like the binomial and normal 
tables in the text. 

Applet Exercises 

6.1 With n = 15 and ir = 0.6, as in the example in Sec- 
tion 7.4, what is the probability that there are no more than 
k = 9 females in the sample of 1 5 walkers? How does this 



compare with the corresponding probability using the nor- 
mal approximation to the binomial distribution? 

6.2 With n = 5 and tt = 0.6, what is the actual binomial 
probability that there are no more than k = 3 females in the 
sample of 5? How does this compare with the correspond- 
ing probability using the normal approximation? What ef- 
fect has the smaller sample size had on the closeness of the 
approximation? 

6.3 With n = 1 00 and ir = 0.6, what is the actual binomial 
probability that there are no more than k = 60 females in 
the sample of 1 00? How does this compare with the corre- 
sponding probability using the normal approximation? 
What effect has the larger sample size had on the closeness 
of the approximation? 

6.4 Repeat Applet Exercise 6.1 for k values of 6 through 12, 
in each case identifying the actual binomial probability that 
there will be no more than k females in the sample. For each 
value of k, also note the normal approximation result and its 
closeness to the actual binomial probability. 





ProcEss Control and the Sampling 
Distribution oFthe Mean 

When an industrial process is operating properly, it is said to be "in control." 
However, random variations from one sample to the next can lead to sample means 
that differ both from each other and from the assumed population mean, fi. 
Applying the concepts introduced in this chapter, we can state that if the process 
is in control, the means of samples of n > 30 should tend to be normally distributed, 
with ix- = /J, and a- = a/vn. 

As long as the process is in control, there is a 95.5% chance that a sample mean 
will be within z = ±2 standard errors of the population mean, a 99.73% chance that 
it will be within z = ±3 standard errors of the population mean, and so on. Using this 
approach, "control limits" can be established to monitor the production process. 

For example, if a filling machine is known to have a 

■ximity to 



process standard deviation of a = 0.1 ounces 
and is set to fill boxes with 1 2.0 ounces of 
product, the 95.5% ("two-sigma") control 
limits for the means of samples of n = 30 
would be 



i their pro: 



12.0 ± 2.0 



0.1 



'30 



12.0 ± 0.037 ounces 



or from 1 1 .963 to 1 2.037 ounces 

If sample means consistently tend to fall 
outside these limits, this suggests that the pop- 
ulation mean may have "drifted" from the 
desired 12.0 ounces and that a filling-quantity 
adjustment is in order. Such use of what is called 
an x control chart will be examined further in 
Chapter 20, Total Quality Management. 
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Part 3: Sampling Distributions and Estimation 



learning 
objectives 

After reading this 
chapter, you should 
be able to: 



Understand that the sample mean or the sample proportion can be considered 
a random variable. 

Understand and determine the sampling distribution of means for samples 
from a given population. 

Understand and determine the sampling distribution of proportions for 
samples from a given population. 

Explain the central limit theorem and its relevance to the shape of the 
sampling distribution of a mean or proportion. 

Determine the effect on the sampling distribution when the samples are rela- 
tively large compared to the population from which they are drawn. 



8.1 INTRODUCTION 



In the two preceding chapters, we discussed populations in terms of ( 1 ) whether they 
represented discrete versus continuous random variables and (2) their specific prob- 
ability distributions and the shape of these distributions. In both chapters, we em- 
phasized entire populations. More often, however, we must use sample data to gain 
insight into the characteristics of a population. Using information from a sample to 
reach conclusions about the population from which it was drawn is known as infer- 
ential statistics. The purpose of this chapter is to examine how sample means and 
proportions from a population will tend to be distributed, and to lay the ground- 
work for the inferential statistics applications in the chapters that follow. 

To underscore the pivotal importance of this chapter, let's specifically consider 
it in terms of the chapters that immediately precede and follow: 

• In Chapter 7, we had access to the population mean, and we made probabil- 
ity statements about individual x values taken from the population. 

• In this chapter, we again have access to the population mean, but we will be 
making probability statements about the means of samples taken from the 
population. (With this chapter, the sample mean itself is now considered as a 
random variable!) 

• In Chapter 9, we will lack access to the population mean, but we will begin 
using sample data as the basis from which to make probability statements 
about the true (but unknown) value of the population mean. 

In essence, this chapter is a turning point where we begin the transition from (a) 
knowing the population and making probability statements about elements taken 
from it, to (b) not knowing the population, but making probability statements 
about the population based on sample data taken from it. 



8.2 A PREVIEW OF SAMPLING DISTRIBUTIONS 



As suggested in Section 8.1, we're now going to escalate to a new level and con- 
sider the sample mean itself as a random variable. If we were to take a great many 
samples of size n from the same population, we would end up with a great many 
different values for the sample mean. The resulting collection of sample means 
could then be viewed as a new random variable with its own mean and standard 
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deviation. The probability distribution of these sample means is called the distri- 
bution of sample means, or the sampling distribution of the mean. 

For any specified population, the sampling distribution of the mean is defined 
as the probability distribution of the sample means for all possible samples of that 
particular size. Table 8.1 provides an introduction to the concept of the sampling 
distribution of the mean. Referring to parts A-C of Table 8.1: 

• Part A The population. Each of the four members of the population has a 
given number of bottles of Diet Pepsi in his or her refrigerator. Bill has 1, 
Carl has 1, Denise has 3, and Ed has 5. The probability distribution of the dis- 
crete random variable, x = number of bottles of Diet Pepsi in the refrigerator, 
is shown in this portion of Table 8.1, and /u = (1 + 1+ 3 + 5)/4, or /jl = 2.5 
bottles of Diet Pepsi. 

• Part B All possible simple random samples of n = 2 from this population, 
along with the probability that each possible sample might be selected. With 
the simple random sampling technique, each sample has the same probability 



A. The Population, Its Probability Distribution, and Its Mean 
Person x 



Bill 
Carl 
Denise 
Ed 



1 
1 

3 
5 



0.50 r 

0.25 

0.00 



Mean: fi = (1 + 1 + 3 + 5)/4 = 2.5 ■ 



TABLE 8.1 



Although the population 
is very small, parts (A) 
through (C) show the basic 
principles underlying the 
sampling distribution of the 
mean. The discrete random 
variable isx = the number 
of bottles of Diet Pepsi in 
the person's refrigerator. 



B. All possible simple random samples ofn = 2 

Probability of 

Sample Mean of this Sample Selecting this Sample 



Bill, Carl 


x = 


(1 + 1)/2 


= 1.0 


1/6 


Bill, Denise 


x = 


(1 + 3)/2 


= 2.0 


1/6 


Bill, Ed 


x = 


(1 + 5)/2 


= 3.0 


1/6 


Carl, Denise 


x = 


(1 + 3)/2 


= 2.0 


1/6 


Carl, Ed 


x = 


(1 + 5)/2 


= 3.0 


1/6 


Denise, Ed 


x = 


(3 + 5)/2 


= 4.0 


1/6 



C. The Probability Distribution of the Sample Means (the Sampling Distribution of 
the Mean) 

2/6 ,- 
1/6 - 



0 



0 



I 



a_=2.5- 



The mean of the sample means, /jl-, is calculated as 



fju- = 1(1 .0) + |(2.0) + J(3.0) + 1(4.0) = 2.5 bottles 
6 6 6 6 



246 



Part 3: Sampling Distributions and Estimation 



of selection. The number of combinations of 4 elements taken 2 at a time is 6, 
the number of different simple random samples possible for n = 2. In this 
portion of the table, we also have the mean of each possible sample. 
• Part C The probability distribution of the sample means (i.e., the sampling 
distribution of the mean), along with /jl- = the mean of the sampling distribu- 
tion. Note that pu- is the same as /jl, or 2.5 bottles. 

The sampling distribution of the mean has four important characteristics: 

1. The sampling distribution of the mean will have the same mean as the origi- 
nal population from which the samples were drawn [i.e., E(x) = p.- = p]. 

2, The standard deviation of the sampling distribution of the mean is referred to 
as the standard error of the mean, or a-. For the sampling distribution in 
Table 8.1, a- can be calculated in the same way as the standard deviations in 
Chapter 6, as the positive square root of its variance: 



a\ = E[(x - p,) 1 ] = (1.0-2.5) 



+ (3.0 - 2.5) 



+ (4.0 - 2.5) 



+ (2.0 - 2.5) 2 
= 0.917 



and 



V0.917 = 0.958 bottles 



3. If the original population is normally distributed, the sampling distribution of 
the mean will also be normal. 

4. If the original population is not normally distributed, the sampling distribu- 
tion of the mean will be approximately normal for large sample sizes and will 
more closely approach the normal distribution as the sample size increases. 
Known as the central limit theorem, this result will be an especially important 
topic in the next section. 

When a discrete random variable is the result of a Bernoulli process, with 
x = the number of successes in n trials, the result can be expressed as a sample 
proportion, p: 

_ Number of successes _ x 
^ Number of trials n 

Like the sample mean, the sample proportion can also be considered a ran- 
dom variable that will take on different values as the procedure leading to the 
sample proportion is repeated a great many times. The resulting probability dis- 
tribution is called the sampling distribution of the proportion, and it will also 
have an expected value (it, the probability of success on any given trial) and a 
standard deviation (a ). These will be discussed in Section 8.4. 




8.1 What is the difference between a probability 
distribution and a sampling distribution? 

8.2 What is the difference between a standard deviation 
and a standard error? 



8.3 In a simple random sample of 1000 households, 
150 households happen to own a barbecue grill. Based 
on the characteristics of the population, the expected 
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number of grill owners in the sample was 180. What are 
the values of it, p, and «? 

8.4 For a population of five individuals, television owner- 
ship is as follows: 

x = Number of 
Television Sets Owned 



Allen 2 

Betty 1 

Chuck 3 

Dave 4 

Eddie 2 



a. Determine the probability distribution for the discrete 
random variable, x = number of television sets 
owned. Calculate the population mean and standard 
deviation. 

b. For the sample size n = 2, determine the mean for 
each possible simple random sample from the five 
individuals. 

c. For each simple random sample identified in part (b), 
what is the probability that this particular sample 
will be selected? 

d. Combining the results of parts (b) and (c), describe 
the sampling distribution of the mean. 

e. Repeat parts (b)-(d) using n = 3 instead of n = 2. 
What effect does the larger sample size have on the 
mean of the sampling distribution? On the standard 
error of the sampling distribution? 



8.5 Given the following probability distribution for an 
infinite population with the discrete random variable, x: 

x 1 2 

P(x) 0.5 0.5 

a. Determine the mean and the standard deviation of x. 

b. For the sample size n = 2, determine the mean for each 
possible simple random sample from this population. 

c. For each simple random sample identified in part (b), 
what is the probability that this particular sample 
will be selected? 

d. Combining the results of parts (b) and (c), describe 
the sampling distribution of the mean. 

e. Repeat parts (b)-(d) using a sample size of n = 3 
instead of n = 2. What effect does the larger sample 
size have on the mean of the sampling distribution? 
On the standard error of the sampling distribution? 

8.6 The operator of a museum exhibit has found that 30% 
of the visitors donate the dollar that has been requested to 
help defray costs, with the others leaving without paying. 
Three visitors are on their way into the exhibit. Assuming 
that they will be making independent decisions on whether 
to donate the dollar that has been requested: 

a. Determine the mean and standard deviation for 
x = the amount paid by individual visitors. 

b. Determine the mean and the standard deviation for 
the sampling distribution of x for samples consisting 
of 3 visitors each. 



THE SAMPLING DISTRIBUTION OF THE MEAN 8.3 



When the Population Is Normally Distributed 

When a great many simple random samples of size n are drawn from a population 
that is normally distributed, the sample means will also be normally distributed. 
This will be true regardless of the sample size. In addition, the standard error of the 
distribution of these means will be smaller for larger values of n. In summary: 

Sampling distribution of the mean, simple random samples from a normally dis- 
tributed population: 

Regardless of the sample size, the sampling distribution of the mean will be nor- 
mally distributed, with 

• Mean: 

E(x) = ix- = fi 

• Standard error: 

a- = —7= where fi = population mean 

a = population standard deviation 
n = sample size 
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The standard deviation (referred to as the standard error) of the sample means 
will be smaller than that of the original population. This is because each sample 
mean is the average of several values from the population, and relatively large or 
small values of x are being combined with values that are less extreme. 



example 



Sampling Distribution 

In Chapter 7, we reported that the average annual number of hours flown by 
general-aviation aircraft was 130 hours. 1 For purposes of our discussion here, we 
will continue to assume that the flying hours for these aircraft are normally dis- 
tributed with a standard deviation of 30 hours. 

SOLUTION 

Assuming a normal distribution with jx = 130.0 hours and a = 30.0 hours, part (a) 
of Figure 8.1 shows the shape of the population distribution, while part (b) shows 
the sampling distribution of the mean for simple random samples of n = 9 aircraft 
each. Notice how the sampling distribution of the mean is much narrower than the 
distribution of flying hours for the original population. While the population x val- 
ues have a standard deviation of 30.0 hours, the sample means have a standard error 
of just 30.0/V9, or 10.0 hours. 

In part (c) of Figure 8.1, we see how the sampling distribution of the mean be- 
comes even narrower when the sample size is increased. For simple random sam- 
ples of size n = 36, the sample means have a standard error of only 30.0/V36, or 
5.0 hours. 



Just as we can use the z-score to convert a normal distribution to a standard 
normal distribution, we can also convert a normally distributed sampling distri- 
bution of means to a standard normal distribution. The z-score formula for doing 
so is very similar to the one introduced in the previous chapter: 

Z-score for the sampling distribution of the mean, normally distributed 
population: 

z = — where z = distance from the mean, measured 

* in standard error units 

x = value of the sample mean in which 

we are interested 
/jl = population mean 
(T- = standard error of the sampling 

distribution of the mean, or a/\/n 



Source: General Aviation Manufacturers Association, General Aviation Statistical Databook, 2005 
Edition, p. 20. 
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FIGURE 8.1 




For a normal population, 
the sampling distribution 
of x will also be normal 
regardless of the sample 
size. As n increases, the 
standard error of the 
sampling distribution will 



40 50 60 70 80 90 100 110 120 130 140 150 160 170 180 190 200 210 220 become smaller, 

(a) The population 




/j,^ = /j, = 130 hours 
a_ = cr/Vn = 30/V9 
= 10 hours 



40 50 60 70 80 90 100 110 120 130 140 150 160 170 180 190 200 210 220 
(b) The sampling distribution of x when n = 9 




/j.- = /jl = 1 30 hours 
cr_= <W^=30/V36 
= 5 hours 



40 50 60 70 80 90 100 110 120 130 140 150 160 170 180 190 200 210 220 
(c) The sampling distribution of x when n = 36 



Example 




, r. •£ ; ■> ~ f • 





z-Score 

For this example, we have a population of general-aviation aircraft, with the 
assumption of /jl = 130.0 flying hours and a = 30.0 flying hours. 



SOLUTION 

Two important notes: 

1. This is the same population of aircraft we discussed in Chapter 7. In Chapter 7, 
we were making probability statements about individual planes selected from 
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FIGURE 8.2 



For a simple random sample 
of n = 36 from the aircraft 
flying-hours population, the 
probability is just 0.0548 
that the sample mean will 
be 1 38 hours or more. 



P{x> 138) = P(z> 1.60) 

or 0.5000 (Area from z = 
-0.4452 (Area from z = 

= 0.0548 



0.00 to +oo) 
0.00 to z= 1.60) 




/j. x = 130 hours 
<r_ = 5 hours 



(<r_ units) 
(hours) 



For x= 138 hours, z = = 138 ~ 130 = 1.60 



the population. Here we are making probability statements about the means 
of samples of planes taken from the original population. 
2. In Chapter 7, we dealt with a population consisting of individual planes. Here 
we are considering a population consisting of sample means — this is what is 
meant by the sampling distribution of the mean. 

For a Simple Random Sample of 36 Aircraft, What Is the Probability That 
the Average Flying Time for the Aircraft in the Sample Was at Least 
138.0 Hours? 

As calculated previously, the standard error of the sampling distribution of the 
mean for samples of this size is cr- = 30.0/V36, or 5.0 hours. We can now pro- 
ceed to calculate the z-score corresponding to a sample mean of 138.0 hours: 



/J, 138.0 - 130.0 



5.0 



^0 
5.0 



1.60 



Referring to the standard normal table, the area from z = 0.00 to z = 1.60 is 
0.4452. Since the total area to the right of the midpoint is 0.5000, the probability 
that x will be at least 138.0 hours is 0.5000 - 0.4452, or 0.0548, as shown in 
Figure 8.2. 

As with our use of the z-score in finding areas beneath the standard normal 
curve in the preceding chapter, many other possibilities are available for identify- 
ing probabilities associated with the sample mean. For example, you may wish to 
verify on your own that P( 133.0 s x s 139.0) = 0.2384 for a simple random 
sample of n = 36 aircraft. 
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When the Population Is Not Normally Distributed 

The assumption of normality for a population isn't always realistic, since in many 
cases the population is either not normally distributed or we have no knowledge 
about its actual distribution. However, provided that the sample size is large (i.e., 
n > 30), the sampling distribution of the mean can still be assumed to be normal. 
This is because of what is known as the central limit theorem: 

Central limit theorem For large, simple random samples from a population that 
is not normally distributed, the sampling distribution of the mean will be approxi- 
mately normal, with the mean ^ = n and the standard error = cr/Vn. As the 
sample size (n) is increased, the sampling distribution of the mean will more closely 
approach the normal distribution. 

As shown in Figure 8.3, regardless of how a population happens to be dis- 
tributed, the central limit theorem enables us to proceed as though the samples 



FIGURE 8.3 



When populations are 
not normally distributed, 
the central limit theorem 
states that the sampling 
distribution of the mean 
will be approximately 
normal for large simple 
random samples. 



For large simple random samples, the sample means 
will be approximately normally distributed 




<7- = crNn 

X 
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were being drawn from a population that is normally distributed. 2 The only re- 
quirement here is that the sample size be sufficiently large. In Chapter 9, we will 
consider the constraint that <r is known. As a practical matter, when the sample 
size is at least 30, the "sufficiently large" requirement is assumed to have been 
met. As the definition of the central limit theorem indicates, the approximation to 
the normal distribution becomes even better for larger samples. Refer to Seeing 
Statistics Applets 7 and 8, at the end of the chapter, for visual demonstrations of 
the sampling distribution of the mean for fair and unfair dice, respectively. 

The central limit theorem is basic to the concept of statistical inference because it 
permits us to draw conclusions about the population based strictly on sample 
data, and without having any knowledge about the distribution of the underlying 
population. Such inferences are important to the topics of the next three chapters, 
and they also play a key role in the application of statistics to total quality man- 
agement. This chapter's opening vignette is a process-control application to be 
examined in more detail in Chapter 20. 



exercises 




8.7 A random variable is normally distributed with mean 
(jl = $1500 and standard deviation a = $100. Determine 
the standard error of the sampling distribution of the 
mean for simple random samples with the following 
sample sizes: 

a. n = 16 b. n = 100 

c. n = 400 d. n = 1000 

8.8 For a random variable that is normally distributed, 
with fx, = 200 and a = 20, determine the probability 
that a simple random sample of 4 items will have a 
mean that is 

a. greater than 210. 

b. between 190 and 230. 

c. less than 225. 

8.9 For a random variable that is normally distributed, 
with ft = 80 and a = 10, determine the probability that 
a simple random sample of 25 items will have a mean 
that is 

a. greater than 78. 

b. between 79 and 85. 

c. less than 85. 

8.10 Employees in a large manufacturing plant worked 
an average of 62.0 hours of overtime last year, with a 
standard deviation of 15.0 hours. For a simple random 
sample of n = 36 employees and x = the number of 



'MiL 

overtime hours worked last year, determine the z-score 
corresponding to each of the following sample means: 
a. x = 55.0 hours b. x = 60.0 hours 

c. x = 65.0 hours d. x = 47.0 hours 

8.11 For the manufacturing plant discussed in 
Exercise 8.10, the union president and the human 
resources director jointly select a simple random sample 
of 36 employees to engage in a discussion with regard to 
the company's work rules and overtime policies. What is 
the probability that the average number of overtime 
hours last year for members of this sample would have 
been less than 65.0 hours? Between 55.0 and 65.0 hours? 

8.12 Shirley Johnson, a county political leader, is looking 
ahead to next year's election campaign and the possibility 
that she might be forced to support a property tax 
increase to help combat projected shortfalls in the county 
budget. For the dual purpose of gaining voter support 
and finding out more about what her constituents think 
about taxes and other important issues, she is planning to 
hold a town meeting with a simple random sample of 40 
homeowners from her county. For the more than 100,000 
homes in Shirley's county, the mean value is $190,000 
and the standard deviation is $50,000. What is the prob- 
ability that the mean value of the homes owned by those 
invited to her meeting will be greater than $200,000? 



2 The central limit theorem applies to any distribution that has a nonzero variance. If there is zero 
variance in the population, sample means will all have the same value, hence they cannot be normally 
distributed under this (rare) condition. 
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8.13 An industrial crane is operated by four electric mo- 
tors working together. For the crane to perform properly, 
the four motors have to generate a total of at least 380 
horsepower. Engineers from the plant that produces the 
electric motors have stated that motors off their assembly 
line average 100 horsepower, with a standard deviation 
of 10 horsepower. The four electric motors on the crane 
have just been replaced with four brand new motors from 
the motor manufacturer. A safety engineer is concerned 
that the four motors might not be powerful enough to 
properly operate the crane. Stating any additional 
assumptions you are using, does the engineer's concern 
appear to be justified? 

8.14 From past experience, an airline has found the 
luggage weight for individual air travelers on its trans- 
Atlantic route to have a mean of 80 pounds and a 
standard deviation of 20 pounds. The plane is 
consistently fully booked and holds 100 passengers. 
The pilot insists on loading an extra 500 pounds of fuel 
whenever the total luggage weight exceeds 8300 pounds. 
On what percentage of the flights will she end up having 
the extra fuel loaded? 

8.15 A hardware store chain has just received a truckload 
of 5000 electric drills. Before accepting the shipment, 

the purchasing manager insists that 9 of the drills be 
randomly selected for testing. He intends to measure the 
maximum power consumption of each drill and reject 
the shipment if the mean consumption for the sample is 
greater than the 300 watts listed on the product label. 
Unknown to the purchasing manager, the drills on the 
truck require an average of 295 watts, with a standard 
deviation of 12 watts. Stating any additional assumptions 
you are using, find the probability that the truckload of 
drills will be rejected. 

8.16 The tensile strength of spot welds produced by a 
robot welder is normally distributed, with a mean of 
10,000 pounds per square inch and a standard deviation 
of 800 pounds per square inch. For a simple random 
sample of n = 4 welds, what is the probability that the 
sample mean will be at least 10,200 pounds per square 
inch? Less than 9900 pounds per square inch? 



8.17 At a department store catalog orders counter, the 
average time that a customer has to wait before being 
served has been found to be approximately exponentially 
distributed, with a mean (and standard deviation) of 

3.5 minutes. For a simple random sample of 36 recent 
customers, invoke the central limit theorem and 
determine the probability that their average waiting 
time was at least 4.0 minutes. 

8.18 It has been reported that the average U.S. office 
worker receives 14 fax messages per day. For purposes 
of this exercise, we will assume that the daily number 
of fax messages is normally distributed with a standard 
deviation of 5.0 messages. For an office in which 

10 workers are employed, and considering these persons 
as a simple random sample of all U.S. office workers, 
what is the probability that the office will receive at least 
160 fax messages (i.e., a sample mean of at least 16 faxes 
per worker) next Wednesday? source: Anne R. Carey and 
Genevieve Lynn, "Message Overload?" USA Today, September 13, 1999, 
p. 1B. 

8.19 The average length of a hospital stay in general or 
community hospitals in the United States is 5.7 days. 
Assuming a population standard deviation of 2.5 days 
and a simple random sample of 50 patients, what is the 
probability that the average length of stay for this group 
of patients will be no more than 6.5 days? If the sample 
size had been 8 patients instead of 50, what further 
assumption(s) would have been necessary in order to 
solve this problem? source: Bureau of the Census, Statistical 
Abstract of the United States 2006, p. 119. 

8.20 According to the Pew Research Center for the 
People and the Press, persons in higher age groups tend 
to spend much more time reading the newspaper. For per- 
sons 65 or more years of age who read a newspaper, the 
average time has been reported as 33 minutes. Assuming 
a population standard deviation of 10.0 minutes and a 
simple random sample of 30 newspaper readers in the 
65-or-over age group, what is the probability that mem- 
bers of this group will average at least 35 minutes reading 
their next newspaper? source: Anne R. Carey and Web Bryant, 
"Speed Readers," USA Today Snapshot Calendar, May 20, 1999. 



THE SAMPLING DISTRIBUTION OF THE PROPORTION 8.4 



As mentioned earlier, the proportion of successes in a sample consisting of n trials 
will be 

Number of successes x 

Sample proportion = p = — — : — — = — 

Number of trials n 

Whenever both mi and n(l — tt) are S 5, the normal distribution can be 
used to approximate the binomial, an application examined in Chapter 7. When 
these conditions are satisfied and the procedure leading to the sample outcome is 
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repeated a great many times, the sampling distribution of the proportion (p) will 
be approximately normally distributed, with this approximation becoming better 
for larger values of n and for values of 77 that are closer to 0.5. 

Like the sampling distribution of the mean, the sampling distribution of the pro- 
portion has an expected value and a standard error. Also, as the size of the sample 
becomes larger, the standard error becomes smaller. For simple random samples, 
the sampling distribution of the proportion can be described as follows: 

• Sampling distribution of the proportion, p: 

Mean = E(p) = tt 



Standard error = cr p = y — " where tt = population proportion 

n = sample size 

Z-score for a given value of p: 
_ p — TT 

z — where p = the sample proportion value of interest 



example 



Sampling Distribution, Proportion 

According to the Bureau of the Census, 85.1% of adult residents of Arizona have 
completed high school. 3 

SOLUTION 

What Is the Probability That No More Than 80% of the Persons in a Simple 
Random Sample of 200 Adult Residents of Arizona Have Finished High 
School (i.e., the Probability That p < 0.800)? 

The expected value of the sampling distribution of the proportion is tt = 0.851. 
This, along with the sample size of n = 200, is used in determining the standard 
error of the sampling distribution and the z-score corresponding to a sample pro- 
portion of p = 0.800: 



Standard error = o\ 



77(1 



0.851(1 - 0.851) 



200 



0.0252 



77 0.800 - 0.851 



0.0252 



-2.02 



Referring to the standard normal distribution table, the area from z = 0.00 to 
z = —2.02 is 0.4783. Since the total area to the left of the midpoint is 0.5000, the area 
in the left tail is 0.5000 - 0.4783 = 0.0217, as shown in Figure 8.4. The probability 
that no more than 80% of these 200 people are high school graduates is 0.0217. 



^Source: Bureau of the Census, Statistical Abstract of the United States 2002, p. 141. 



Chapter 8: Sampling Distributions 



255 



P(p< 0.800) =P(z< -2.02) 

or 0.5000 (Area from z = 
-0.4783 (Area from z = 



0.00 to -°°) 
0.00 to z = -2.02) 




P- 



-2.02 
0.800 



0.00 
0.851 



[xr units) 



FIGURE 8.4 



The population proportion 
{it) of adult residents of 
Arizona who have finished 
high school is 0.851 . For a 
simple random sample of 
200 adults from that state, 
the probability is 0.0217 
that the proportion of 
sample members who are 
high school graduates will 
be no more than 0.800. 



For p = 0.800, 



P-tt = 0.800-0.851 
°"p " 0.0252 



-2.02 



Although the binomial distribution is discrete and the normal continuous, we have 
not used the correction for continuity in reaching this solution. As Chapter 7 
explained, such correction makes little difference in the results when sample sizes 
are large and the number of successes can take on a large number of possible val- 
ues. Because of the sample sizes typically involved when dealing with the sampling 
distribution of the proportion, this correction is generally not employed in these 
applications and will not be necessary in the exercises that follow. 



NOT 



IP* exercises 

on ~sn cm Vi *" 




8.21 Of the 53,309 salaried U.S. employees of Ford 
Motor Company, 26.5% are females. For simple random 
samples of size n, with n > 30, describe the sampling dis- 
tribution of p = the proportion of females in the sample. 

source: Ford Motor Company, 1999 Annual Report, p. 73. 

8.22 Shirley Johnson, the county political leader discussed 
in Exercise 8.12, is a Republican. According to voter reg- 
istration records, 60% of the homeowners in her county 
are also Republicans. For the simple random sample of 
40 homeowners invited to attend her town meeting, 
describe the sampling distribution of p = the proportion 
of Republicans among those invited to the meeting. 

8.23 A simple random sample with n = 300 is drawn 
from a binomial process in which it = 0.4. Determine the 
following probabilities for p = the proportion of successes: 
a. P(p > 0.35) 



b. P(0.38 < p < 0.42) 

c. P(p < 0.45) 

8.24 A simple random sample with n = 200 is drawn 
from a binomial process in which it = 0.6. Determine the 
following probabilities for p = the proportion of successes: 

a. P(0.50 < p) 

b. P(0.65 < p < 0.70) 

c. P(p > 0.70) 

8.25 It has been reported that 42.6% of U.S. workers 
employed as purchasing managers are females. In a 
simple random sample of U.S. purchasing managers, 70 
out of the 200 are females. Given this information: source: 
Bureau of the Census, Statistical Abstract of the United States 2002, p. 38 1 . 

a. What is the population proportion, it? 

b. What is the sample proportion, p? 

c. What is the standard error of the sample proportion? 
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d. In the sampling distribution of the proportion, what 
is the probability that a sample of this size would 
result in a sample proportion at least as large as that 
found in part (b)? 

8.26 It has been estimated that 17.0% of mutual fund 
shareholders are retired persons. Assuming the 
population proportion to be tt = 0.17 and that a simple 
random sample of 400 shareholders has been selected: 
source: Investment Company Institute, 1 999 Mutual Fund Fact Book, p. 45. 

a. What is the expected value of p = the proportion 
of those in the sample who are retired? 

b. What is the standard error of the sampling distribu- 
tion of the proportion, p? 

c. What is the probability that at least 20% of those 
in the sample will be retired? 

d. What is the probability that between 15% and 
25% of those in the sample will be retired? 

8.27 A mail-order discount outlet for running shoes has 
found that 20% of the orders sent to new customers end 
up being returned for refund or exchange because of 
size or fit problems experienced by the customer. The 
100-member Connor County Jogging Society has just 
found out about this mail-order source and each member 
has ordered a pair of running shoes. The group-orders 
manager of the mail-order firm has promised the club a 
15% discount on its next order if 10 or more of the pairs 
ordered must be returned because of size or fit problems. 
What is the probability that the club will get 15% off on 
its next order? 

8.28 The Temployee Company supplies temporary work- 
ers to industry and has found that 58% of the positions it 
is asked to fill require a knowledge of Microsoft Word. 
The firm currently has 25 individuals available who 
know Microsoft Word. If Temployee Company receives 
50 inquiries for temporary workers next week, how likely 
is it that the company will not have enough Microsoft 
Word-skilled people on hand? 



8.29 The absentee rate for bus drivers in a large school 
district has been 5% in recent years. During the past few 
months, the school district and the bus drivers' union 
have been engaged in negotiations for a new contract. 
Negotiations are at an impasse over several key issues, 
and the drivers already have worked 6 months past the 
termination of their previous labor contract with the 
district. Yesterday's negotiating session was particularly 
contentious, with members of both sides engaging in 
heated debate and name-calling. Of the 400 bus drivers 
who serve the school district, 60 of them called in sick 
this morning, an absentee rate of 15%. The school 
district has accused the union of collusion in carrying 
out a work stoppage and the union has responded that 
the school district is just trying to make a big deal out of 
more people than usual happening to be sick on a given 
day. Given the historical absentee rate, what is the proba- 
bility that a daily absentee rate would happen to be 15% 
or more? Comment on the competing claims of the 
school district and the bus drivers' union. 

8.30 According to the Federal Trade Commission (FTC), 
about 3.6% of sale-priced items come up at the wrong price 
when scanned at the checkout counter. Freda Thompson, 
director of investigations for her state's consumer affairs 
office, sets up a study in which she has undercover shoppers 
(sometimes known as "mystery shoppers") randomly select 
and purchase 600 sale-priced items from Wal-Mart depart- 
ment stores across her state. Suppose that researchers find 
18 of the 600 sale items have been incorrectly priced by the 
checkout scanner, an error rate of just 3%. If Wal-Mart's 
overall rate of scanning errors is the same as the rate cited 
by the FTC, what is the probability that no more than 3% 
of the sale items purchased by Freda's investigators would 
be incorrectly priced when they are scanned? Given the as- 
sumed sample result, comment on whether Wal-Mart might 
be typical of other companies in terms of their error rate for 
scanning sale items, source: "Scanner Pricing Accuracy Improves," 
Indiana Gazette, December 16, 1998, p. 4. 



8.5 SAMPLING DISTRIBUTIONS WHEN 

THE POPULATION IS FINITE 



When sampling is without replacement and from a finite population, the tech- 
niques we've discussed must be modified slightly. The purpose is to arrive at a 
corrected (reduced) value of the standard error for the sampling distribution. To 
better appreciate why this is necessary, consider the possibility of selecting a sam- 
ple of 990 persons from a population of just 1000. Under these conditions, there 
would be practically no sampling error because the sample would include almost 
everyone in the population. 

Whether we are dealing with the sampling distribution of the mean (3c) or the 
sampling distribution of the proportion (p), the same correction factor is applied. 
This factor depends on the sample size (n) versus the size of the population (N) and, 
as a rule of thumb, should be applied whenever the sample is at least 5% as large as 
the population. When n < 0.05N, the correction will have very little effect. 
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Standard error for the sample mean when sampling without replacement from 
a finite population: 



a- = —7= • y- where n = sample size 

N = population size 

cr = population standard deviation 

Standard error for the sample proportion when sampling without replace- 
ment from a finite population: 



o\ = \/ • \/ — where « = sample size 

P \ n V N - 1 XT 

A/ = population size 

7T = population proportion 

Note that the first term in each of the preceding formulas is the same as we 
would have used earlier in calculating the standard error. The second term 
in each case is the finite population correction factor, and its purpose is to re- 
duce the standard error according to how large the sample is compared to the 
population. 



example 



Finite Populations 

Of the 629 passenger vehicles imported by a South American country in a recent 
year, 117 were Volvos. A simple random sample of 300 passenger vehicles imported 
during that year is taken. 

SOLUTION 

What Is the Probability That at Least 15% of the Vehicles in This Sample 
Will Be Volvos? 

For the 629 vehicles imported, the population proportion of Volvos is tt = 
117/629, or 0.186. The sample size (n = 300) is more than 5% of the size of the 
population (N = 629), so the finite population correction factor is included in the 
calculation of the standard error of the proportion: 



This is the 


same as 


would have been 


calculated 


in 


Section 8.4. 


y 


1 



77(1 



N - n 



N 



The finite population correction term, 
this will be smaller than 1.0 so the 
standard error will be less than if an 
infinite population were involved. 



T 



0.186(1 - 0.186) a 629 - 300 



300 



629 - 1 



0.0225 • 0.7238 = 0.0163 
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Note that the correction factor has done its job. The resulting standard error 
is 72.38% as large as it would have been for an infinite population. Having ob- 
tained a = 0.0163, we can now determine the z-score that corresponds to the 
sample proportion of interest, p = 0.15: 

_ p - 77 _ 0.15 - 0.186 
Z ~ a p 0.0163 " 

Using the standard normal distribution table, the area from z = 0.00 to 
Z = —2.21 is the same as from z = 0.00 to z = +2.21, or 0.4864. The probabil- 
ity that at least 15% of the cars in the sample will be Volvos is 0.5000 (the area 
to the right of the midpoint) + 0.4864, or 0.9864. 



For problems where we happen to be dealing with a sample mean, the same finite 
population correction approach is used. As with the sample proportion, the correc- 
tion term will be less than 1.0 in order to reduce the value of the standard error. 
Again, whenever n < 0.05N, the correction will have little effect. For example, at 
the extreme for this rule of thumb, if n = 100 and N = 2000, the correction term 
will have a value of 0.975. The resulting standard error will be very close to that 
which would have been determined under the assumption of an infinite population. 




8.31 What is meant by a finite population and how can this 
affect the sampling distribution of a mean or a proportion? 

8.32 Under what circumstances is it desirable to use the 
finite population correction factor in describing the 
sampling distribution of a mean or a proportion? 

8.33 Compared to the situation in which the population 
is either extremely large or infinite in size, what effect 
does the use of the finite population term tend to have 
on the resulting value of the standard error? 

8.34 A population of 500 values is distributed such that 
/u = $1000 and cr = $400. For a simple random sample 
of n = 200 values selected without replacement, describe 
the sampling distribution of the mean. 

8.35 A civic organization includes 200 members, who have 
an average income of $58,000, with a standard deviation of 



$10,000. A simple random sample of n = 30 members is 
selected to participate in the annual fund-raising drive. 
What is the probability that the average income of the 
fund-raising group will be at least $60,000? 

8.36 Of the 235 airports available for public use in 
Michigan, 128 are paved and lighted. For a simple 
random sample of n = 50, what is the probability that at 
least 20 of the airports in the sample will be paved and 
lighted? source: General Aviation Manufacturers Association, General 
Aviation Statistical Databook, 2005 Edition, p. 30. 

8.37 A firm's receiving department has just taken in a 
shipment of 300 generators, 20% of which are defective. 
The quality control inspector has been instructed to reject 
the shipment if, in a simple random sample of 40, 15% 
or more are defective. What is the probability that the 
shipment will be rejected? 



8.6 COMPUTER SIMULATION OF 

SAMPLING DISTRIBUTIONS 



Using the computer, we can simulate the collection of simple random samples from 
a population or a probability distribution. For example, Computer Solutions 8.1 de- 
scribes the procedures for using Excel or Minitab to generate a simple random 
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computer solutions 8, 




Sampling Distributions and Computer Simulation 

These procedures simulate 200 samples from a Poisson distribution with a mean of 4.0. In the first 
simulation, the size of each sample is n = 5. In the second simulation, the size of each sample is n = 30. 
The Excel results are displayed in Figure 8.5 and the Minitab results would be similar. 

EXCEL 

1 . Start with a blank worksheet. Click Tools. Click Data Analysis. Click Random Number Generation. Click OK. 

2. Enter 5 into the Number of Variables box. Enter 200 into the Number of Random Numbers box. Within the 
Distribution box, select Poisson. Enter 4.0 into the Lambda box. Select Output Range and enter A2 into the dia- 
log box. Click OK. The 200 simulated samples will be located in A2:E201 , with each row representing one of the 200 
samples. 

3. Click on cell F2 and enter the formula =AVERAGE(A2:E2). Click at the lower right corner of cell F2 and drag down- 
ward to fill the remaining cells in column F. These values are the 200 sample means. Click on cell F1 and enter the title 
for the F column as Means. 

4. Determine the mean of the sample means by clicking on cell G2 and entering the formula =AVERAGE(F2:F201). De- 
termine the standard deviation of the sample means by clicking on cell H2 and entering the formula =STDEV(F2:F201). 

5. Generate the histogram of means when n = 5: Click on cell J1 and enter the title for the J column as Bin. Enter 1.0 
into 12 and 1.2 into J3. Select J2:J3, then click at the lower right corner of cell J3 and drag downward to fill, so that 
7.0 will be in cell J32. 

6. Click Tools. Click Data Analysis. Select Histogram. Click OK. Enter F1:F201 into Input Range. Enter J1:J32 into Bin 
Range. Select Labels. Select Output Range and enter K1 into the dialogue box. Select Chart Output. Click OK. 
Note: In fine-tuning the histogram, we can eliminate the gaps between the bars by right-clicking on one of the bars, 
selecting Format Data Series, selecting Options, then reducing Gap Width to 0 and clicking OK. The histogram is 
shown in Figure 8.5, part (b). 

7. For samples with n = 30, the procedure for generating 200 samples, their means, and the histogram of the sample 
means will be similar to the steps shown above. For example, in step 1 , we would enter 30 into the Number of Vari- 
ables box and enter 200 into the Number of Random Numbers box. For each sample, the sample values would be 
in columns A through AD and the sample means would be in column AE. 



MINITAB 

1 . Click Calc. Select Random Data. Click Poisson. Enter 200 into the Number of rows of data to generate box. 

Enter C1-C5 (the destination columns) into the Store in columns box. Enter 4.0 into the Mean box. Click OK. The 
200 simulated samples will be located in columns C1-C5, with each row constituting a separate sample. 

2. Determine the sample means: Click Calc. Click Row Statistics. Select Mean. Enter C1-C5 into the Input Variables 
box. Enter C6 into the Store result in box. Click OK. Enter Means as the label of column C6. 

3. Generate the histogram for the 200 sample means: Click Graph. Click Histogram. Select Simple and Click OK. Enter 
C6 into the Graph variables box. Click OK. 

4. For samples with n = 30, the procedure for generating 200 samples, their means, and the histogram of the sample 
means will be similar to the steps shown above. For example, the destination in step 1 will be columns C1-C30, and 
the means will be located in column C31. Note that users of the Minitab Student Edition are limited to a worksheet 
size of 5000 data values and must generate fewer samples (e.g., 160) when n = 30. 



sample of observations from a Poisson probability distribution with a mean of 4. 
In part (a) of Figure 8.5, it is evident that the Poisson distribution from which the 
simulated samples are to be taken is neither continuous nor symmetrical. Part (b) of 
Figure 8.5 shows the distribution of sample means when Excel generated 200 simu- 
lated samples, each with a sample size of n = 5. In part (c) of Figure 8.5, each of the 
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FIGURE 8.5 



Although the population is 
a Poisson distribution with a 
mean of 4.0, the sampling 
distribution of the mean still 
approaches the normal 
distribution as the Excel- 
simulated samples become 
larger. The simulations rep- 
resented in parts (b) and (c) 
were generated using the 
Excel procedures in 
Computer Solutions 8.1 . 
The Minitab results would 
be similar. 



(a) The population from which the Excel-simulated samples were drawn 
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(b) Distribution of means for 200 simulated samples, each with n 
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(c) Distribution of means for 200 simulated samples, each with n = 30 
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200 simulated samples is larger, with n = 30. Note that the distribution of sample 
means not only becomes narrower, but it also more closely resembles a normal dis- 
tribution. 

The probability distribution in part (a) of Figure 8.5 has a mean of 4.0. Be- 
cause it is a Poisson distribution, its variance is also 4.0, and its standard deviation 
is the positive square root of the variance, or 2.0. According to the central limit 
theorem, samples of n = 30 from this population should have a mean of 4.0 and 
a standard error of cr/wn = 2.0/V30 = 0.365. For the 200 samples described in 
part (c) of Figure 8.5, the mean was 4.009 and the standard error was 0.3759, 
both very close to the values predicted by the theorem. 




Note: These exercises require Excel, Minitab, or other 
statistical software capable of simulating simple random 
samples from a population or probability distribution. 

8.38 Using the procedures in Computer Solutions 8.1 as 
a general guide, simulate 150 simple random samples of 
size n = 2 from an exponential distribution with a mean 
of 6. Determine the mean and the standard deviation 
of the 150 sample means, then compare these with 
their theoretically expected values. Given the nature of 
the population distribution, would we expect the distri- 
bution of sample means to be at least approximately 
normally distributed? Generate a histogram of the 
sample means and comment on the shape of the 
distribution. 



8.39 Repeat Exercise 8.38, but with n = 30 observations 
in each sample. 

8.40 Using the procedures in Computer Solutions 8.1 as a 
general guide, simulate 150 simple random samples of size 
n = 4 from a normal distribution with a mean of 80 and a 
standard deviation of 20. Determine the mean and the stan- 
dard deviation of the 150 sample means, then compare 
these with their theoretically expected values. Given the 
nature of the population distribution, would we expect the 
distribution of sample means to be at least approximately 
normally distributed? Generate a histogram of the sample 
means and comment on the shape of the distribution. 

8.41 Repeat Exercise 8.40, but with n = 30 observations 
in each sample. 



SUMMARY 8.7 



• The sample mean or sample proportion considered as a random variable 

The two preceding chapters dealt with populations representing discrete and 
continuous random variables, and with the probability distributions for these 
variables. The purpose of this chapter is to examine how sample means and pro- 
portions from a population will tend to be distributed, and to lay the foundation 
for the inferential statistics of the chapters that follow. Inferential statistics in- 
volves the use of sample information to draw conclusions about the population 
from which the sample was drawn. 

• Sampling distribution of the sample mean or the sample proportion 

If we were to take a great many samples of a given size from the same population, 
the result would be a great many different values for the sample mean or sample 
proportion. Such a process involves what is referred to as a sampling distribution 
of the sample mean or the sample proportion, and the sample mean or proportion 
is the random variable of interest. The standard deviation of the sampling distri- 
bution of the mean or proportion is referred to as the standard error of the 
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sampling distribution. As the sample size becomes larger, the standard error of the 
sampling distribution will decrease. 

• The central limit theorem 

If a population is normally distributed, the sampling distribution of the mean will 
also be normal. Regardless of how the population values are distributed, the central 
limit theorem holds that the sampling distribution of the mean will be approxi- 
mately normal for large samples (e.g., n > 30) and will more closely approach the 
normal distribution as the sample size increases. Whenever both hit and n(l — it) 
are & 5, the normal distribution can be used to approximate the binomial, and the 
sampling distribution of the proportion will be approximately normally distributed, 
with the approximation improving for larger values of n and for values of tt that are 
closer to 0.5. 

Just as we can use the z-score to convert a normal distribution to the standard 
normal distribution, the same approach can be used to convert a normal sampling 
distribution to the standard normal distribution. In the process, the z-score can be 
used to determine probabilities associated with the sample mean or the sample 
proportion. 

• When the sample is large compared to the population 

When sampling is without replacement and from a finite population, calculation 
of the standard error of the sampling distribution may require multiplying by a 
correction term called the finite population correction factor. The purpose of the 
correction is to reduce the value of the standard error according to how large the 
sample is compared to the population from which it was drawn. As a rule of 
thumb, the finite population correction is made whenever the sample is at least 
l/20th the size of the population. 

• Using the computer to demonstrate the central limit theorem 

Some computer statistical packages can generate simple random samples from a 
specified distribution. The chapter uses Excel and Minitab to demonstrate the 
central limit theorem by showing how the distribution of means of simulated 
samples from a nonnormal population tends to approach the normal distribution 
as the sample size is increased. 



equations 



iMtf r<* 



Sampling Distribution of the Mean: Simple Random 
Samples from a Normally Distributed Population 

• Regardless of the sample size, the sampling distribution of the mean will be normally 
distributed, with 

• Mean: 



E(x ) = Ms = M 



Standard error: 



where fi 



a 

n - 



population mean 

population standard deviation 

sample size 
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• z-score for the sampling distribution of the mean: 

z = where z = distance from the mean, measured 

* in standard error units 

x = value of the sample mean in which 
we are interested 
cr- = standard error of the sampling 

distribution of the mean, or cr/Vw 

• According to the central limit theorem, the sampling distribution of the mean will be 
approximately normal even when the samples are drawn from a nonnormal popula- 
tion, provided that the sample size is sufficiently large. It is common practice to apply 
the above formulas when n > 30. 

Sampling Distribution of the Proportion, p 

• Mean = E(p) = tt 



Standard error = av = \ — where tt = population proportion 

V fi i 

n = sample size 

z-score for a given value of p: 

z = — where p = sample proportion value of interest 



Standard Error of the Sampling Distribution: Sampling 
without Replacement from a Finite Population 

• Standard error for the sample mean: 

a . IN - n 

°V — ~r ' V 77 7 where n = sample size 
V w V N — 1 . 

N = population size 

<j = population standard deviation 

• Standard error for the sample proportion: 



77(1 — tt) ^ ^ N — n where n = sample size 

N — 1 N = population size 

tt = population proportion 



chapter exercises 



8.42 The campaign manager for a political candidate 
claims that 55% of registered voters favor the candidate 
over her strongest opponent. Assuming that this claim is 
true, what is the probability that in a simple random sam- 
ple of 300 voters, at least 60% would favor the candidate 
over her strongest opponent? 

8.43 For the situation described in Exercise 8.42, suppose 
that only 49% of the registered voters in the sample favor 
the candidate over her strongest opponent. Given this result: 
a. If the campaign manager's claim is correct, what is 

the probability that the sample proportion would be 
no more than 0.49 for a sample of this size? 



b. Based on your answer to part (a), speculate on whether 
the campaign manager's claim might be mistaken. 

8.44 Given the following probability distribution for an 
infinite population with the discrete random variable, x: 



x 

P(x) 



0 

0.2 



I 

0.1 



2 

0.3 



3 

0.4 



Determine the mean and the standard deviation 
of x. 

For the sample size n = 2, determine the mean for 
each possible simple random sample from this 
population. 
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c. For each simple random sample identified in part (b), 
what is the probability that this particular sample 
will be selected? 

d. Combining the results of parts (b) and (c), describe 
the sampling distribution of the mean. 

8.45 In 1999, the average fee paid by H&R Block tax 
preparation customers in Canada was $55.93. Assume 
that the standard deviation of fees was $20 but that we 
have no idea regarding the shape of the population distri- 
bution, source: H&R Block, Inc., 1 999 Annual Report, p. 39. 

a. What additional assumption about the population 
would be needed in order to use the standard normal 
table in determining the probability that the mean fee 
for a simple random sample of 5 customers was less 
than $50? 

b. What is the probability that the mean fee for a simple 
random sample of 36 customers was less than $50? 
What role does the central limit theorem play in 
making it possible to determine this probability? 

8.46 Based on past experience, a telemarketing firm has 
found that calls to prospective customers take an 
average of 2.0 minutes, with a standard deviation of 
1.5 minutes. The distribution is positively skewed, since 
persons who actually become customers require more of 
the caller's time than those who are not home, who 
simply hang up, or who say they're not interested. 
Albert has been given a quota of 220 calls for 
tomorrow, and he works an 8-hour day. Assuming that 
his list represents a simple random sample of those 
persons who could be called, what is the probability 
that Albert will meet or exceed his quota? 

8.47 When a production machine is properly calibrated, 
it requires an average of 25 seconds per unit produced, 
with a standard deviation of 3 seconds. For a simple ran- 
dom sample of n = 36 units, the sample mean is found to 
be x = 26.2 seconds per unit. 

a. What z-score corresponds to the sample mean of 
x = 26.2 seconds? 

b. When the machine is properly calibrated, what is the 
probability that the mean for a simple random sam- 
ple of this size will be at least 26.2 seconds? 

c. Based on the probability determined in part (b), does 
it seem likely that the machine is properly calibrated? 
Explain your reasoning. 

8.48 For a given sample size, the standard error of 
the sample mean is 10 grams. With other factors 
unchanged, to what extent must the sample size be 
increased if the standard error of the sample mean is 
to be just 5 grams? 

8.49 Employees at a corporate headquarters own an 
average of 240 shares of the company's common stock, 
with a standard deviation of 40 shares. For a simple ran- 
dom sample consisting of 5 employees, what assumption 
would be required if we are to use the standard normal 
distribution in determining P(x S 220)? 



8.50 The manufacturer of a quartz travel alarm clock 
claims that, on the average, its clocks deviate from perfect 
time by 30 seconds per month, with a standard deviation 
of 10 seconds. Engineers from a consumer magazine pur- 
chase 40 of the clocks and find that the average clock in 
the sample deviated from perfect accuracy by 34 seconds 
in one month. 

a. If the manufacturer's claim is correct (i.e., /jl = 30 
seconds, cr = 10 seconds), what is the probability 
that the average deviation from perfect accuracy 
would be 34 seconds or more? 

b. Based on your answer to part (a), speculate on the 
possibility that the manufacturer's claim might not be 
correct. 

8.51 Based on past experience, 20% of the contacts 
made by a firm's sales representatives result in a sale 
being made. Charlie has contacted 100 potential 
customers but has made only 10 sales. Assume that 
Charlie's contacts represent a simple random sample 
of those who could have been called upon. Given this 
information: 

a. What is the sample proportion, p = proportion of 
contacts that resulted in a sale being made? 

b. For simple random samples of this size, what is the 
probability that p < 0.10? 

c. Based on your answer to (b), would you tend to 
accept Charlie's explanation that he "just had a 
bad day"? 

8.52 As a project for their high school mathematics class, 
four class members purchase packages of a leading break- 
fast cereal for subsequent weighing and comparison with 
the 12-ounce label weight. The students are surprised 
when they weigh the contents of their four packages and 
find the average weight to be only 11.5 ounces. When 
they write to the company, its consumer relations repre- 
sentative explains that "the contents of the '12-ounce' 
packages average 12.2 ounces and have a standard 
deviation of 0.4 ounces. The amount of cereal will vary 
from one box to the next, and your sample is very 

small — this explains why your average weight could be 
just 11.5 ounces." If the spokesperson's information is 
correct, how unusual would it be for these students to get 
a sample mean as small as the one they obtained? Be sure 
to state any assumptions you have made in obtaining 
your answer. 

8.53 Of 500 vehicles sold by an auto dealership last 
month, 200 have a defect that will require a factory 
recall for dealer correction. For a simple random 
sample of n = 50 of last month's customers, what is 
the probability that no more than 15 will be affected by 
the recall? 

8.54 Seventy percent of the registered voters in a large 
community are Democrats. The mayor will be selecting a 
simple random sample of 20 registered voters to serve on 
an advisory panel that helps formulate policy for the 
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community's parks and recreation facilities. What is the 
probability that the constituency of the committee will be 
no more than 50% Democrats? 

8.55 In the 2000 census, the so-called "long form" 
received by one of every six households contained 
52 questions, ranging from your occupation and 
income all the way to whether you had a bathtub. 
According to the U.S. Census Bureau, the mean 
completion time for the long form is 38 minutes. 
Assuming a standard deviation of 5 minutes and a 
simple random sample of 50 persons who filled out 
the long form, what is the probability that their 
average time for completion of the form was more than 
45 minutes? source: Haya El Nasser, "Census Forms Can Be Filed 
by Computer," USA Today, February 10, 2000, p. 4A. 

8.56 In performing research on extrasensory perception, 
researchers sometimes place two individuals in different 
locations, then have person A (the "sender") view each 
item in a series of stimuli while person B (the "receiver") 
tries to correctly identify the image person A is viewing. 
In carrying out such a study, a researcher enlists the help 
of Amy and Donald as volunteers to "send" and 
"receive," respectively. The researcher randomly shows 
Amy one of two pictures — either a sunny sky or a stormy 
sky — and this procedure is carried out for a total of 200 
exposures at 20-second intervals. If Donald is simply 
guessing, he will tend to be correct 50% of the time. In 
this particular study, Donald was successful in 108 out of 
200 attempts to identify which picture Amy was viewing. 
If someone has absolutely no extrasensory capabilities 
and is simply guessing, what is the probability of guessing 
correctly on at least 108 out of 200 attempts? Given the 



probability you've calculated, would you suggest that 
Donald drop his plan to list himself in the yellow pages 
as a psychic phenomenon? 

8.57 A lighting vendor has described its incandescent 
bulbs as having normally distributed lifetimes with a 
mean of 2000 hours and a standard deviation of 
100 hours. The vendor is faced with an especially 
demanding industrial customer who insists on conducting 
a longevity test before committing to a very large order. 
The customer's purchasing department plans to test a 
simple random sample of 16 bulbs and to place the large 
order only if the sample mean is at least 2050 hours. 
Given the longevity of the vendor's bulbs and the test the 
potential customer will be carrying out, what is the prob- 
ability that the vendor will get the contract? 

8.58 The overall pass rate for law school graduates 
taking the Maryland bar exam has been reported as 
75%. Assume that a certain Maryland law school has 
had 400 of its most recent graduates take the Maryland 
bar exam, but only 60% passed. When asked about 
these results, the dean of this university's law school 
claims his graduates are just as good as others who have 
taken the Maryland bar exam, and the low pass rate for 
the recent graduates was just "one of those statistical 
fluctuations that happen all the time." If this university's 
overall pass rate for the Maryland bar exam were really 
75%, what would be the probability of a simple random 
sample of 400 having a pass rate of 60% or less? Based 
on this probability, comment on the dean's explanation. 
source: U.S. News & World Report, Best Graduate Schools, 2001 Edition, 
p. 51. 




Thorndike Sports Equipment 

As he does on the first business day of each month, Ted has 
just opened up the suggestion box that Luke placed in the 
plant to attract ideas from the employees. Along with the 
usual monthly suggestions that old Luke attempt activities 
that are either inappropriate for his age or physiologically 
impossible, Ted finds a message from a production worker 
who is very concerned about the company's racquetball 
racquet machine: 

Dear Thorndike management: 

I am getting very worried about the racquetball 
racquet machine. Some of the older workers say it was 



here 15 years ago when they started, and I think it is 
just about worn out. I know new machines are expen- 
sive, so I thought I would collect some numbers that 
might help you out in making a decision to replace it. 

I know that the weight of the racquets coming off 
the machine is a big deal, since all of the racquets are 
weighed and then you decide which ones will be 
"Light," "Regular," and whatever you've decided to 
call the heavier model. Anyway, I've been doing some 
research. On my own time, honest. From time to time 
during the day, I take the 30 racquets that have just 
come from the machine and weigh them myself. 



You've told us that some racquets weigh more 
than others, so I can understand that one racquet 
might weigh 245 grams, the very next one might 
weigh 230 grams, and then the next one off the line 
might be all the way up to 250 grams. Like I said be- 
fore, I think the machine is worn out. The first 30 rac- 
quets I weighed yesterday weighed an average of 
236.5 grams, the next 30 weighed an average of 243.1 
grams. I think the machine is going crazy, and you 
should either get it fixed or replace it. 

My friends would get pretty upset if they knew I 
wrote this suggestion, since buying a brand-new ma- 
chine could mean that nobody gets a Christmas bonus 
this year. But I'm a young guy and I'd like for you to 
be successful so that I can have some job security and 
take care of my family. If you end up having a problem 
with the weights of your racquets, I might end up 
being out of a job. 

I'd like to find out if maybe I'm missing something 
here, and if the machine really is OK or not. So my 
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friends don't find out about this suggestion, would 
you please put together a reply and duct-tape it to the 
underside of the pay phone next to the candy ma- 
chine? I'll plan on picking it up after the first work 
shift on the 15th of next month. 

Discarding the suggestions that Luke would probably 
find offensive, Ted mentions the young worker's letter re- 
garding the racquetball racquet machine. Luke asks if the 
machine is still working the same as when the decision was 
made to offer racquets in three different weight categories. 
Ted replies that it seems to be. The average weight is still 
240 grams, the standard deviation is still 10 grams, al- 
though he isn't too sure whether the weights are still nor- 
mally distributed. Luke asks Ted to generate a reply to the 
worker's suggestion and tape it to the underside of the pay 
phone as requested. 
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Distribution of Means: Fair Dice 

This applet simulates the rolling of dice when the dice are 
fair — i.e., for each die, the outcomes (integers 1 through 6) 
are equally likely. You can select the size of each sample 
(n = 1 , 3, or 1 2 dice at a time), and the applet will update 
the relative frequencies with which the sample means have 
occurred. The three versions of this applet are shown and 
described here. 

SAMPLE SIZE = 1 

Each time you click the Roll One Set button, you are rolling 
a single die. N shows the number of samples you have taken 
so far, and the chart shows the relative frequency with which 
each of the possible sample means has occurred. Naturally, 
since each sample is only n = 1 , the result in each sample is 
also the mean of each sample. You can make things happen 
much faster by clicking either the Roll 10 Sets button or the 
Roll 100 Sets button. In the latter cases, you are shown the 
means for the 10 most recent samples. 



Sample Size = 1 
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SAMPLE SIZE = 3 

Each time you click the Roll One Set button, you are now 
rolling three dice. N shows the number of samples (or sets) 
you have taken so far, and the chart is updated to show the 
relative frequency with which each of the possible sample 
means has occurred. 

SAMPLE SIZE = 12 

Each time you click the Roll One Set button, you are rolling 
twelve dice. The procedure is the same as above, except each 
sample mean is now the mean for a set of 1 2 dice. 

Sample Size = 12 
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Applet Exercises 

7.1 Generate 3000 rolls of a single die: Select the Sample 
Size = 1 applet version, and click the Roll 100 Sets button 
30 times. Describe the shape of the distribution of the sam- 
ple results in the relative frequencies chart. Is the shape sim- 
ilar to what you might expect? 

7.2 Generate 3000 samples, each one representing a set of 
three dice: Select the Sample Size = 3 applet version and 
click the Roll 100 Sets button 30 times. Comment on how 
the distribution of sample means has changed from the dis- 
tribution obtained in Applet Exercise 7.1 . 

7.3 Generate 3000 samples, each one representing a set of 
twelve dice: Select the Sample Size = 12 applet version 
and click the Roll 100 Sets button 30 times. How has the 
increase in sample size affected the shape of the distribu- 
tion of sample means compared to those obtained in 
Applet Exercises 7.1 and 7.2? 

7.4 If there were an additional applet version that allowed 
each sample to consist of 100 dice, and we generated 
2000 of these samples, what do you think the distribution 
of sample means would tend to look like? In general, as the 
sample sizes become larger, what shape will the distribu- 
tion of sample means tend to approach? 
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Distribution of Means: Loaded Dice 

This applet simulates the rolling of loaded dice — i.e., for 
each die, the outcomes (integers 1 through 6) are most def- 
initely NOT equally likely, and you should never gamble 
with people who use these things. As with the "honest" 
dice in Applet 7, you can select the size of each sample 
(n = 1, 3, or 12 dice at a time), and the applet will update 
the relative frequencies with which the sample means have 
occurred. The same features and procedures apply as in 
Applet 7, but you will find the results to be quite different. 
Examples of the three versions of the "loaded" applet are 
shown here: 
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Applet Exercises 

8.1 Generate 3000 rolls of a single die: Select the "Sample 
Size = 1" applet version, and click the "Roll 100 Sets" 

button 30 times. Describe the shape of the distribution 
of the sample results in the relative frequencies chart. Does 
the shape look a little peculiar? Can you tell which faces of 
the dice might be "loaded"? 

8.2 Generate 3000 samples, each one representing a set of 
three dice: Select the "Sample Size = 3" applet version, 
and click the "Roll 100 Sets" button 30 times. Comment 
on how the distribution of sample means has changed 
from the distribution obtained in Applet Exercise 8.1 . 

8.3 Generate 3000 samples, each one representing a set of 
twelve dice: Select the "Sample Size = 12" applet version, 
and click the "Roll 100 Sets" button 30 times. How has 
the increase in sample size affected the shape of the distri- 
bution of sample means compared to those obtained in 
Applet Exercises 8.1 and 8.2? 

8.4 If there were an additional applet version that allowed 
each sample to consist of 100 dice, and we generated 
2000 of these samples, what do you think the distribution 
of sample means would tend to look like? In general, as the 
sample sizes become larger, what shape will the distribu- 
tion of sample means tend to approach? 



Work Sampling 

In the workplace, production experts sometimes conduct studies to find out how 
much time workers spend doing various job activities. This information can be used in 
establishing or updating standards for performance, as well as for comparing and 
evaluating worker performance. Among the approaches for determining how much 
time a worker spends at various activities is a technique known as work sampling. 
Compared to alternative methods (such as following the person and timing him or 
her with a stopwatch), work sampling is unobtrusive in that the behavior observed 
is not influenced by the observation process itself. 

In work sampling, a worker is observed at randomly selected points along an 
interval of time; then the proportion of these observations that involve each selected 
activity is determined. For example, if we want to determine how much of the time a 
secretary spends keyboarding, we can observe the person at random times during a 
typical day or week, then calculate the proportion of time that he or she happens to 
be keyboarding. If the person were observed to be keyboarding in 1 00 
of 280 random observations, the sample proportion 

sample da« 



would be 100/280, or 0.357. 

Using this information, along 
with estimation techniques 
described later in this chapter, we 
could then arrive at an interval 
estimate reflecting the likely range of 
values within which the true 
population proportion lies. When you 
finish Section 9.6, you might like to pay 
a short visit back here and verify that 
the 90% confidence interval for the pop- 
ulation proportion is from 0.310 to 
0.404, and that we are 90% confident 
that the person spends somewhere 
between 31 .0% and 40.4% of his or her 
time clicking away at the keyboard. 
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learning 
objectives 

After reading this 
chapter, you should 
be able to: 



Explain the difference between a point estimate and an interval estimate for a 
population parameter. 

Use the standard normal distribution in constructing a confidence interval for a 
population mean or proportion. 

Use the t distribution in constructing a confidence interval for a population 
mean. 

Decide whether the standard normal distribution or the t distribution should 
be used in constructing a given confidence interval. 

Determine how large a simple random sample must be in order to estimate a 
population mean or proportion at specified levels of accuracy and confidence. 



• Use Excel and Minitab to construct confidence intervals. 



9.1 INTRODUCTION 



In Chapter 8, we began with a population having a known mean (/jl) or propor- 
tion (it); then we examined the sampling distribution of the corresponding sam- 
ple statistic (x or p) for samples of a given size. In this chapter, we'll be going 
in the opposite direction — based on sample data, we will be making estimates 
involving the (unknown) value of the population mean or proportion. As men- 
tioned previously, the use of sample information to draw conclusions about the 
population is known as inferential statistics. 

To repeat a very important point, this chapter completes a key transition dis- 
cussed at the beginning of Chapter 7: 

• In Chapter 7, we had access to the population mean and we made probability 
statements about individual x values taken from the population. 

• In Chapter 8, we again had access to the population mean, but we invoked the 
central limit theorem and began making probability statements about the 
means of samples taken from the population. (Beginning with Chapter 8, the 
sample mean itself is considered as a random variable.) 

• In this chapter, we again lack access to the population mean, but we will begin 
using sample data as the basis from which to make probability statements 
about the true (but unknown) value of the population mean. As in Chapter 8, 
we will be relying heavily on the central limit theorem. 

In the following sections, we will use sample data to make both point and 
interval estimates regarding the population mean or proportion. While the 
point estimate is a single number that estimates the exact value of the popula- 
tion parameter of interest (e.g., fi or tt), an interval estimate includes a range 
of possible values that are likely to include the actual population parameter. 
When the interval estimate is associated with a degree of confidence that it 
actually includes the population parameter, it is referred to as a confidence 
interval. 

Point and interval estimates can also be made regarding the difference between 
two population means — /jl 2 ) or proportions (ir l — tt 2 ). These involve data from 
two samples, and they will be discussed in the context of the hypothesis-testing 
procedures of Chapter 11. 
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Whenever sample data are used for estimating a population mean or propor- 
tion, sampling error will tend to be present because a sample has been taken 
instead of a census. As a result, the observed sample statistic (x or p) will differ 
from the actual value of the population parameter (/x or it). Assuming a simple 
random sampling of elements from the population, formulas will be presented for 
determining how large a sample size is necessary to ensure that such sampling 
error is not likely to exceed a given amount. 



7_ t.' >> 



9.1 Differentiate between a point estimate and an interval 9.3 What is necessary for an interval estimate to be a 
estimate for a population parameter. confidence interval? 

9.2 What is meant by inferential statistics, and what role 
does it play in estimation? 




POINT ESTIMATES 9.2 



An important consideration in choosing a sample statistic as a point estimate of 
the value of a population parameter is that the sample statistic be an unbiased es- 
timator. An estimator is unbiased if the expected value of the sample statistic is 
the same as the actual value of the population parameter it is intended to estimate. 
Three important point estimators introduced in the chapter are those for a popu- 
lation mean (//,), a population variance (a 2 ), and a population proportion (tt). 

As Chapter 8 showed, the expected value of the sample mean is the population 
mean, and the expected value of the sample proportion is the population propor- 
tion. As a result, x and p are unbiased estimators of fi and tt, respectively. Table 9.1 
presents a review of the applicable formulas. Note that the divisor in the formula 
for the sample variance (s 2 ) is (n — 1). Using (n — 1) as the divisor in calculating the 
variance of the sample results in s 2 being an unbiased estimate of the (unknown) 
population variance, a 2 . The positive square root of s 2 , the sample standard devia- 
tion (s), will not be an unbiased estimate of the population standard deviation (a). 
In practice, however, s is the most frequently used estimator of its population coun- 
terpart, cr. 





Unbiased 




TABLE 9.1 1 


Population Parameter 


Estimator 


Formula 


An estimator is unbiased if 
its expected value is the 
same as the actual value of 
the corresponding popula- 
tion parameter. Listed here 


Mean, /x 


X 


x = ^ 
n 


Variance, a 2 


s 2 


j _ E(x,- - x) 2 

n - 1 

x successes 

P = —, — 

n trials 


are unbiased point estima- 
tors for a population mean, 


Proportion, it 


p 


a population variance, and 
a population proportion. 
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9.4 What is meant when a sample statistic is said to be 
an unbiased estimator} 

9.5 When calculating the sample variance, what procedure 
is necessary to ensure that s 2 will be an unbiased estimator 
of cr 2 ? Will s be an unbiased estimator of tr? 

9.6 During the month of July, an auto manufacturer gives 
its production employees a vacation period so it can tool 
up for the new model run. In surveying a simple random 
sample of 200 production workers, the personnel director 
finds that 38% of them plan to vacation out of state for at 
least one week during this period. Is this a point estimate 
or an interval estimate? Explain. 

9.7 A simple random sample of 8 employees is selected 
from a large firm. For the 8 employees, the number of 



days each was absent during the past month was found to 
be 0, 2, 4, 2, 1, 7, 3, and 2, respectively. 

a. What is the point estimate for /x, the mean number of 
days absent for the firm's employees? 

b. What is the point estimate for o 2 , the variance of the 
number of days absent? 

9.8 The average annual U.S. per capita consumption of 
iceberg lettuce has been estimated as 24.3 pounds. The 
annual per capita consumption 2 years earlier had been 
estimated as 21.6 pounds. Could either or both of these 
consumption figures be considered a point estimate? 
Could the difference between the two consumption 
figures be considered an interval estimate? Explain your 
reasoning in both cases, source: Bureau of the Census, Statistical 
Abstract of the United States 2002, p. 130. 



9.3 A PREVIEW OF INTERVAL ESTIMATES 



When we know the values of the population mean and standard deviation, we can 
(if either the population is normally distributed or n is large) use the standard 
normal distribution in determining the proportion of sample means that will fall 
within a given number of standard error (cr-) units of the known population 
mean. This is exactly what we did in Chapter 8. 

It is typical of inferential statistics that we must use the mean (x) and stan- 
dard deviation (s) of a single sample as our best estimates of the (unknown) 
values of /a and u. However, this does not prevent us from employing x and s in 
constructing an estimated sampling distribution for all means having this sample 
size. This is the basis for the construction of an interval estimate for the popula- 
tion mean. 

When we apply the techniques of this chapter and establish the sample 
mean as the midpoint of an interval estimate for the population mean, the 
resulting interval may or may not include the actual value of /jl. For example, 
in Figure 9.1, six of the seven simple random samples from the same popula- 
tion led to an interval estimate that included the true value of the population 
mean. 

In Figure 9.1, the mean of sample number 1 (x^) is slightly greater than the 
population mean (ju), and the interval estimate based on this sample actually in- 
cludes fi. For sample 3, taken from the same population, the estimation interval 
does not include /x. In Figure 9.1, we can make these observations because the 
value of /u. is known. In practice, however, we will not have the benefit of know- 
ing the actual value of the population mean. Therefore, we will not be able to say 
with complete certainty that an interval based on our sample result will actually 
include the (unknown) value of /jl. 
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Sampling distribution of 
the mean for all possible 
simple random samples 
with a given sample size 



Does this interval 
include the actual 
population mean, /x? 



Yes 



15.0 



11.0 



Yes 



No 



Yes 



Yes 



Yes 



Yes 



A%= M = 20.0 



18.0 
I 

*2 



22.0 
I 



FIGURE 9.1 



Examples of seven different 
interval estimates for a pop- 
ulation mean, with each in- 
terval based on a separate 
simple random sample from 
the population. Six of the 
seven interval estimates in- 
clude the actual value of /j,. 



10.0 



17.0 

I 



8.0 



15.0 

I 



19.0 



14.5 



29.0 



25.0 



23.5 



24.0 



30.5 
I 

*3 



37.5 



22.0 



26.0 
L_ 



33.0 



21.5 

I 



28.5 



fii = fi= 20.0 



The interval estimate for the mean simply describes a range of values that is 
likely to include the actual population mean. This is also the case for our use of 
the sample proportion (p) to estimate the population proportion (ir), as well as 
for our construction of an interval estimate within which the actual value of tt is 
likely to fall. 

The following terms are of great importance in interval estimation: 

INTERVAL ESTIMATE A range of values within which the actual value of the 
population parameter may fall. 

INTERVAL LIMITS The lower and upper values of the interval estimate. 



(continued) 
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CONFIDENCE INTERVAL An interval estimate for which there is a specified de- 
gree of certainty that the actual value of the population parameter will fall within 
the interval. 

CONFIDENCE COEFFICIENT For a confidence interval, the proportion of such in- 
tervals that would include the population parameter if the process leading to the 
interval were repeated a great many times. 

CONFIDENCE LEVEL Like the confidence coefficient, this expresses the degree 
of certainty that an interval will include the actual value of the population param- 
eter, but it is stated as a percentage. For example, a 0.95 confidence coefficient is 
equivalent to a 95% confidence level. 

ACCURACY The difference between the observed sample statistic and the actual 
value of the population parameter being estimated. This may also be referred to as 
estimation error or sampling error. 



To illustrate these and several other terms discussed so far, we have provided 
their values in the following example, which is typical of published statistical find- 
ings. The methods by which the values were determined will become apparent in 
the sections to follow. 



example 




Interval Estimates 

"In our simple random sample of 2000 households, we found the average income 
to be x = $65,000, with a standard deviation, s = $12,000. Based on these data, 
we have 95% confidence that the population mean is somewhere between 
$64,474 and $65,526." 

SOLUTION 

• Point estimate of ft $65,000 

• Point estimate of a $12,000 

• Interval estimate of /t $64,474 to $65,526 

• Lower and upper interval limits for fi $64,474 and $65,526 

• Confidence coefficient 0.95 

• Confidence level 95% 

• Accuracy For 95% of such intervals, the sample mean would not differ from 
the actual population mean by more than $526. 



When constructing a confidence interval for the mean, a key consideration 
is whether we know the actual value of the population standard deviation (a). 
As Figure 9.2 shows, this will determine whether the normal distribution or 
the t distribution (see Section 9.5) will be used in determining the appropriate 
interval. Figure 9.2 also summarizes the procedure for constructing the confi- 
dence interval for the population proportion, a technique that will be discussed 
in Section 9.6. 
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FIGURE 9.2 



Population mean 



y 

a known 



Z-interval, 
with limits: 



x + z 



Section 9.4 
Note 1 



estimate for a 



Population proportion 



a unknown 



(-interval, 
with limits: 

*±,-L. 
in 
and 
df=n-\ 

Section 9.5 
Note 2 



z-interval, 
with limits: 



-P) 



Section 9.6 
Note 3 



1 If the population is not normally distributed, n should be at least 30 for the central limit 
theorem to apply. 

2 When c is unknown, but the underlying population can be assumed to be approximately 
normally distributed, use of the t distribution is a necessity when n < 30. Use of the t dis- 
tribution is also appropriate, however, when a is unknown and the sample sizes are larger. 
Most computer statistical packages routinely use the f-interval for all sample sizes when s is 
used to estimate a. 

3 Assumes that both np and n(l — p) are £ 5. The normal distribution as an approximation 
to the binomial improves as n becomes larger and for values of p that are closer to 0.5. 



This figure provides an 
overview of the methods 
for determining confidence 
interval estimates for a 
population mean or a 
population proportion 
and indicates the chapter 
section in which each is 
discussed. Key assumptions 
are reviewed in the figure 
notes. 




9.9 Exactly what is meant by the accuracy of a point 
estimate? 

9.10 A population is approximately normally distributed 
and the sample size is to be n = 40. What additional factor 
must be considered in determining whether to use the stan- 
dard normal distribution in constructing the confidence 
interval for the population mean? 

9.11 "In surveying a simple random sample of 1000 em- 
ployed adults, we found that 450 individuals felt they 



were underpaid by at least $3000. Based on these results, 
we have 95% confidence that the proportion of the popu- 
lation of employed adults who share this sentiment is 
between 0.419 and 0.481." For this summary statement, 
identify the 

a. point estimate of the population proportion. 

b. confidence interval estimate for the population 
proportion. 

c. confidence level and the confidence coefficient. 

d. accuracy of the sample result. 



CONFIDENCE INTERVAL ESTIMATES 9.4 
FOR THE MEAN: tr KNOWN 



If we don't know the value of the population mean, chances are that we also do 
not know the value of the population standard deviation. However, in some cases, 
usually industrial processes, a may be known while /x is not. If the population 
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cannot be assumed to be normally distributed, the sample size must be at least 30 
for the central limit theorem to apply. When these conditions are met, the confi- 
dence interval for the population mean will be as follows: 

Confidence interval limits for the population mean, cr known: 

sample mean 

population standard deviation 
sample size 

z value corresponding to the level of 
confidence desired (e.g., z = 1.96 
for the 95% confidence level) 
standard error of the sampling 
distribution of the mean 



z — 7= where x 



cr 

n 

z 



rlV^n 



N 


0 


T E 


S 



This application assumes that either 
distributed or (2) the sample size is n ~ 



1) the underlying population is normally 
: 30. Also, an alternative way of describing 



the z value is to refer to it as z a/2 , with a/2 being the area to the right. For exam- 
ple, z 0 would be 1.96. 





Example 







z-lnterval. Mean 

From past experience, the population standard deviation of rod diameters pro- 
duced by a machine has been found to be a = 0.053 inches. For a simple random 
sample of n = 30 rods, the average diameter is found to be x = 1.400 inches. The 
underlying data are in file CX09RODS. 

SOLUTION 

What Is the 95% Confidence Interval for the Population Mean, (i7 

Although we don't know the value of fi, x = 1.400 inches is our best estimate of 
the population mean diameter. As Figure 9.3 shows, the sampling distribution of 
the mean will have a standard error of cr/Vra, or 0.053/V30. For the standard 
normal distribution, 95% of the area will fall between z = —1.96 and z = +1.96. 
We are able to use the standard normal distribution table because n > 30 and the 
central limit theorem can be invoked. As a result, the 95% confidence interval for 
the (unknown) population mean can be calculated as 

x ± zA= = 1.400 ± 1.96^^ or between 1.381 and 1.419 inches 
Vw V30 

Figure 9.3 shows the midpoint (x = 1.400 inches) for the 95% confidence inter- 
val for the mean, along with the lower and upper limits for the confidence inter- 
val. Based on our calculations, the 95% confidence interval for the population 
mean is from 1.381 to 1.419 inches. 
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Normal distribution: 
for 95% confidence, 
zwill be ±1.96. 
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The 95% confidence interval for /jl is 
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1.381 




1.400 









-in 



1.400+1.96 

V30 
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FIGURE 9.3 



Construction of the 95% 
confidence interval for the 
population mean, based on 
a sample of 30 rods for 
which the average diameter 
is 1 .400 inches. From past 
experience, the population 
standard deviation is known 
to be a = 0.053 inches. 
Because a is known, the 
normal distribution can be 
used in determining the 
interval limits. We have 
95% confidence that ft 
is between 1 .381 and 
1 .419 inches. 



More precisely, 95% of such intervals constructed in this way would include 
the population mean. Since we have taken only one sample and constructed just 
one interval, it is technically correct to say we have 95% confidence that this 
particular interval contains the population mean. Although the logic may be 
tempting, this is not the same as saying the probability is 0.95 that this particular 
interval will include the population mean. 

With other factors unchanged, a higher confidence level will require a wider 
confidence interval. Likewise, a lower confidence level will lead to a narrower 
confidence interval. In other words, the more certain we wish to be that the inter- 
val estimate contains the population parameter, the wider the interval will have to 
be. Refer to Seeing Statistics Applet 9, at the end of the chapter, to visually 
demonstrate how the width of the confidence interval changes when higher and 
lower levels of confidence are specified. 



Computer Solutions 9.1 shows how we can use Excel or Minitab to generate 
a confidence interval for the mean when the population standard deviation is 
known or assumed. In this case, we are replicating the 95% confidence interval 
shown in Figure 9.3, and the 30 data values are in file CX09RODS. If we use an 
Excel procedure based on summary statistics, it can be interesting to examine 
"what-if" scenarios to instantly see how changes in the specified confidence level 
would change the width of the confidence interval. 
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computer solutions 9.1 




Confidence Interval for Population Mean, o- Known 

These procedures show how to construct a confidence interval for the population mean when the 
population standard deviation is known. 



EXCEL 





A 


B 


C 


1 


z-Estimate: Mean 




2 








3 






diameter 


4 


Mean 




1.400 


5 


Standard Deviation 


0.052 


6 


Observations 


30 


7 


SIGMA 




0.053 


8 


LCL 




1.381 


9 


UCL 




1.419 



Excel confidence interval for n based on raw data and <r known 

1 . For example, using the 30 rod diameters (file CX09RODS.XLS) on which Figure 9.3 is based: The label and 30 data val- 
ues are in A1:A31. Click Tools. Click Data Analysis Plus. Click Z-Estimate: Mean. Click OK. 

2. Enter A1:A31 into the Input Range box. Enter the known population standard deviation (0.053) into the Standard 
Deviation (SIGMA) box. Click Labels, since the variable name is in the first cell within the field. The desired confi- 
dence level as a decimal fraction is 0.95, so the corresponding alpha value is 1 - 0.95 = 0.05. Enter 0.05 into the 
Alpha box. Click OK. The confidence interval results will be as shown above. 

Excel confidence interval for fi based on summary statistics and tr known 

1 . For example, withx = 1 .400, o- = 0.053, and n = 30, as in Figure 9.3: Open the ESTlMATORS.XLS workbook, supplied 
with the text. 

2. Using the arrows at the bottom left, select the z-Estimate_Mean worksheet. Enter the sample mean (1.4), the known 
sigma (0.053), the sample size (30), and the desired confidence level as a decimal fraction (0.95). 

(Note: As an alternative, you can use Excel worksheet template TMZINT.XLS, supplied with the text. The steps are described 
within the template.) 

MINI TAB 

Confidence interval for fi based on raw data and <r known 
One-Sample Z: diameter 



The assumed standard deviation = 0.053 



Variable N Mean StDev SE Mean 95% CI 

diameter 30 1.40000 0.05196 0.00968 (1.38103, 1.41897) 

1 . For example, using the data (file CX09RODS.MTW) on which Figure 9.3 is based, with the 30 data values in column C1 : 
Click Stat. Select Basic Statistics. Click 1-Sample Z. 

2. Select Samples in columns and enter C1 into the box. Enter the known population standard deviation (0.053) into 
the Standard deviation box. The Perform hypothesis test box should be left blank — we will not be doing hypoth- 
esis testing until the next chapter. 
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3. Click Options. Enter the desired confidence level as a percentage (95.0) into the Confidence Level box. Within the 
Alternative box, select not equal. Click OK. Click OK. The printout also includes the sample mean (1 .400), the known 
sigma (0.053), the sample size (30), and the standard error of the mean (calculated as 0.053/\/30 = 0.00968). Al- 
though the sample standard deviation is shown, it is not used in the construction of this confidence interval. 

Confidence interval for fi based on summary statistics and <r known 

Follow the procedure in the previous steps for raw data, but in step 2 select Summarized data and enter 30 into the 
Sample size box and 1.4 into the Mean box. 





9.12 What role does the central limit theorem play in the 
construction of a confidence interval for the population 
mean? 

9.13 In using the standard normal distribution to construct 
a confidence interval for the population mean, what two 
assumptions are necessary if the sample size is less than 30? 

9.14 The following data values are a simple random 
sample from a population that is normally distributed, 
with a 2 = 25.0: 47, 43, 33, 42, 34, and 41. Construct 
and interpret the 95% and 99% confidence intervals for 
the population mean. 

9.15 A simple random sample of 30 has been collected 
from a population for which it is known that a = 10.0. 
The sample mean has been calculated as 240.0. Construct 
and interpret the 90% and 95% confidence intervals for 
the population mean. 

9.16 A simple random sample of 25 has been collected 
from a normally distributed population for which it is 
known that cr = 17.0. The sample mean has been calcu- 
lated as 342.0, and the sample standard deviation is 

s = 14.9. Construct and interpret the 95% and 99% 
confidence intervals for the population mean. 

9.17 The administrator of a physical therapy facility has 
found that postoperative performance scores on a knee 
flexibility test have tended to follow a normal distribution 
with a standard deviation of 4. For a simple random sam- 
ple of ten patients who have recently had knee surgery, 
the scores are as follows: 101, 92, 94, 88, 52, 93, 76, 84, 
72, and 98. Construct and interpret the 90% and 95% 
confidence intervals for the population mean. 

9.18 In testing the heat resistance of electrical components, 
safety engineers for an appliance manufacturer routinely 
subject wiring connectors to a temperature of 450 degrees 
Fahrenheit, then record the amount of time it takes for 
the connector to melt and cause a short circuit. Past expe- 
rience has shown the standard deviation of failure times 



to be 6.4 seconds. In a simple random sample of 40 con- 
nectors from a very large production run, the mean time 
until failure was found to be 35.5 seconds. Construct and 
interpret the 99% confidence interval for /jl = the mean 
time until failure for all of the connectors from the 
production run. 

9.19 An assembly process includes a torque wrench 
device that automatically tightens compressor housing 
bolts; the device has a known process standard deviation 
of cr = 3 lb-ft in the torque applied. A simple random 
sample of 35 nuts is selected, and the average torque to 
which they have been tightened is 150 lb-ft. What is the 
95% confidence interval for the average torque being 
applied during the assembly process? 

9.20 A machine that stuffs a cheese-filled snack product 
can be adjusted for the amount of cheese injected into 
each unit. A simple random sample of 30 units is selected, 
and the average amount of cheese injected is found to 

be x = 3.5 grams. If the process standard deviation is 
known to be cr = 0.25 grams, construct the 95% confi- 
dence interval for /jl = the average amount of cheese 
being injected by the machine. 

9.21 In Exercise 9.20, if the sample size had been n = 5 
instead of n = 30, what assumption would have to be made 
about the population distribution of filling weights in order 
to use z values in constructing the confidence interval? 

/ data Set / Note: Exercises 9.22 and 9.23 require a 
computer and statistical software. 

9.22 For one of the tasks in a manufacturing process, the 
mean time for task completion has historically been 
35.0 minutes, with a standard deviation of 2.5 minutes. 
Workers have recently complained that the machinery used 
in the task is wearing out and slowing down. In response to 
the complaints, plant engineers have measured the time re- 
quired for a sample consisting of 100 task operations. The 
100 sample times, in minutes, are in data file XR09022. 
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Using the mean for this sample, and assuming that the pop- 
ulation standard deviation has remained unchanged at 2.5 
minutes, construct the 95% confidence interval for the pop- 
ulation mean. Is 35.0 minutes within the confidence inter- 
val? Interpret your "yes" or "no" answer in terms of 
whether the mean time for the task may have changed. 

9.23 Sheila Johnson, a state procurement manager, is 
responsible for monitoring the integrity of a wide range of 
products purchased by state agencies. She is currently 
examining a sample of paint containers recently received 
from a long-time supplier. According to the supplier, the 
process by which the cans are filled involves a small 



amount of variation from one can to the next, and the stan- 
dard deviation is 0.25 fluid ounces. The 40 cans in Sheila's 
sample were examined to determine how much paint they 
contained, and the results (in fluid ounces) are listed in data 
file XR09023. Using the mean for this sample, and assuming 
that the population standard deviation is 0.25 fluid ounces, 
construct the 90% confidence interval for the population 
mean volume for the cans of paint provided by the supplier. 
If the labels on the paint cans say the mean content for such 
containers is 100.0 fluid ounces, would your confidence in- 
terval tend to support this possibility? 



9.5 CONFIDENCE INTERVAL ESTIMATES 

FOR THE MEAN: v UNKNOWN 



It is rare that we know the standard deviation of a population but have no knowl- 
edge about its mean. For this reason, the techniques of the previous section are 
much less likely to be used in practice than those discussed here. Whenever the 
population standard deviation is unknown, it must be estimated by the sample 
standard deviation, s. For such applications, there is a continuous distribution 
called the Student's t distribution. 

The Student's t Distribution 
Description 

Also referred to as simply the t distribution, this distribution is really a family of 
continuous, unimodal, bell-shaped distributions. It was developed in the early 
1900s by W. S. Gossett, who used the pen name "Student" because his company 
did not permit employees to publish their research results. The t distribution is the 
probability distribution for the random variable t = (x — /a)/(s/V«). It has a 
mean of zero, but its shape is determined by what is called the number of degrees 
of freedom (df). For confidence interval applications, the specific member of the 
family is determined by df — n — 1 . 

The term degrees of freedom refers to the number of values that remain free 
to vary once some information about them is already known. For example, if four 
items have a mean of 10.0, and three of these items are known to have values of 
8, 12, and 7, there is no choice but for the fourth item to have a value of 13. In 
effect, one degree of freedom has been lost. 

The t distribution tends to be flatter and more spread out than the normal dis- 
tribution, especially for very small sample sizes. Figure 9.4 compares the approx- 
imate shape of a standard normal distribution with that of a t distribution for 
which df = 6. The t distribution converges to the normal distribution as the sam- 
ple size (and df) increases, and as the number of degrees of freedom approaches 
infinity, the two distributions are actually identical. As with our use of z previ- 
ously in this chapter, t represents distance in terms of standard error units. Seeing 
Statistics Applet 10, at the end of the chapter, can be used in seeing how the shape 
of the t distribution responds to different of/Values. 
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Standard normal 
distribution 



t distribution 
with df=6 




FIGURE 9.4 



A comparison of the 
approximate shape of 
the standard normal distri- 
bution with that of a f dis- 
tribution having 6 degrees 
of freedom. The shape of 
the f distribution is flatter 
and more spread out, 
but approaches that of 
the standard normal 
distribution as the number 
of degrees of freedom 
increases. 





Example 


& - : 




tSrJUrlP^!' 




t Distribution Table 



Using the t Distribution Table. A table for t values that correspond to selected 
areas beneath the t distribution appears on the pages immediately preceding the 
back cover. A portion of the table is reproduced as Table 9.2 on page 282. In gen- 
eral, it is used in the same way as the standard normal table, but there are two 
exceptions: (1) the areas provided are for the right tail only, and (2) it is neces- 
sary to refer to the appropriate degrees of freedom (df) row in finding the appro- 
priate t value. 

SOLUTION 

For a Sample Size of n = 15, What t Values Would Correspond to an Area 
Centered at t = 0 and Having an Area beneath the Curve of 95%? 

The area of interest beneath the curve can be expressed as 0.95, so the total area 
in both tails combined will be (1.00 — 0.95), or 0.05. Since the curve is symmet- 
rical, the area in just one tail will be 0.05/2, or 0.025. The number of degrees 
of freedom will be the sample size minus 1, or df = n — 1, or 15 — 1 = 14. 
Referring to the 0.025 column and the df = 14 row of the table, we find that 
the value of t corresponding to a right-tail area of 0.025 is t = +2.145. Because 
the curve is symmetrical, the value of t for a left-tail area of 0.025 will be 
t = -2.145. 

Note that these values of t (t = ±2.145) are farther apart than the z values 
(z = ±1.96) that would have led to a 95% area beneath the standard normal 
curve. Remember that the shape of the t distribution tends to be flatter and more 
spread out than that of the normal distribution, especially for small samples. 

For a Sample Size of n = 99, What t Values Would Correspond to an Area 
Centered at t = 0 and Having an Area beneath the Curve of 90%? 

In this case, the proportion of the area beneath the curve is 0.90, so each tail will 
have an area of (1.00 — 0.90)/2, or 0.05. Therefore, we will refer to the 0.05 
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TABLE 9.2 











A portion of the Student's 
f distribution table. The t 
distribution is really a family 
of symmetric, continuous 
distributions with a mean of 
f = 0. The specific member 
of the distribution depends 
on the number of degrees 
of freedom, or df. As df 
increases, the f distribution 
approaches the normal dis- 
tribution, and the f values 
in the infinity row are 
identical to the z values 
for the standard normal 
distribution. 




(For a right-tail area of 0.025 and 
df= 15, the? value is 2.131.) 







0 


t 






a 


0.10 


0.05 


0.025 


0.01 


0.005 


df= 1 


3.078 


6.314 


12.706 


31.821 


63.657 


2 


1.886 


2.920 


4.303 


6.965 


9.925 


3 


1.638 


2.353 


3.182 


4.541 


5.841 


4 


1.533 


2.132 


2.776 


3.747 


4.604 


5 


1.476 


2.015 


2.571 


3.365 


4.032 


6 


1.440 


1.943 


2.447 


3.143 


3.707 


7 


1.415 


1.895 


2.365 


2.998 


3.499 


8 


1.397 


1.860 


2.306 


2.896 


3.355 


9 


1.383 


1.833 


2.262 


2.821 


3.250 


10 


1.372 


1.812 


2.228 


2.764 


3.169 


11 


1.363 


1.796 


2.201 


2.718 


3.106 


12 


1.356 


1.782 


2.179 


2.681 


3.055 


13 


1.350 


1.771 


2.160 


2.650 


3.012 


14 


1.345 


1.761 


2.145 


2.624 


2.977 


15 


1.341 


1.753 


2.131 


2.602 


2.947 


98 


1.290 


1.661 


1.984 


2.365 


2.627 


99 


1.290 


1.660 


1.984 


2.365 


2.626 


100 


1.290 


1.660 


1.984 


2.364 


2.626 


"Infinity" 


1.282 


1.645 


1.960 


2.326 


2.576 



column in the t table. Subtracting 1 from the sample size of 99, df = 99 — 1, 
or 98. Using the 0.05 column and the df = 98 row, the corresponding t value is 
t = +1.661. Since t = +1.661 corresponds to a right-tail area of 0.05, t = 
— 1.661 will correspond to a left-tail area of 0.05. This is due to the symmetry of 
the distribution, and the distance from t = —1.661 to t = +1.661 will include 
90% of the area beneath the curve. You can use Seeing Statistics Applet 11, at the 
end of the chapter, to further examine areas beneath the t distribution curve. 



Should you encounter a situation in which the number of degrees of freedom 
exceeds the df = 100 limit of the t distribution table, just use the corresponding z 
value for the desired level of confidence. These z values are listed in the df = infinity 
row in the t distribution table. 



Chapter 9: Estimation from Sample Data 



283 



Using the t table instead of the standard normal table to which you've become ac- 
customed may seem cumbersome at first. As we mentioned previously, however, 
the ^-interval is the technically appropriate procedure whenever s has been used to 
estimate cr. This is also the method you will either use or come into contact with 
when dealing with computer statistical packages and their construction of confi- 
dence intervals. When using computer statistical packages, it's easy to routinely 
(and correctly) use the t distribution for constructing confidence intervals when- 
ever cr is unknown and being estimated by s. 

Confidence Intervals Using the t Distribution 

Aside from the use of the t distribution, the basic procedure for estimating 
confidence intervals is similar to that of the previous section. The appropriate 
t value is used instead of z, and s replaces cr. The t distribution assumes the under- 
lying population is approximately normally distributed, but this assumption is 
important only when the sample is small — that is, n < 30. The interval estimate is 
summarized as follows: 

Confidence interval limits for the population mean, cr unknown: 

sample mean 

sample standard deviation 
sample size 

t value corresponding to the level of 
confidence desired, with df = n — 1 
(e.g., t = 2.201 for 95% confidence, 
n = 12, and df = 12 - 1 = 11) 
estimated standard error of the sampling 
distribution of the mean 

(If n < 30, it must be assumed that the underlying population is approximately 
normally distributed.) 



Example 



t-lnterval. Mean 

A simple random sample of n = 90 manufacturing employees has been selected 
from those working throughout a state. The average number of overtime 
hours worked last week was x = 8.46 hours, with a sample standard deviation of 
s = 3.61 hours. The underlying data are in file CX09OVER. 

SOLUTION 

What Is the 98% Confidence Interval for the Population Mean, fi7 

The first step in determining the appropriate value of t is to identify the column 
of the t distribution table to which we must refer. Since the confidence level is 
98%, the right-tail area of interest is (1.00 - 0.98)/2, or 0.01. For this sample 
size, the number of degrees of freedom will be 90 — 1, or df = 89. Referring to 



AN IMPORTANT 
NOTE 



t — f= where x 
s 
n 
t 



slVn 
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FIGURE 9.5 



Although the sample size in 
this example is relatively 
large (n = 90), the f distri- 
bution was used in 
constructing the 98% confi- 
dence interval for the popu- 
lation mean. This is because 
the population standard de- 
viation is unknown and is 
being estimated by the 
sample standard deviation 
(s = 3.61 hours). 













Area = 0.98 




The t distribution: 
with df= n - 1 = 89. 
For 98% confidence, 
t will be ±2.369. 


I I I 


t = -2.369 


0 


t = +2.369 







The 98% confidence interval for /x is 













3.61 


8.46-2.369 




V90 





8.46 









x + t-L. 

V« 




8.46 + 2.369 


3.61 


V90 



7.56 




8.46 




9.36 







the 0.01 column and the df = 89 row, we determine t = +2.369. Due to the 
symmetry of the t distribution, 98% of the area beneath the curve will be 
between t = —2.369 and t = +2.369. For the results summarized in Figure 9.5, 
the underlying calculations for the 98% confidence interval are 



8.46 ± 2.369 



3.61 



or between 7.56 and 9.36 hours 



90 



A REMINDER 



If n is so large (e.g., n > 101) that df exceeds the finite limits of the t table, just 
use the infinity row of the table. In the preceding example, if n were &101, we 
would refer to the 0.05 column and the df = infinity row and obtain a t value of 
1.645. This is the same as using z instead of t. For such large samples, the z and t 
distributions are similar enough that the z will he a very close approximation. 



Computer Solutions 9.2 shows how we can use Excel or Minitab to generate 
a confidence interval for the mean when the population standard deviation is un- 
known. In this case, we are replicating the 98% confidence interval shown in Fig- 
ure 9.5, and the 90 data values are in file CX09OVER. Once again, if we use an 
Excel procedure based on summary statistics, it can be interesting to examine 
"what-if" scenarios to instantly see how changes in the specified confidence level 
would change the width of the confidence interval. 
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Confidence Interval for Population Mean, o- Unknown 

These procedures show how to construct a confidence interval for the population mean when the 
population standard deviation is unknown. 

EXCEL 





A 


B 


C 


D 


1 


t-Estimate: Mean 






2 










3 








hours 


4 


Mean 






8.4603 


5 


Standard Deviation 




3.61 


6 


LCL 






7.559 


7 


UCL 






9.362 



Excel confidence interval for fi based on raw data and o- unknown 

1 . For example, using the 90 overtime data values (file CX09OVER.XLS) on which Figure 9.5 is based: The label and 90 
data values are in A1:A91. Click Tools. Click Data Analysis Plus. Click t-Estimate: Mean. Click OK. 

2. Enter A1:A91 into the Input Range box. Click Labels. The desired confidence level as a decimal fraction is 0.98, so 
enter the corresponding alpha value (0.02) into the Alpha box. Click OK. The lower portion of the printout lists the 
lower and upper limits for the 98% confidence interval. 

Excel confidence interval for fi based on summary statistics and <r unknown 

1 . Open the ESTlMATORS.XLS workbook, supplied with the text. 

2. Using the arrows at the bottom left, select the f-Estimate_Mean worksheet. Enter the sample mean (8.46), the sam- 
ple standard deviation (3.61), the sample size (90), and the desired confidence level as a decimal fraction (0.98). 

(Note: As an alternative, you can use the Excel worksheet template TMTINT.XLS, supplied with the text. The steps are de- 
scribed within the template.) 

MINITAB 

Confidence interval for fi based on raw data and <r unknown 
One-Sample T: hours 

Variable N Mean StDev SE Mean 98% CI 

hours 90 8.460 3.610 0.381 (7.559, 9.362) 

1 . For example, using the data (file CX09OVER.MTW) on which Figure 9.5 is based, with the 90 data values in column C1: 
Click Stat. Select Basic Statistics. Click 1-Sample t. 

2. Select Samples in columns and enter C1 into the box. The Perform hypothesis test box should be left blank — we 
will not be doing hypothesis testing until the next chapter. 

3. Click Options. Enter the desired confidence level as a percentage (98.0) into the Confidence Level box. Within the 
Alternative box, select not equal. Click OK. Click OK. 

Confidence interval for fi based on summary statistics and <r unknown 

Follow the procedure in the previous steps for raw data, but in step 2 select Summarized data, enter 90 into the Sample size 
box, 8.46 into the Mean box, and 3.61 into the Standard deviation box. 
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9.24 When the t distribution is used in constructing a 
confidence interval based on a sample size of less than 30, 
what assumption must be made about the shape of the 
underlying population? 

9.25 Why are the t values listed in the df = infinity row 
of the t distribution table identical to the z values that 
correspond to the same right-tail areas of the standard 
normal distribution? What does this indicate about the 
relationship between the t and standard normal 
distributions? 

9.26 In using the t distribution table, what value of t 
would correspond to an upper-tail area of 0.025 for 
19 degrees of freedom? 

9.27 In using the t distribution table, what value of t 
would correspond to an upper-tail area of 0.10 for 
28 degrees of freedom? 

9.28 For df = 25, determine the value of A that 
corresponds to each of the following probabilities: 

a. P(t> A) = 0.025 

b. P(f< A) = 0.10 

c. P(-A <* < A) = 0.99 

9.29 For df = 85, determine the value of A that 
corresponds to each of the following probabilities: 

a. P(f>A) = 0.10 

b. P(t < A) = 0.025 

c. P(-A<t<A) = 0.98 

9.30 Given the following observations in a simple 
random sample from a population that is approximately 
normally distributed, construct and interpret the 90% 
and 95% confidence intervals for the mean: 

67 79 71 98 74 70 59 102 92 96 

9.31 Given the following observations in a simple 
random sample from a population that is approximately 
normally distributed, construct and interpret the 95% 
and 99% confidence intervals for the mean: 

66 34 59 56 51 45 38 58 52 52 
50 34 42 61 53 48 57 47 50 54 

9.32 A consumer magazine has contacted a simple random 
sample of 33 owners of a certain model of automobile and 
asked each owner how many defects had to be corrected 
within the first 2 months of ownership. The average num- 
ber of defects was x = 3.7, with a standard deviation of 
1.8 defects. 

a. Use the t distribution to construct a 95% confidence 
interval for /jl = the average number of defects for 
this model. 

b. Use the z distribution to construct a 95% confidence 
interval for /j, = the average number of defects for this 
model. 



c. Given that the population standard deviation is not 
known, which of these two confidence intervals 
should be used as the interval estimate for /x? 

9.33 The service manager of Appliance Universe has 
recorded the times for a simple random sample of 50 
refrigerator service calls taken from last year's service 
records. The sample mean and standard deviation were 
25 minutes and 10 minutes, respectively. 

a. Construct and interpret the 95% confidence interval 
for the mean. 

b. It's quite possible that the population of such times is 
strongly skewed in the positive direction — that is, 
some jobs, such as compressor replacement, might 
take 3 or 4 hours. If this were true, would the inter- 
val constructed in part (a) still be appropriate? 
Explain your answer. 

9.34 An automobile rental agency has the following 
mileages for a simple random sample of 20 cars that were 
rented last year. Given this information, and assuming 
the data are from a population that is approximately 
normally distributed, construct and interpret the 90% 
confidence interval for the population mean: 

55 35 65 64 69 37 88 80 39 61 

54 50 74 92 59 50 38 59 29 60 miles 

9.35 One of the most popular products sold by a manu- 
facturer of electrical fuses is the standard 30-ampere 
fuse used by electrical contractors in the construction 
industry. The company has tested a simple random sam- 
ple of 16 fuses from its production output and found the 
amperages at which they "blew" to be as shown here. 
Given this information, and assuming the data are from a 
normally distributed population, construct and interpret 
the 95% confidence interval for the population mean 
amperage these fuses will withstand. 

30.6 30.2 27.7 28.5 29.0 27.5 28.9 28.1 

30.3 30.8 28.5 30.3 30.0 28.5 29.0 28.2 amperes 

9.36 The author of an entry-level book on using Microsoft 
Word has carried out a test in which 35 novices were pro- 
vided with the book, a computer, and the assignment of 
converting a complex handwritten document with text and 
tables into a Microsoft Word file. The novices required an 
average time of 105 minutes, with a standard deviation of 
20 minutes. Construct and interpret the 90% confidence 
interval for the population mean time it would take all 
such novices to complete this task. 

9.37 An office equipment manufacturer has developed a 
new photocopy machine and would like to estimate the 
average number of 8 1 / 2 -by-ll copies that can be made 
using a single bottle of toner. For a simple random sample 
of 20 bottles of toner, the average was 1535 pages, with a 
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standard deviation of 30 pages. Making and stating 
whatever assumptions you believe are necessary, 
construct and interpret the 95% confidence interval for 
the population mean. 

9.38 Researchers have estimated that office workers in 
Germany receive an average of 15.0 fax messages per day. 
Assuming this finding to be based on a simple random sam- 
ple of 80 German office workers, with a sample standard 
deviation of s = 3.5 messages, construct and interpret the 
95% confidence interval for the population mean. Given 
this confidence interval, would it seem very unusual if 
another sample of this size were to have a mean of 16.5 
faxes? source: Anne R. Carey and Genevieve Lynn, "Message Overload?" 
USA Today, September 13, 1999, p. 1B. 

9.39 According to Nielsen//NetRatings, the average visitor 
to the American Greetings website spends 11.85 minutes 
at the site. Assuming this finding to be based on a simple 
random sample of 20 visitors to the site, with a sample 
standard deviation of s = 3.0 minutes, and from a popula- 
tion of visiting times that is approximately normally 
distributed, construct and interpret the 98% confidence 
interval for the population mean. Given this confidence 
interval, would it seem very unusual if another sample of 
this size were to have a mean visiting time of 13.0 minutes? 
source: "Top Web Properties," USA Today, April 27, 2000, p. 3D. 



/ data Set / Note: Exercises 9.40 and 9.41 require a 
computer and statistical software. 

9.40 In an article published in a British medical journal, 
Japanese researchers found that adults who were 
videotaped in a relaxed setting blinked an average of 
15.0 times per minute. Under the assumption that this 
finding was based on the sample data in file XR09040, 
construct and interpret the 99% confidence interval for 
the population mean. Based on this confidence interval, 
would it seem very unusual if another sample of this 
size were to exhibit a mean blink rate of 16.0 times per 
minute? source: "Blink Factor," USA Today, August 9, 1999, p. 3D. 

9.41 Automotive researchers have reported that building 
the Ford Mustang required an average of 22.3 labor hours. 
Under the assumption that this finding was based on the 
sample data in file XR09041 , construct and interpret the 
95% confidence interval for the population mean number 
of labor hours required for this model. Based on this 
confidence interval, would it seem very unusual if 
another sample of this size were to require an average of 
22.9 labor hours for production of a Ford Mustang? 
source: Michael Woodyard, "U.S. Makers Narrow Efficiency Gap," 
Automotive News, June21, 1999, p. 8. 



CONFIDENCE INTERVAL ESTIMATES FOR THE 9.6 
POPULATION PROPORTION 



Determining a confidence interval estimate for the population proportion requires 
that we use the sample proportion (p) for two purposes: (1) as a point estimate of 
the (unknown) population proportion, tt, and (2) in combination with the sample 
size (n) in estimating the standard error of the sampling distribution of the sample 
proportion for samples of this size. 

The technique of this section uses the normal distribution as an approximation 
to the binomial distribution. This approximation is considered satisfactory when- 
ever np and n(l — p) are both >5, and becomes better for large values of n and 
whenever p is closer to 0.5. The midpoint of the confidence interval is the sample 
proportion, and the lower and upper confidence limits are determined as follows: 

Confidence interval limits for the population proportion: 



Vp(l — p) . . number of successes 
where p = sample proportion = : - — — 
number of trials 

n = sample size 

Z = z value corresponding to desired level of 
confidence (e.g., z = 1.96 for 95% confidence) 



^ — = estimated standard error of the 

sampling distribution of the proportion 



288 



Part 3: Sampling Distributions and Estimation 



Example 



z-lnterval. Proportion 

In a USA Today/CNN poll, 1406 adults were randomly selected from across the United 
States. In response to the question, "Do you agree that the current system discourages 
the best candidates from running for president?" 22% responded "strongly agree." 1 

SOLUTION 

What Is the 95% Confidence Interval for the Population Proportion Who 
Would Have Answered "Strongly Agree" to the Question Posed? 

The sample proportion p = 0.22 is our point estimate of tt and the midpoint of 
the interval. Since the confidence level is to be 95%, z will be ±1.96. The result- 
ing confidence interval, shown in Figure 9.6, is 

p ± z\l*^ = 0.22 ± !. 9< ABEm = 0.198 to 0.242 
V n V 1406 

From these results, we have 95% confidence that the population proportion is 
somewhere between 0.198 and 0.242. Expressed in terms of percentage points, 
the interval would be from 19.8% to 24.2% for the percentage of the population 
who would have "strongly agreed," and the interval width would be (24.2 — 19.8), 
or 4.4 percentage points. 



Source: Jean Becker, "Voters Favor a National Primary," USA Today, February 5, 1988, p. 8A. 



FIGURE 9.6 



The 95% confidence interval 
for a population proportion, 
based on a political poll hav- 
ing a sample proportion of 
p = 0.22 and a sample size 
ofn = 1406. We have 95% 
confidence that tt is 
between 0.198 and 0.242. 



The 95% confidence interval for iris 



n 



0.22 -lW 0-22(1 -0.22) 
1406 

or 0.198 




0.22 



■ 0.220 




0.242 
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Computer Solutions 9.3 shows how we can use Excel or Minitab to generate 
a confidence interval for a population proportion. In this case, we are replicating 
the 95% confidence interval shown in Figure 9.6. As always, if we use an Excel 
procedure based on summary statistics, it can be interesting to examine "what-if" 
scenarios to instantly see how changes in the specified confidence level would 
change the width of the confidence interval. 




mputer solutions 9.3 

— I ill _ 4 i a 




Confidence Interval for Population Proportion 

These procedures show how to construct a confidence interval for the population proportion. 

EXCEL 





A B 


C 


D 


E 


1 


z-Estimate of a Proportion 








2 












3 


Sample proportion 


0.22 


Confidence Interval Estimate 






4 


Sample size 


1406 


0.22 


plus/minus 


0.022 


5 


Confidence level 


0.95 


Lower confidence limit 




0.198 


6 






Upper confidence limit 




0.242 



Excel confidence interval for ti based on summary statistics 

1. For example, with n = 1406 and p = 0.22, as in Figure 9.6: Open the ESTlMATORS.XLS workbook, supplied with 
the text. 

2. Using the arrows at the bottom left, select the z-Estimate_Proportion worksheet. Enter the sample proportion 
(0.22), the sample size (1406), and the desired confidence level as a decimal fraction (0.95). The confidence interval 
appears as shown here. 

(Note: As an alternative, you can use Excel worksheet template TMPINT.XLS, supplied with the text. The steps are described 
within the template.) 

Excel confidence interval for it based on raw data 

1 . For example, if we had 20 data values that were coded (1 = female, 2 = male), with the label and data values in 
A1 :A21 : Click Tools. Click Data Analysis Plus. Click Z-Estimate: Proportion. Click OK. 

2. Enter A1:A21 into the Input Range box. Enter 1 into the Code for Success box. Click Labels. The desired confidence 
level as a decimal fraction is 0.95, so enter the corresponding alpha value (0.05) into the Alpha box. Click OK. 



MINITAB 

Minitab confidence interval for tt based on summary statistics 

Test and CI for One Proportion 

Sample X N Sample p 95% CI 

1 309 1406 0.219772 (0.198128, 0.241417) 

Using the normal approximation. 

1 . This interval is based on the summary statistics for Figure 9.6: Click Stat. Select Basic Statistics. Click 1 Proportion. 
Select Summarized Data. Enter the sample size (in this case, 1406) into the Number of Trials box. Multiply the sam- 
ple proportion (0.22) times the sample size (1406) to get the number of "successes" or "events" as (0.22)(1406) = 
309.32. Round to the nearest integer and enter the result (309) into the Number of Events box. The Perform 
hypothesis test box should be left blank. 



{continued) 
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2. Click Options. Enter the desired confidence level as a percentage (95.0) into the Confidence Level box. Within the 
Alternative box, select not equal. Click to select Use test and interval based on normal distribution. Click OK. 
Click OK. 

Minitab confidence interval for tt based on raw data 

1. For example, if column C1 contains 20 data values that are coded (1 = female, 2 = male): Click Stat. Select Basic 
Statistics. Click 1 Proportion. Select Samples in columns and enter C1 into the dialog box. 

2. Follow step 2 in the summary-information procedure. Note: Minitab will select the larger of the two codes (i.e., 2 = 
male) as the "success" or "event" and provide the sample proportion and the confidence interval for the 
population proportion of males. To obtain the results for females, just recode the data so females will have the higher 
code number: Click Data. Select Code. Click Numeric to Numeric. Enter C1 into both the Code data from columns 
box and the Into columns box. Enter 1 into the Original values box. Enter 3 into the New box. Click OK. The new 
codes will be (3 = female, 2 = male). 




9.42 Under what conditions is it appropriate to use the 
normal approximation to the binomial distribution in 
constructing the confidence interval for the population 
proportion? 

9.43 A pharmaceutical company found that 46% of 1000 
U.S. adults surveyed knew neither their blood pressure nor 
their cholesterol level. Assuming the persons surveyed to be 
a simple random sample of U.S. adults, construct a 95% 
confidence interval for tt = the population proportion of 
U.S. adults who would have given the same answer if a 
census had been taken instead of a survey. 

9.44 An airline has surveyed a simple random sample of 
air travelers to find out whether they would be interested 
in paying a higher fare in order to have access to e-mail 
during their flight. Of the 400 travelers surveyed, 80 
said e-mail access would be worth a slight extra cost. 
Construct a 95% confidence interval for the population 
proportion of air travelers who are in favor of the 
airline's e-mail idea. 

9.45 In response to media inquiries and concerns 
expressed by groups opposed to violence, the president of 
a university with over 25,000 students has agreed to sur- 
vey a simple random sample of her students to find out 
whether the student body thinks the school's "Plundering 
Pirate" mascot should be changed to one that is less 
aggressive in name and appearance. Of the 200 students 
selected for participation in the survey, only 20% believe 
the school should select a new and more kindly mascot. 
Construct a 90% confidence interval for the population 
proportion of students who believe the mascot should be 
changed. Based on the sample findings and associated 



confidence interval, comment on the credibility of a local 
journalist's comment that "over 50%" of the students 
would like a new mascot. 

9.46 In examining a simple random sample of 100 sales 
invoices from several thousand such invoices for the previous 
year, a researcher finds that 65 of the invoices involved 
customers who bought less than $2000 worth of merchan- 
dise from the company during that year. Construct a 90% 
confidence interval for the proportion of all sales invoices 
that were for customers buying less than $2000 worth of 
merchandise during the year. 

9.47 Survey researchers estimate that 40% of U.S. 
women age 18-29 save in a 401(k) or individual retire- 
ment account. Assuming the persons surveyed to be a 
simple random sample of 1000 U.S. women in this age 
group, construct a 95% confidence interval for tt = the 
population proportion of U.S. women in this age group 
who would have given the same answer if a census had 
been taken instead of a survey, source: Anne R. Carey and 
Sam Ward, "Women Saving for Retirement," USA Today, September 5, 
1996, p. IB. 

9.48 A study by the Society of Human Resource Manage- 
ment found 23% of U.S. business executives surveyed 
believe that an employer has no right to read employees' 
e-mail. Assuming that the survey included a simple 
random sample of 1200 executives, construct a 90% 
confidence interval for tt = the population proportion of 
U.S. business executives who believe that employers have 
no right to read employees' e-mail, source: Anne R. Carey and 
Marcy E. Mullins, "Bosses OK E-Mail Oversight," USA Today, April 22, 1996, 
p. 1A. 
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9.49 According to Nielsen Media Research viewership 
data, the top television broadcast of all time was the last 
episode of M*A*S*H, which aired on February 28, 1983, 
and was viewed by an estimated 60.2% of all TV 
households. Assuming this estimate was based on a simple 
random sample of 1800 TV households, what is the 95% 
confidence interval for tt = the proportion of all TV house- 
holds who viewed the last episode of M*A*S*H} source: 
77ie World Almanac and Book of Facts 2003, p. 283. 

9.50 In a major industry where well over 100,000 manufac- 
turing employees are represented by a single union, a simple 
random sampling of n = 100 union members finds that 57% 
of those in the sample intend to vote for the new labor con- 
tract negotiated by union and management representatives. 

a. What is the 99% confidence interval for tt = the 
population proportion of union-represented employees 
who intend to vote for the labor contract? 

b. Based on your response to part (a), does contract 
approval by the union appear to be a "sure thing"? 
Why or why not? 

9.51 Repeat Exercise 9.50, but assume that the sample size 
was n = 900 instead of n = 100. 

9.52 Based on its 1999 survey, Student Monitor reports that 
20% of U.S. college students used the Internet for job hunt- 
ing during the month preceding the survey. Assuming this 
finding to be based on a simple random sample of 1600 col- 
lege students, construct and interpret the 90% confidence 
interval for the population proportion of college students 
who used the Internet for job hunting during this period. 
source: Julie Stacey, "Online Extracurricular Activities," USA Today, March 13, 
2000, p. 10D. 

9.53 According to Keynote Systems, Wal-Mart's website 
was available 95% of the time during a holiday shopping 
season. Assuming this finding to be based on a simple ran- 
dom sample of 200 attempts, construct and interpret the 
90% confidence interval for the population proportion of 
the time the Wal-Mart site was available during this period. 
source: "Measuring How Key Web Sites Handle Holiday Shopping Rush," 
USA Today, November 17, 1999, p. 3B. 

9.54 A Pathfinder Research Group survey estimates that, of 
U.S. adults who have a favorite among The Three Stooges, 
Moe is the favorite of 31% of them. Assuming this finding 



to be based on a simple random sample of 800 Stooge fans 
who have a favorite Stooge, construct and interpret the 
95% confidence interval for the population proportion 
whose favorite Stooge is Moe. source: Anne R. Carey and Marcy 
E. Mullins, "Favorite Stooges," USA Today, November 22, 1996, p. 1D. 

9.55 Estelle McCarthy, a candidate for state office in New 
Jersey, has been advised that she must get at least 65% of 
the union vote in her state. A recent political poll of likely 
voters included 800 union-member respondents, and 60% 
of them said they intended to vote for Ms. McCarthy. 
Based on the survey, construct and interpret the 95% confi- 
dence interval for the population proportion of likely-voter 
union members who intend to vote for Ms. McCarthy. Is 
the 65% level of support within the confidence interval? 
Given your answer to the preceding question, comment on 
the possibility that Ms. McCarthy might not succeed in 
obtaining the level of union support she needs. 

/ data Set / Note: Exercises 9.56 and 9.57 require a 
computer and statistical software. 

9.56 In the documentation that accompanies its products 
that are returned for in-warranty service, a manufacturer of 
electric can openers asks the customer to indicate the 
reason for the return. The codes and return problem cate- 
gories are: (1) "doesn't work," (2) "excessive noise," and 
(3) "other." Data file XR09056 contains the problem codes 
for a simple random sample of 300 product returns. Based 
on this sample, construct and interpret the 95% confidence 
interval for the population proportion of returns that were 
because the product "doesn't work." 

9.57 An investment counselor has purchased a large mail- 
ing list consisting of 50,000 potential investors. Before 
creating a brochure to send to members of the list, the 
counselor mails a questionnaire to a small simple random 
sampling of them. In one of the questions, the respondent is 
asked, "Do I think of myself as someone who enjoys taking 
risks?" The response codes are (1) "Yes" and (2) "No." 
The results for the 200 investors who answered this ques- 
tion are represented by the codes listed in data file XR09057. 
Based on this sample, construct and interpret the 99% con- 
fidence interval for the population proportion of investors 
on the counselor's mailing list who think of themselves as 
someone who enjoys taking risks. 



SAMPLE SIZE DETERMINATION 



In our interval estimates to this point, we have taken our results, including a 
stated sample size, then constructed a confidence interval. In this section, we'll 
proceed in the opposite direction — in other words, we will decide in advance the 
desired confidence interval width, then work backward to find out how large a 
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sample size is necessary to achieve this goal. Central to our discussion in this sec- 
tion is the fact that the maximum likely sampling error (accuracy) is one-half the 
width of the confidence interval. 

Estimating the Population Mean 

To show how the necessary sample-size equation is obtained, we will consider a 
case in which a is known. Of especial importance is that the distance from the 
midpoint to the upper confidence limit can be expressed as either (1) the maxi- 
mum likely sampling error (e) or (2) z times the standard error of the sampling 
distribution. Since the two quantities are the same, we can set up the following 
equation and solve for n: 



'Vn 



Solving for n 



-> n 



Required sample size for estimating a population mean: 



where n = required sample size 

z = z value for which ±z corresponds to the 



(7 



desired level of confidence 
known (or, if necessary, estimated) value of 
the population standard deviation 
maximum likely error that is acceptable 



One way of estimating an unknown <r is to use a relatively small-scale pilot 
study from which the sample standard deviation is used as a point estimate of the 
population standard deviation. A second approach is to estimate a by using the 
results of a similar study done at some time in the past. A third method is to esti- 
mate cr as 1/6 the approximate range of data values. 



Example 



Sample Size, Estimating a Mean 

A state politician would like to determine the average amount earned during sum- 
mer employment by state teenagers during the past summer's vacation period. She 
wants to have 95% confidence that the sample mean is within $50 of the actual 
population mean. Based on past studies, she has estimated the population stan- 
dard deviation to be a = $400. 

SOLUTION 

What Sample Size Is Necessary to Have 95% Confidence That x Will Be 
Within $50 of the Actual Population Mean? 

For this situation, 95% confidence leads to a z value of 1.96, e = the $50 maxi- 
mum likely error that is acceptable, and the estimated value of o-is $400. The nec- 
essary sample size will be 

z 2 -o- 2 1.96 2 -400 2 nArn 
n = j — = 775 = 245.9 persons, rounded up to 246 
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Since we can't include a fraction of a person in the sample, we round up to 
n = 246 to ensure 95% confidence in being within $50 of the population mean. 
Whenever the calculated value of n is not an integer, it is a standard (though 
slightly conservative) practice to round up to the next integer value. 

Note that if we cut the desired maximum likely error in half, the necessary 
sample size will quadruple. This is because the e term in the denominator is 
squared. The desire for extremely accurate results can lead to sample sizes that 
grow very rapidly in size (and expense) as the specified value for e is reduced. 



Estimating the Population Proportion 

As when estimating the population mean, the maximum likely error (e) in esti- 
mating a population proportion will be one-half of the eventual confidence inter- 
val width. Likewise, the distance from the midpoint of the confidence interval to 
the upper confidence limit can be described in two ways. Setting them equal and 
solving for n gives the following result: 



Pi 1 - P) . rrr~- — 2 1 - P) 



Solving for n 



-) n 



Required sample size 


for estimating 


; a population proportion: 


z 2 p(l - p) 
n = s 


where n = 


required sample size 




z = 


z value for which ±z corresponds to the 
desired level of confidence 




P = 


the estimated value of the population 
proportion (As a conservative strategy, 
use p = 0.5 if you have no idea as to 
the actual value of v.) 




e = 


maximum likely error that is acceptable 



In applying the preceding formula, we should first consider whether the true 
population proportion is likely to be either much less or much greater than 0.5. If 
we have absolutely no idea as to what tt might be, using p = 0.5 is the conserva- 
tive strategy to follow. This is because the required sample size, n, is proportional 
to the value of p(l — p), and this value is largest whenever p = 0.5. 

If we are totally uncertain regarding the actual population proportion, we 
may wish to conduct a pilot study to get a rough idea of its value. If we can esti- 
mate the population proportion as being either much less or much more than 0.5, 
we can obtain the desired accuracy with a smaller sample than would have other- 
wise been necessary. 

If the population proportion is felt to be within a range, such as "between 
0.20 and 0.40," we should use as our estimate the value that is closest to 0.5. 
For example, if we believe the population proportion is somewhere between 
0.20 and 0.40, it should be estimated as 0.40 when calculating the required 
sample size. 



Sampling Error in Survey Research 



When survey results are published, they are sometimes 
accompanied by an explanation of how the survey was 
conducted and how much sampling error could have been 
present. Persons who have had a statistics course need only 
know the size of the sample and the sample proportion or 
percentage for a given question. For the general public, 
however, the explanation of survey methods and sampling 
error needs a bit of rephrasing. The following statement 
accompanied the results of a survey commissioned by the 
Associated Press. 

How Poll Was Conducted 

The Associated Press poll on taxes was taken Feb. 14-20 
using a random sample of 1 009 adult Americans. 

No more than one time in 20 should chance variations 
in the sample cause the results to vary by more than 3 per- 
centage points from the answers that would be obtained if 
all Americans were polled. 

The first sentence in this description briefly describes the 
size and nature of the sample. The second sentence de- 
scribes the confidence level as 1 9/20, or 95%, and the sam- 
pling error as plus or minus 3 percentage points. Using the 
techniques of this chapter, we can verify the calculations in 



this description. For 95% confidence, z = 1 .96. Since some 
questions may have a population proportion of 0.5 (the 
most conservative value to use when determining the re- 
quired sample size), this is used in the following calculations: 



and 



z 2 p(1 - p) 



z l p(\ - p) 



where p = estimated 
population proportion 



(1.96) 2 (0.5)(1 - 0.5) 



1009 



= 0.0309 



As these calculations indicate, the Associated Press 
rounded down to the nearest full percentage point (from 
3.09 to 3.0) in its published explanation of the sampling 
error. This is not too unusual, since the general public would 
probably have enough to digest in the description quoted 
above without having to deal with decimal fractions. 



Source: "How Poll Was Conducted," in Howard Goldberg, "Most Not Ready 
to Scrap Tax System," Indiana Gazette, February 26, 1 996, p. 3. 



example 




Sample Size, Estimating a Proportion 

A tourist agency researcher would like to determine the proportion of U.S. adults 
who have ever vacationed in Mexico and wishes to be 95% confident that the 
sampling error will be no more than 0.03 (3 percentage points). 

SOLUTION 

Assuming the Researcher Has No Idea Regarding the Actual Value of the Pop- 
ulation Proportion, What Sample Size Is Necessary to Have 95% Confidence 
That the Sample Proportion Will Be within 0.03 (3 Percentage Points) of the 
Actual Population Proportion? 

For the 95% level of confidence, the z value will be 1.96. The maximum acceptable 
error is e = 0.03. Not wishing to make an estimate, the researcher will use p = 0.5 
in calculating the necessary sample size: 



z 2 p(l - p) 1.96 2 (0.5)(1 - 0.5) 



0.03 2 



1067.1 persons, rounded up to 1068 
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If the Researcher Believes the Population Proportion Is No More Than 0.3, 
and Uses p = 0.3 as the Estimate, What Sample Size Will Be Necessary? 

Other factors are unchanged, so z remains 1.96 and e is still specified as 0.03. 
However, the p(l — p) term in the numerator will be reduced due to the assump- 
tion that the population proportion is no more than 0.3. The required sample size 
will now be 

jpV ~ P) 1-96 2 (0.3)(1 - 0.3) 

n = ~ = i = 896.4 persons, rounded up to 897 

e~ 0.0j z 

As in determining the necessary size for estimating a population mean, lower 
values of e lead to greatly increased sample sizes. For example, if the researcher 
estimated the population proportion as being no more than 0.3, but specified a 
maximum likely error of 0.01 instead of 0.03, he would have to include nine 
times as many people in the sample (8068 instead of 897). 



Computer Solutions 9.4 (page 296) shows how we can use Excel to deter- 
mine the necessary sample size for estimating a population mean or proportion. 
With these procedures, it is very easy to examine "what-if" scenarios and in- 
stantly see how changes in confidence level or specified maximum likely error 
will affect the required sample size. 




9.58 "If we want to cut the maximum likely error in half, 
we'll have to double the sample size." Is this statement 
correct? Why or why not? 

9.59 In determining the necessary sample size in making 
an interval estimate for a population mean, it is necessary 
to first make an estimate of the population standard devi- 
ation. On what bases might such an estimate be made? 

9.60 From past experience, a package-filling machine has 
been found to have a process standard deviation of 0.65 
ounces of product weight. A simple random sample is to 
be selected from the machine's output for the purpose of 
determining the average weight of product being packed 
by the machine. For 95% confidence that the sample 
mean will not differ from the actual population mean by 
more than 0.1 ounces, what sample size is required? 

9.61 Based on a pilot study, the population standard 
deviation of scores for U.S. high school graduates taking a 
new version of an aptitude test has been estimated as 3.7 
points. If a larger study is to be undertaken, how large a 
simple random sample will be necessary to have 99% con- 
fidence that the sample mean will not differ from the 
actual population mean by more than 1.0 points? 

9.62 A consumer agency has retained an independent 
testing firm to examine a television manufacturer's claim 



that its 25-inch console model consumes just 110 watts of 
electricity. Based on a preliminary study, the population 
standard deviation has been estimated as 1 1 .2 watts for 
these sets. In undertaking a larger study, and using a simple 
random sample, how many sets must be tested for the firm 
to be 95% confident that its sample mean does not differ 
from the actual population mean by more than 3.0 watts? 

9.63 A national political candidate has commissioned a 
study to determine the percentage of registered voters who 
intend to vote for him in the upcoming election. To have 
95% confidence that the sample percentage will be within 
3 percentage points of the actual population percentage, 
how large a simple random sample is required? 

9.64 Suppose that Nabisco would like to determine, with 
95% confidence and a maximum likely error of 0.03, the 
proportion of first graders in Pennsylvania who had 
Nabisco's Spoon-Size Shredded Wheat for breakfast at 
least once last week. In determining the necessary size of 
a simple random sample for this purpose: 

a. Use 0.5 as your estimate of the population proportion. 

b. Do you think the population proportion could really 
be as high as 0.5? If not, repeat part (a) using an 
estimated proportion that you think would be more 
likely to be true. What effect does your use of this 
estimate have on the sample size? 
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computer solutions 9.4 



Sample Size Determination 

These procedures determine the necessary sample size for estimating a population mean or a population 
proportion. 

EXCEL 





A B 


C 


D 


1 


Sample size required for estimating a 


2 


population mean: 






3 










4 


Estimate for sigma: 




400.00 


5 


Maximum likely error, e: 


50.00 


6 










7 


Confidence level desired: 


0.95 


8 


alpha = (1 - conf. level desired): 


0.05 


9 


The corresponding z value is: 


1.960 


10 










11 


The required sample size is n = 


245.9 



Sample size for estimating a population mean, using Excel worksheet template TMNFORMU.XLS that accom- 
panies the text 

To determine the necessary sample size for estimating a population mean within $50 and with 95% confidence, as- 
suming a population standard deviation of $400: Open Excel worksheet TMNFORMU.XLS. Enter the estimated sigma 
(400), the maximum likely error (50), and the specified confidence level as a decimal fraction (0.95). The required sam- 
ple size (in cell D1 1) should then be rounded up to the nearest integer (246). This procedure is also described within 
the worksheet template. 

Caution Do not save any changes when exiting Excel. 

Sample size for estimating a population proportion, using Excel worksheet template TMNFORPl.XLS that 
accompanies the text 

To determine the necessary sample size for estimating a population proportion within 0.03 (3 percentage points) and 
with 95% confidence: Open Excel worksheet TMNFORPl.XLS. Enter the estimate for pi (0.50), the maximum likely error 
(0.03), and the specified confidence level as a decimal fraction (0.95). The required sample size (in cell D1 1) should then 
be rounded up to the nearest integer (1068). (Note: If you have knowledge about the population and can estimate pi 
as either less than or greater than 0.50, use your estimate and the necessary sample size will be smaller. Otherwise, be 
conservative and use 0.50 as your estimate.) This procedure is also described within the worksheet template. 

Caution Do not save any changes when exiting Excel. 



9.65 The Chevrolet dealers of a large county are conduct- 
ing a study to determine the proportion of car owners in 
the county who are considering the purchase of a new car 
within the next year. If the population proportion is 
believed to be no more than 0.15, how many owners 
must be included in a simple random sample if the dealers 
want to be 90% confident that the maximum likely error 
will be no more than 0.02? 

9.66 In Exercise 9.65, suppose that (unknown to the 
dealers) the actual population proportion is really 0.35. If 



they use their estimated value (tt < 0.15) in determining 
the sample size and then conduct the study, will their 
maximum likely error be greater than, equal to, or less 
than 0.02? Why? 

9.67 In reporting the results of their survey of a simple 
random sample of U.S. registered voters, pollsters claim 
95% confidence that their sampling error is no more than 
4 percentage points. Given this information only, what 
sample size was used? 
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WHEN THE POPULATION IS FINITE 9.8 



Whenever sampling is without replacement and from a finite population, it may be 
necessary to modify slightly the techniques for confidence-interval estimation and 
sample size determination in the preceding sections. As in Chapter 8, the general 
idea is to reduce the value of the standard error of the estimate for the sampling 
distribution of the mean or proportion. As a rule of thumb, the methods in this sec- 
tion should be applied whenever the sample size (n) is at least 5% as large as the 
population. When n < 0.05N, there will be very little difference in the results. 



Confidence-Interval Estimation 

Whether we are dealing with interval estimation for a population mean (/x) or a 
population proportion (it), the confidence intervals will be similar to those in Fig- 
ure 9.2. The only difference is that the "±" term will be multiplied by the "finite 
population correction factor" shown in Table 9.3. As in Chapter 8, this correc- 
tion depends on the sample size (n) and the population size (N). 

As an example of how this works, we'll consider a situation such as that in 
Section 9.5, where a confidence interval is to be constructed for the population 
mean, and the sample standard deviation (s) is used as an estimate of the popula- 
tion standard deviation (cr). In this case, however, the sample will be relatively 
large compared to the size of the population. 



Example 



Interval Estimates, Finite Population 

According to the Bureau of the Census, the population of Kent County, Texas, is 
812 persons. 2 For purposes of our example, assume that a researcher has inter- 
viewed a simple random sample of 400 persons and found that their average 



Confidence Interval Estimate for the Population Mean, <r Known 



Infinite Population 



Finite Population 



x ± z- 



X ± z 



N — n\ 



v n \ Vn V A/ — 1 1 

Confidence Interval Estimate for the Population Mean, cr Unknown 



Infinite Population 



Finite Population 



x ± t- 



x±t\ 



N — n\ 



n VN- 7/ 



Confidence Interval Estimate for the Population Proportion 
Infinite Population 



Finite Population 



P(1 - p) 



P(1 - P) 



N - n 
N — 7 



TABLE 9.3 



Summary of confidence 
interval formulas when 
sampling without replace- 
ment from a finite popula- 
tion. As a rule of thumb, 
they should be applied 
whenever the sample is at 
least 5% as large as the 
population. The formulas 
and terms are similar to 
those in Figure 9.2 but 
include a "finite population 
correction factor," the 
value of which depends 
on the relative sizes of 
the sample (n) and 
population (A/). 



2 Source: The World Almanac and Book of Facts 2003, p. 459. 
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number of years of formal education is x = 
of s = 4.3 years. 

SOLUTION 



11.5 years, with a standard deviation 



Considering That the Population Is Finite and n a 0.05/v, What Is the 
95% Confidence Interval for the Population Mean? 

Since the number of degrees of freedom (df = n — 1 = 400 — 1, or 399) exceeds 
the limits of our t distribution table, the t distribution and normal distribution can 
be considered to be practically identical, and we can use the infinity row of the t 
table. The appropriate column in this table will be 0.025 for a 95% confidence in- 
terval, and the entry in the infinity row of this column is a t value of 1.96. 

Since s is being used to estimate a, and the sample is more than 5% as large 
as the population, we will use the "<r unknown" formula of the finite population 
expressions in Table 9.3. The 95% confidence interval for the population mean 
can be determined as 



The finite population correction term: 
this will be smaller than 1.0 so the 
standard error will be less than if an 
infinite population were involved. 

Y 

X±t Vn' VN-1 

i 4.3 , /812 - 400 
= 11.5 ± 1.96 —7— * \ — r~ 
|V400 V 812-1 | 

= 11.5 ± 1.96(0.215 • 0.713) = 11.5 ± 0.300 

or from 11.200 to 11.800 



As expected, the finite correction term (0.713) is less than 1.0 and leads to a 
95% confidence interval that is narrower than if an infinite population had been 
assumed. (Note: If the population had been considered infinite, the resulting inter- 
val would have been wider, with lower and upper limits of 11.079 and 11.921 
years, respectively.) 



Sample Size Determination 

As in confidence-interval estimation, the rule of thumb is to change our sample 
size determination procedure slightly whenever we are sampling without replace- 
ment from a finite population and the sample is likely to be at least 5% as large 
as the population. Although different in appearance, the following formulas are 
applied in the same way that we used their counterparts in Section 9.7. 

If you were to substitute an N value of infinity into each of the following equa- 
tions, you would find that the right-hand term in the denominator of each would 
be eliminated, and the result would be an expression exactly the same as its coun- 
terpart in Section 9.7. 
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Required sample size for estimating the mean of a finite population: 
a 1 

n — -j 2 where n = required sample size 

H N = population size 

z z = z value for which ±z corresponds to the 

desired level of confidence 
a = known (or, if necessary, estimated) value 

of the population standard deviation 
e = maximum likely error that is acceptable 

Required sample size, estimating the proportion for a finite population: 

P(l ~P) , j . 

~2 7T~Z — \ where n = required sample size 



+ - — ~ — N = population size 



e 1 , p( l - p) 

z = z value for which ±z corresponds to the 
desired level of confidence 
the estimated value of the population 
proportion (As a conservative strategy, 
use p = 0.5 if you have no idea as to 
the actual value of it.) 
maximum likely error that is acceptable 



Example 












Sample Size, Finite Population 



The Federal Aviation Administration (FAA) lists 8586 pilots holding commercial 
helicopter certificates. 3 Suppose the FAA wishes to question a simple random 
sample of these individuals to find out what proportion are interested in switch- 
ing jobs within the next 3 years. Assume the FAA wishes to have 95% confidence 
that the sample proportion is no more than 0.04 (i.e., 4 percentage points) away 
from the true population proportion. 

SOLUTION 

Considering That the Population Is Finite, What Sample Size Is Necessary to 
Have 95% Confidence That the Sample Proportion Will Not Differ from the 
Population Proportion by More Than 0.04? 

Since the actual population proportion who are interested in switching jobs has 
not been estimated, we will be conservative and use p = 0.5 in deciding on the 
necessary sample size. For the 95% confidence level, z will be 1.96. Applying the 



^Source: General Aviation Manufacturers Association, General Aviation Statistical Database, 2005 
Edition, p. 37. 
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finite population formula, with N = 8586, the number of pilots who should be in- 
cluded in the sample is 



P(l ~ P) 
e 2 | p(l ~ 

z 1 N 



0.5(1 - 0.5) 



P) 



0.04 2 0.5 1 

T + ~ 

1.96 2 



0.5) 



561.0 



8586 



Had the population been infinite, the required sample size would have been 
calculated as in Section 9.7. This would have resulted in n = 600.25, rounded up 
to 601. By recognizing that the population is finite, we are able to achieve the de- 
sired confidence level and maximum error with a sample size that includes only 
561 pilots instead of 601. 




exercises 



9.68 As a rule of thumb, under what conditions should the 
finite population correction be employed in determining 
confidence intervals and calculating required sample sizes? 

9.69 Compared to situations where the population is either 
infinite or very large compared to the sample size, what 
effect will the finite population correction tend to have on 

a. the width of a confidence interval? 

b. the required size of a sample? 

9.70 The personnel manager of a firm with 200 employ- 
ees has selected a simple random sample of 40 employees 
and examined their health-benefit claims over the past 
year. The average amount claimed during the year was 
$260, with a standard deviation of $80. Construct and 
interpret the 95% confidence interval for the population 
mean. Was it necessary to make any assumptions about 
the shape of the population distribution? Explain. 

9.71 Of 1200 undergraduates enrolled at a univer- 
sity, a simple random sample of 600 have been surveyed 
to measure student support for a $5 activities fee increase 
to help fund women's intercollegiate athletics at the 
NCAA division 1A level. Of those who were polled, 55% 
supported the fee increase. Construct and interpret the 
95% and 99% confidence intervals for the population 
proportion. Based on your results, comment on the possi- 
bility that the fee increase might lose when it is voted on 
at next week's university-wide student referendum. 

9.72 A local environmental agency has selected a simple 
random sample of 16 homes to be tested for tap-water 
lead. Concentrations of lead were found to have a mean of 
12 parts per billion and a standard deviation of 4 parts per 
billion. Considering that the homes were selected from a 
community in which there are 100 homes, construct and 
interpret the 95% confidence interval for the population 
mean. Based on your results, comment on the possibility 
that the average lead concentration in this community's 



homes might exceed the Environmental Protection Agency's 
recommended limit of 15 parts per billion of lead. 

9.73 A simple random sample is to be drawn from a pop- 
ulation of 800. In order to have 95% confidence that the 
sampling error in estimating tt is no more than 0.03, 
what sample size will be necessary? 

9.74 A simple random sample is to be drawn from a pop- 
ulation of 2000. The population standard deviation has 
been estimated as being 40 grams. In order to have 99% 
confidence that the sampling error in estimating /x is no 
more than 5 grams, what sample size will be necessary? 

9.75 There are 100 members in the United States Senate. A 
political scientist wants to estimate, with 95% confidence 
and within 3 percentage points, the percentage who own 
stock in foreign companies. How many senators should be 
interviewed? Explain any assumptions you used in obtain- 
ing your recommended sample size. 

9.76 A transportation company operates 200 trucks and 
would like to use a hidden speed monitor device to record 
the maximum speed at which a truck is operated during 
the period that the device is installed. The trucks are driven 
primarily on interstate highways, and the company wants 
to estimate the average maximum speed for its fleet with 
90% confidence and within 2 miles per hour. Using (and 
explaining) your own estimate for the population standard 
deviation, determine the number of trucks on which the 
company should install the hidden speed-recording device. 

9.77 A research firm supports a consumer panel of 2000 
households that keep written diaries of their weekly gro- 
cery expenditures. The firm would like to estimate, with 
95% confidence and within 4 percentage points, the per- 
centage of its panel households who would be interested 
in providing more extensive information in return for an 
extra $50 per week remuneration. How many of the 
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households should be surveyed? Explain any assumptions 
you used in obtaining your recommended sample size. 

9.78 A university official wants to estimate, with 99% 
confidence and within $2, the average amount that 
members of fraternities and sororities spend at local 
restaurants during the first week of the semester. If the 
total fraternity/sorority membership is 300 people, how 
many members should be included in the sample? Use 



(and explain) your own estimate for the population stan- 
dard deviation. 

9.79 A quality-management supervisor believes that no 
more than 5% of the items in a recent shipment of 2000 
are defective. If she wishes to determine, within 1 percent- 
age point and with 99% confidence, the percentage of 
defective items in the shipment, how large a simple 
random sample would be necessary? 



SUMMARY 9.9 



• Inferential statistics: point and interval estimates for a population parameter 

Chapter 8 examined the sampling distribution of a sample mean or a sample pro- 
portion from a known population. In this chapter, the emphasis has been on the 
estimation of an unknown population mean (/jl) or proportion (tt) on the basis of 
sample statistics. Point estimates involve using the sample mean (x) or proportion 
(p) as the single best estimate of the value of the population mean or proportion. 
Interval estimates involve a range of values that may contain the actual value of 
the population parameter. When interval estimates are associated with a degree of 
certainty that they really do include the true population parameter, they are re- 
ferred to as confidence intervals. 

• Constructing a confidence interval for a population mean or a population 
proportion 

The procedure appropriate to constructing an interval estimate for the population 
mean depends largely on whether the population standard deviation is known. 
Figure 9.2 summarizes these procedures and their underlying assumptions. Al- 
though the interval is often associated with interval estimates based on small 
samples, it is appropriate for larger samples as well. Using computer statistical 
packages, we can easily and routinely apply the t distribution for interval esti- 
mates of the mean whenever a is unknown, even for very large sample sizes. 

A trade-off exists between the degree of confidence that an interval contains 
the population parameter and the width of the interval itself. The more certain we 
wish to be that the interval estimate contains the parameter, the wider the interval 
will have to be. 

• Sample size determination 

Accuracy, or sampling error, is equal to one-half of the confidence interval width. 
The process of sample size determination anticipates the width of the eventual 
confidence interval, then determines the required sample size that will limit the 
maximum likely sampling error to an acceptable amount. 

• When the sample is large compared to the population 

As in Chapter 8, when sampling is without replacement from a finite population, 
it is appropriate to use a finite population correction factor whenever the sample is 
at least 5% of the size of the population. Such corrections are presented for both 
interval estimation and sample size determination techniques within the chapter. 

• Computer-generated confidence intervals 

Most computer statistical packages are able to construct confidence interval esti- 
mates of the types discussed in the chapter. Examples of Excel and Minitab outputs 
are provided for a number of chapter examples in which such confidence intervals 
were developed. 
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Confidence Interval Limits for the Population Mean, <r Known 

x ± z—j= where x = sample mean 

n a = population standard deviation 

n = sample size 

z = z value for desired confidence level 
a/y/n = standard error of the sampling 
distribution of the mean 

(Assumes that either (1) the underlying population is normally distributed or (2) the sam- 
ple size is n > 30.) 

Confidence Interval Limits for the Population Mean, a- Unknown 

x ± t —j= where x = sample mean 

s = sample standard deviation 
n = sample size 

t = t value corresponding to the level of 
confidence desired, with df = n — 1 
s/Vw = estimated standard error of the sampling 
distribution of the mean 

(If n < 30, this requires the assumption that the underlying population is approximately 
normally distributed.) 

Confidence Interval Limits for the Population Proportion 



Vp(l — p) . number of successes 
where p = sample proportion = : — — 
n number of trials 

n = sample size 

z = z value corresponding to desired level of 

confidence (e.g., z = 1.96 for 95% confidence) 



^ — = estimated standard error of the sampling 

distribution of the proportion 



Required Sample Size for Estimating a Population Mean 



z 



2 



(T 2 



where n = required sample size 

z = z value for which ±z corresponds to the 

desired level of confidence 
a = known (or, if necessary, estimated) value of 

the population standard deviation 
e = maximum likely error that is acceptable 



Required Sample Size for Estimating a Population Proportion 

z 2 pa -p) 



where n = required sample size 

z = z value for desired level of confidence 
p = estimated value of the population 

proportion (if not estimated, use p = 0.5) 
e = maximum likely error that is acceptable 
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Confidence Interval Estimates When the Population Is Finite 

• For the population mean, a known: 



' n 



N-n 



N - 1 



where n = sample size 

N = population size 



For the population mean, a unknown: 

- s 
x ± t\ — 

For the population proportion: 



n 



N-n 



N - 1 



p(l-p) j N-n ] 
N — 1 1 



Required Sample Size for Estimating the Mean of a Finite Population 

a 2 , . , , . 

n — — 2 2 where » = required sample size 

~2 H N = population size 

z z = z value for desired level of confidence 



cr = known (or estimated) value of the population 

standard deviation 
e = maximum likely error that is acceptable 



Required Sample Size for Estimating the Proportion for a Finite Population 

P(l-P) 



P(l-P) 
N 



where n = required sample size 
N = population size 

z = z value for desired level of confidence 

p = the estimated population proportion (if not 

estimated, use p = 0.5) 
e = maximum likely error that is acceptable 



chapter exercises 



9.80 In a destructive test of product quality, a briefcase 
manufacturer places each of a simple random sample of the 
day's production in a viselike device and measures how 
many pounds it takes to crush the case. From past experi- 
ence, the standard deviation has been found to be 21.5 
pounds. For 35 cases randomly selected from today's 
production, the average breaking strength was 341.0 
pounds. Construct and interpret the 99% confidence inter- 
val for the mean breaking strength of the briefcases 
produced today. 

9.81 Working independently, each of two researchers has 
devised a sampling plan to be carried out for the purpose of 
constructing a 90% confidence interval for the mean of a 



certain population. What is the probability that neither of 
their confidence intervals will include the population mean? 

9.82 The accompanying data represent one-way 
commuting times (minutes) for a simple random sample 
of 15 persons who work at a large assembly plant. The 
data are also in file XR09082. Assuming an approximately 
normal distribution of commuting times for those who 
work at the plant, construct and interpret the 90% and 
95% confidence intervals for the mean. 



21.7 26.8 
39.0 28.0 
30.0 33.6 



33.1 
24.7 
33.3 



27.9 23.5 
28.4 28.9 
34.1 35.1 
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9.83 A torque wrench used in the final assembly of cylin- 
der heads has a process standard deviation of 5.0 lb-ft. The 
engineers have specified that a process average of 135 lb-ft 
is desirable. For a simple random sample of 30 nuts that 
the machine has recently tightened, the sample mean is 
137.0 lb-ft. Construct and interpret the 95% confidence 
interval for the current process mean. Discuss the possibil- 
ity that the machine may be in need of adjustment to 
correct the process mean. 

9.84 There are approximately 109 million television 
households in the United States. A ratings service would 
like to know, within 5 percentage points and with 95% 
confidence, the percentage of these households who tune 
in to the first episode of a network miniseries. How many 
television households must be included in the sample? 
source: The World Almanac and Book of Facts 2006, p. 278. 

9.85 In Exercise 9.84, a small-scale preliminary survey 
has indicated that no more than 20% of the television 
households will tune in to the first episode of the 
miniseries. Given this information, how large must the 
sample be? 

9.86 In a survey of 500 U.S. adults, 45% of them said 
that lounging at the beach was their "dream vacation." 
Assuming this to be a simple random sample of U.S. 
adults, construct and interpret the 95% and 99% 
confidence intervals for the proportion of U.S. adults who 
consider lounging at the beach to be their dream vacation. 

9.87 For the following simple random sample of house- 
hold incomes (thousands of dollars) from a large county, 
construct and interpret the 90% and 95% confidence in- 
tervals for the population mean. The data are also in file 
XR09087. 

58.3 50.0 58.1 33.5 51.1 38.1 42.3 60.4 55.8 46.2 

40.4 52.5 51.3 47.5 48.5 59.3 40.9 37.1 39.1 43.6 
55.3 42.3 48.2 42.8 61.1 34.7 35.5 52.9 44.7 51.5 

9.88 For a new process with which the production 
personnel have little experience, neither the standard de- 
viation nor the mean of the process is known. Twenty dif- 
ferent simple random samples, each with n = 50, are to 
be drawn from the process, and a 90% confidence inter- 
val for the mean is to be constructed for each sample. 
What is the probability that at least 2 of the confidence 
intervals will not contain the population mean? 

9.89 There were 904 new Subway Restaurants franchises 
opened during 2002. Suppose that Subway wished to sur- 
vey a simple random sample of the new franchisees to 
find out what percentage of them were totally pleased 
with their relationship with the company. If Subway 
wanted to have 90% confidence in being within 3 
percentage points of the population percentage who are 
pleased, how many of the new franchisees would have to 
be included in the sample? source: Subway.com. June 13, 2003. 



9.90 In Exercise 9.89, suppose Subway has carried out 
the study, using the sample size determined in that exer- 
cise, and 27.5% of the franchisees say they are pleased 
with their relationship with Subway. Construct and 
interpret the 95% confidence interval for the population 
percentage. 

9.91 A research firm wants to be 90% confident that a 
population percentage has been estimated to within 3 
percentage points. The research manager calculates the 
necessary sample size with 0.5 as his estimate of the pop- 
ulation proportion. A new business school graduate who 
has just joined the firm questions the research manager 
further, and they agree that the population proportion is 
no more than 0.3. If interviews cost $10 each, how much 
money has the new graduate just saved the company? 

9.92 The activities director of a large university has 
surveyed a simple random sample of 100 students for 
the purpose of determining approximately how many 
students to expect at next month's awards ceremony to 
be held in the gymnasium. Forty of the students said they 
plan to attend. What are the upper and lower 95% confi- 
dence limits for the number of the university's 10,000 
students who plan to attend the awards ceremony? 

9.93 A research firm has found that 39% of U.S. adults 
in the over-$75,000 income category work at least 51 
hours per week. Assuming this was a simple random sam- 
ple of 500 adults in this income group, construct and 
interpret the 95% and 99% confidence intervals for the 
proportion who work at least 51 hours per week. For 
each of the confidence intervals, identify and explain the 
maximum likely error in the study. 

9.94 For a process having a known standard deviation, a 
simple random sample of 35 items is selected. If the width 
of the 95% confidence interval is identified as y, express the 
width of the 99% confidence interval as a multiple of y. 

9.95 The makers of Count Chocula breakfast cereal 
would like to determine, within 2 percentage points and 
with 99% confidence, the percentage of U.S. senior 
citizens who have Count Chocula for breakfast at least 
once a week. What sample size would you recommend? 

9.96 In a work-sampling study, an industrial engineer 
has observed the activities of a clerical worker on 121 
randomly selected times during a workweek. On 32 of 
these occasions, the employee was talking on the 
telephone. For an 8-hour day, what are the upper and 
lower 95% confidence limits for the number of minutes 
this employee talks on the phone? 

9.97 A researcher would like to determine, within 3 per- 
centage points and with 90% confidence, the percentage of 
Americans who have a certain characteristic. If she feels 
certain that the percentage is somewhere between 20% 
and 40%, how many persons should be included in the 
sample? 
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9.98 In a survey of 1320 executives who oversee corpo- 
rate data systems, 24% said they had experienced losses 
caused by computer viruses during the past year. Assum- 
ing the executives were a simple random sample of all 
such executives, construct and interpret the 90% 
confidence interval for the population proportion who 
were monetarily harmed by computer viruses that year. 

9.99 An airline would like to determine, within 3 
percentage points and with 95% confidence, the percent- 
age of next month's customers who judge the courtesy of 
its employees as being "very good to excellent." What 
sample size would you recommend? 

9.100 A consultant conducts a pilot study to estimate a 
population standard deviation, then determines how large 
a simple random sample will be necessary to have a given 
level of confidence that the difference between x and n 
will be within the maximum error specified by her client. 
The necessary sample size has been calculated as n = 
100. If the client suddenly decides that the maximum 
error must be only one-fourth that originally specified, 
what sample size will now be necessary? 

9.101 There are 1733 machinery rebuilding and repair- 
ing companies in the United States. A tool manufacturer 
wishes to survey a simple random sample of these firms 
to find out what proportion of them are interested in a 
new tool design. If the tool manufacturer would like to be 
95% confident that the sample proportion is within 0.01 
of the actual population proportion, how many machin- 
ery rebuilding and repairing companies should be 
included in the sample? source: American Business Information, 
Sales Leads & Mailing Lists, August 1999, p. 22. 

9.102 In Exercise 9.101, suppose the tool manufacturer 
has carried out the study, using the sample size determined 
in that exercise, and 39.0% of the machinery rebuilding 
and repairing companies are interested in the new tool 
design. Construct and interpret the 95% confidence 
interval for the population percentage. 

9.103 The Colgate-Palmolive Company has 37,700 
employees. If the company wishes to estimate, within 

2 percentage points and with 99% confidence, the percent- 
age of employees who are interested in participating in a 
new stock option benefits program, how large a simple 
random sample will be necessary? source: Colgate-Palmolive 
Company, 2002 Annual Report, p. 8. 

9.104 To gain information about competitors' products, 
companies sometimes employ "reverse engineering," 
which consists of buying the competitor's product, then 
taking it apart and examining the parts in great detail. 
Engaging in this practice, a bicycle manufacturer intends 
to buy two or more of a leading competitor's mountain 
bikes and measure the tensile strength of the crossbar 
portion of the frame. Past experience has shown these 
strengths to be approximately normally distributed with a 
standard deviation of 20 pounds per square inch (psi). 



If the bike purchaser wants to have 90% confidence that 
the sampling error will be no more than 5 psi, how many 
of the competitor's mountain bikes should be purchased 
for destructive testing? 

9.105 A survey of business travelers found that 40% of 
those surveyed utilize hotel exercise facilities during their 
stay. Under the assumption that a simple random sample 
of 1000 business travelers were surveyed, construct and 
interpret the 90% and 95% confidence intervals for the 
proportion of business travelers who use their hotel's 
exercise facilities. 

9.106 A researcher, believing tt to be no more than 0.40, 
calculates the necessary sample size for the confidence 
level and maximum likely error he has specified. Upon 
completing the study, he finds the sample proportion to 
be 0.32. Is the maximum likely error greater than, equal 
to, or less than that originally specified? Explain. 

9.107 A truck loaded with 8000 electronic circuit boards 
has just pulled into a firm's receiving dock. The supplier 
claims that no more than 3 % of the boards fall outside the 
most rigid level of industry performance specifications. In a 
simple random sample of 300 boards from this shipment, 
12 fall outside these specifications. Construct the 95% 
confidence interval for the percentage of all boards in this 
shipment that fall outside the specifications, then comment 
on whether the supplier's claim would appear to be 
correct. 

9.108 A researcher has estimated that U.S. college 
students spend an average of 17.2 hours per week on the 
Internet. Assuming a simple random sample of 500 
college students and a sample standard deviation of 

1.4 hours per week, construct and interpret the 99% 
confidence interval for the population mean. 

/ data Set / Note: Exercises 9.109-9.111 require a 
computer and statistical software. 

9.109 According to the National Restaurant Association, 
the average check for a sit-down dinner is $25. Such a 
finding could have been based on data like the 800 sam- 
ple checks in file XR09109. Using the data in this file, con- 
struct and interpret the 95% confidence interval for the 
population mean, source: "U.S. Dining-OutTab: $1B a Day," USA 
Today, May 25, 2000, p. 1D. 

9.110 For taxpayers having an adjusted gross income of 
$1 million or more, the Internal Revenue Service reports 
that the average deduction for gifts to charity was 
$144,700. Curious to see how his state compares, a legis- 
lator surveys a simple random sample of 200 taxpayers 
from his state who are in this gross income category, with 
the data as shown in file XR09110. Using the data in this 
file, construct and interpret the 90% confidence interval 
for the mean charitable-gifts deduction for all of the 
state's taxpayers who are in the $1 million or more 
adjusted gross income category. Is $144,700 within the 
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confidence interval? Given the answer to the preceding 
question, comment on whether the state's taxpayers who 
are in this income group might not be typical of those in 
the nation as a whole in terms of their tax-deductible 
charitable contributions, source: "Brilliant Deductions, Taxing 
Questions," USA Today, March 3, 2000, p. 3B. 

9.111 To avoid losing part of their federal highway fund 
allocation, state safety administrators must ensure that 
interstate speed limits are adequately enforced within 
their state. In an upcoming test, federal researchers will 
be randomly selecting and clocking a very large sample 
of vehicles on a given section of the state's portion of an 



interstate highway that has historically had a relatively 
high accident rate. In anticipation of the upcoming study, 
state administrators randomly select and clock 100 vehi- 
cles along this route, obtaining the speeds shown in data 
file XR09111. Construct and interpret the 95% confidence 
interval for the population mean vehicle speed along this 
stretch of highway. Based on this interval, comment on 
whether the mean speed for the population of vehicles 
using this part of the highway might be 70 mph, the cut- 
off above which federal highway funds become 
endangered. 
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Thorndike Sports Equipment (Thorndike Video Unit Four) 



Seeing the fishing pole in his grandfather's office, Ted 
Thorndike's first thought is that old Luke is going to go 
fishing again and leave him to manage the store. He is 
quite surprised to learn the fishing pole is actually an inspi- 
ration for a new series of ads that Luke has in mind. 

The elder Thorndike explains, "Ted, this fishing pole 
is made of graphite, the same stuff that goes into our Graf- 
Pro racquetball racquets. It's so flexible and strong that it 
can be bent so the two ends actually touch each other. 
They even show this in the ads." Although Luke realizes 
that you can't do exactly the same thing with a racquetball 
racquet, he'd like to put some of his racquets into a hori- 
zontal mounting device, then see how much weight they'll 
take before they break. 

If the amount of weight is impressive enough, Luke 
plans to include this kind of test in the television advertise- 
ments he's planning for the firm's racquetball racquets. 
However, he wants to be careful not to brag about the 
racquet being able to hold too much weight, since the firm 
could get into trouble with the government and other 
truth-in-advertising advocates. 

He asks Ted to set up a test in which racquets are 
mounted horizontally, then the weight on the end is 



gradually increased until they break. Based on the test re- 
sults, a weight value would be selected such that the aver- 
age racquet would almost certainly be able to withstand 
this amount. Although accuracy is important, Ted has 
been instructed not to break more than 15 or 20 racquets 
in coming up with an average for all the racquets. 

For 20 racquets subjected to this severe test, the 
weight (in pounds) at which each one failed was as fol- 
lows. The data are also in file THORN09. 
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Ted believes it's reasonable to assume the population of 
breaking strengths is approximately normally distributed. 
Because of Luke's concern about being able to support the 
advertising claim, he wants to be very conservative in esti- 
mating the population mean for these breaking strengths. 
Ted needs some help in deciding how conservative he 
would like to be, and in coming up with a number that can 
be promoted in the ads. 



Springdale Shopping Survey 

The case in Chapter 2 listed 30 questions asked of 150 re- 
spondents in the community of Springdale. The coding key 
for these responses was also provided in this earlier 



exercise. The data are in file SHOPPING. In this exercise, 
some of the estimation techniques presented in the chapter 
will be applied to the survey results. You may assume that 

(continued) 
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Confidence Interval Size 

This applet allows us to construct and view z-intervals for 
the population mean by using the slider to specify the con- 
fidence level. As in Figure 9.3, the sample mean is 1.400 
inches, the sample size is 30, and the population standard 
deviation is known to be 0.053 inches. 

Note that the confidence interval limits shown in the 
graph may sometimes differ slightly from those we would 



95.0% Confidence Interval 
Data \A 





calculate using the pocket calculator and our standard nor- 
mal distribution table. This is because the applet is using 
more exact values for z than we are able to show within 
printed tables like the one in the text. 

Applet Exercises 

9.1 With the slider positioned so as to specify a 95% confi- 
dence interval for (jl, what are the upper and lower 
confidence limits? 

9.2 Move the slider so that the confidence interval is now 
99%. Describe how the increase in the confidence level has 
changed the width of the confidence interval. 

9.3 Move the slider so that the confidence interval is now 
80%. Describe how the decrease in the confidence level 
has changed the width of the confidence interval. 

9.4 Position the slider at its extreme left position, then 
gradually move it to the far right. Describe how this move- 
ment changes the confidence level and the width of the 
confidence interval. 



.42 1.43 

1.419 



these respondents represent a simple random sample of all 
potential respondents within the community and that the 
population is large enough that application of the finite 
population correction would not make an appreciable dif- 
ference in the results. 

Managers associated with shopping areas like these 
find it useful to have point estimates regarding variables 
describing the characteristics and behaviors of their cus- 
tomers. In addition, it is helpful for them to have some 
idea as to the likely accuracy of these estimates. Therein 
lies the benefit of the techniques presented in this chapter 
and applied here. 

1. Item C in the description of the data collection instru- 
ment lists variables 7, 8, and 9, which represent the re- 
spondent's general attitude toward each of the three 
shopping areas. Each of these variables has numerically 
equal distances between the possible responses, and for 
purposes of analysis they may be considered to be of the 
interval scale of measurement. 



a. Determine the point estimate, then construct the 95% 
confidence interval for /jl 7 = the average attitude to- 
ward Springdale Mall. What is the maximum likely 
error in the point estimate of the population mean? 

b. Repeat part (a) for fi s and /jl 9 , the average attitudes 
toward Downtown and West Mall, respectively. 

2. Given the breakdown of responses for variable 26 (sex 
of respondent), determine the point estimate, then 
construct the 95% confidence interval for tt 26 = the 
population proportion of males. What is the maxi- 
mum likely error in the point estimate of the popula- 
tion proportion? 

3. Given the breakdown of responses for variable 28 
(marital status of respondent), determine the point 
estimate, then construct the 95% confidence interval 
for 7T 2g = the population proportion in the "single or 
other" category. What is the maximum likely error in 
the point estimate of the population proportion? 



Comparing the Normal and Student t Distributions 




In this applet, we use a slider to change the number of de- 
grees of freedom and shape for the Student f distribution 
and then observe how the resulting shape compares to that 
of the standard normal distribution. The standard normal 




distribution is fixed and shown in red, and the Student t dis- 
tribution is displayed in blue. 

Applet Exercises 

10.1 Move the slider so that df = 5. Describe the shape of 
the f distribution compared to that of the standard normal 
distribution. 

10.2 Move the slider downward so that df = 2. How has 
this decrease changed the shape of the f distribution? 

10.3 Gradually move the slider upward so that increases 
from 2 to 10. Describe how the shape of the t distribution 
changes along the way. 

10.4 Position the slider so that df = 2, then gradually move 
it upward until df = 100. Describe how the shape of the f 
distribution changes along the way. 
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Student t Distribution Areas 



Applet Exercises 

11.1 With the slider set so that df = 9 and the left text box 
containing f = 3.25, what is the area beneath the curve be- 
tween f = -3.25 and r = +3.25? 

11.2 Gradually move the slider upward until df= 89. What 
effect does this have on the f value shown in the text 
box? 

11.3 Position the slider so that df = 2, then gradually move 
it upward until df = 100. Describe how the value in the t 
text box and the shape of the t distribution change along 
the way. 

11.4 With the slider set so that df = 9, enter 0.10 into the 
two-tail probability text box at the right. What value of t now 
appears in the left text box? To what right-tail area does this 
correspond? Verify the value of tfordf = 9 and this right-tail 
area by using the f table immediately preceding the back 
cover of the book. 



In this applet, we use a slider to change the number of de- 
grees of freedom for the f distribution, and text boxes allow 
us to change the t value or the two-tail probability for a given 
df. When changing a text-box entry, be sure the cursor is still 
within the box before pressing the enter or return key. 
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Fat-Free or Regular Pringles: 
Can Tasters Tell the Difference? 



When the makers of Pringles potato chips came out with new Fat-Free Pringles, they 
wanted the fat-free chips to taste just as good as their already successful regular 
Pringles. Did they succeed? In an independent effort to answer this question, 
USA Today hired registered dietitian Diane Wilke to give 44 people a chance to see 
whether they could tell the difference between the two kinds of Pringles. Each tester 
was given two bowls of chips — one containing Fat-Free Pringles, the other containing 
regular Pringles — and nobody was told which was which. 

On average, if the two kinds of chips really taste the same, we'd expect such 
testers to have a 50% chance of correctly identifying the bowl containing the fat-free 
chips. However, 25 of the 44 testers (56.8%) successfully identified the bowl with the 
fat-free chips. 

Does this result mean that Pringles failed in its attempt to make the 
products taste the same, or could the difference between „.„„7 

the observed 56.8% and the theoretical , rpthan n alT 

50% have happened just by correct sig ni 

chance? Actually, if the chips Is th e 

really taste the same and we were 
to repeat this type of test many 
times, pure chance would lead to 
about 1 /5 of the tests yielding a 
sample percentage at least as high 
as the 56.8% observed here. Thus, 
this particular test would not allow us 
to rule out the possibility that the 
chips taste the same. After reading 
Sections 1 0.3 and 1 0.6 of this chapter, 
you'll be able to verify how we reached 
this conclusion. For now, just trust us 
and read on. Thanks. 



Source: Beth Ashley, "Taste Testers Notice Little Difference 
Between Products," USA Today, September 30, 1996, 
p. 6D. Interested readers may also refer to Fiona Haynes, 
"Do Low-Fat Foods Really Taste Different?", 
http://lowfatcooking.about.com, August 9, 2006. 
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learning 
objectives 

After reading this 
chapter, you should 
be able to: 



• Describe the meaning of a null and an alternative hypothesis. 

• Transform a verbal statement into appropriate null and alternative hypotheses, 
including the determination of whether a two-tail test or a one-tail test is 
appropriate. 

• Describe what is meant by Type I and Type II errors, and explain how these can be 
reduced in hypothesis testing. 

• Carry out a hypothesis test for a population mean or a population proportion, 
interpret the results of the test, and determine the appropriate business 
decision that should be made. 

• Determine and explain the p-value for a hypothesis test. 

• Explain how confidence intervals are related to hypothesis testing. 

• Determine and explain the power curve for a hypothesis test and a given 
decision rule. 

• Determine and explain the operating characteristic curve for a hypothesis test 
and a given decision rule. 



10.1 INTRODUCTION 



In statistics, as in life, nothing is as certain as the presence of uncertainty. How- 
ever, just because we're not 100% sure of something, that's no reason why we 
can't reach some conclusions that are highly likely to be true. For example, if a 
coin were to land heads 20 times in a row, we might be wrong in concluding that 
it's unfair, but we'd still be wise to avoid engaging in gambling contests with its 
owner. In this chapter, we'll examine the very important process of reaching con- 
clusions based on sample information — in particular, of evaluating hypotheses 
based on claims like the following: 

• Titus Walsh, the director of a municipal transit authority, claims that 35% of 
the system's ridership consists of senior citizens. In a recent study, independent 
researchers find that only 23% of the riders observed are senior citizens. 
Should the claim of Walsh be considered false? 

• Jackson T. Backus has just received a railroad car of canned beets from his 
grocery supplier, who claims that no more than 20% of the cans are dented. 
Jackson, a born skeptic, examines a random sample from the shipment and 
finds that 25% of the cans sampled are dented. Has Mr. Backus bought a 
batch of botched beets? 

Each of the preceding cases raises a question of "believability" that can be 
examined by the techniques of this chapter. These methods represent inferential 
statistics, because information from a sample is used in reaching a conclusion 
about the population from which the sample was drawn. 

Null and Alternative Hypotheses 

The first step in examining claims like the preceding is to form a null hypothesis, 
expressed as H 0 ("H sub naught"). The null hypothesis is a statement about the 
value of a population parameter and is put up for testing in the face of numerical 
evidence. The null hypothesis is either rejected or fails to be rejected. 
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The null hypothesis tends to be a "business as usual, nothing out of the ordi- 
nary is happening" statement that practically invites you to challenge its truthful- 
ness. In the philosophy of hypothesis testing, the null hypothesis is assumed to be 
true unless we have statistically overwhelming evidence to the contrary. In other 
words, it gets the benefit of the doubt. 

The alternative hypothesis, H 1 ("H sub one"), is an assertion that holds if the 
null hypothesis is false. For a given test, the null and alternative hypotheses 
include all possible values of the population parameter, so either one or the other 
must be false. 

There are three possible choices for the set of null and alternative hypotheses 
to be used for a given test. Described in terms of an (unknown) population mean 
(ijl), they might be listed as shown below. Notice that each null hypothesis has an 
equality term in its statement (i.e., "=," or "<"). 

Null Alternative 

Hypothesis Hypothesis 

H 0 :ii = $10 Hy- /x * $10 (/a is $10, or it isn't.) 

H 0 :ii>$10 Hj:/i<$10 (fj, is at least $10, or it is less.) 

H 0 : fx < $10 Hy jj, > $10 (fi is no more than $10, or it is more.) 



Directional and Nondirectional Testing 

A directional claim or assertion holds that a population parameter is greater than 
(>), at least (&), no more than (<), or less than (<) some quantity. For example, 
Jackson's supplier claims that no more than 20% of the beet cans are dented. 

A nondirectional claim or assertion states that a parameter is equal to some 
quantity. For example, Titus Walsh claims that 35% of his transit riders are senior 
citizens. 

Directional assertions lead to what are called one-tail tests, where a null 
hypothesis can be rejected by an extreme result in one direction only. A nondirec- 
tional assertion involves a two-tail test, in which a null hypothesis can be rejected 
by an extreme result occurring in either direction. 

Hypothesis Testing and the Nature of the Test 

When formulating the null and alternative hypotheses, the nature, or purpose, of 
the test must also be taken into account. To demonstrate how (1) directionality 
versus nondirectionality and (2) the purpose of the test can guide us toward the 
appropriate testing approach, we will consider the two examples at the beginning 
of the chapter. For each situation, we'll examine (1) the claim or assertion leading 
to the test, (2) the null hypothesis to be evaluated, (3) the alternative hypothesis, 
(4) whether the test will be two-tail or one-tail, and (5) a visual representation of 
the test itself. 

Titus Walsh 

1. Titus' assertion: "35% of the riders are senior citizens." 

2. Null hypothesis: H Q : tt = 0.35, where it = the population proportion. The 
null hypothesis is identical to his statement since he's claimed an exact value 
for the population parameter. 

3. Alternative hypothesis: Hy tt # 0.35. If the population proportion is not 
0.35, then it must be some other value. 
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FIGURE 10.1 



Hypothesis tests can be 
two-tail (a) or one-tail (b), 
depending on the purpose 
of the test. A one-tail test 
can be either left-tail 
(not shown) or 
right-tail (b). 



H„: 77=0.35 H, : ir* 0.35 




Reject H 0 



Reject H 0 



0.35 

Proportion of senior citizens in a random sample of transit riders 
(a) Titus Walsh: "35% of the transit riders are senior citizens" 



# 0 :<jr<0.20 H l :ir>Q20 



■ Do not reject H 0 - 



- Rejec t H 0 - 



0.20 

Proportion of dented containers in a random sample of beet cans 
(b) Jackson Backus' supplier: "No more than 20% of the cans are dented" 



4. A two-tail test is used because the null hypothesis is nondirectional. 

5. As part (a) of Figure 10.1 shows, tt = 0.35 is at the center of the hypothesized 
distribution, and a sample with either a very high proportion or a very low 
proportion of senior citizens would lead to rejection of the null hypothesis. 
Accordingly, there are reject areas at both ends of the distribution. 

Jackson T. Backus 

1. Supplier's assertion: "No more than 20% of the cans are dented." 

2. Null hypothesis: H 0 : tt < 0.20, where tt = the population proportion. In this 
situation, the null hypothesis happens to be the same as the claim that led to 
the test. This is not always the case when the test involves a directional claim 
or assertion. 

3. Alternative hypothesis: H^. tt > 0.20. Jackson's purpose in conducting the 
test is to determine whether the population proportion of dented cans could 
really be greater than 0.20. 

4. A one-tail test is used because the null hypothesis is directional. 

5. As part (b) of Figure 10.1 shows, a sample with a very high proportion of 
dented cans would lead to the rejection of the null hypothesis. A one-tail test 
in which the rejection area is at the right is known as a right-tail test. Note 
that in part (b) of Figure 10.1, the center of the hypothesized distribution is 
identified as tt = 0.20. This is the highest value for which the null hypothesis 
could be true. From Jackson's standpoint, this may be viewed as somewhat 
conservative, but remember that the null hypothesis tends to get the benefit of 
the doubt. 
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A. VERBAL STATEMENT IS AN EQUALITY, "=". 




Example: "Average tire life Qs] 35,000 miles." 

H 0 . il = 35,000 miles 
Hy /x + 35,000 miles 



Categories of verbal 
statements and typical null 
and alternative hypotheses 
for each 



B. VERBAL STATEMENT IS ">" OR "<" (NOT > OR <). 



Example: "Average tire life | is at least] 35,000 miles. 



H 0 . /x > 35,000 miles 
Hy /x < 35,000 miles 



Example: "Average tire life |is no more than | 35,000 miles." 

H 0 . /x < 35,000 miles 
Hy /x > 35,000 miles 



In directional tests, the directionality of the null and alternative hypotheses 
will be in opposite directions and will depend on the purpose of the test. For 
example, in the case of Jackson Backus, Jackson was interested in rejecting 
H 0 : 77 < 0.20 only if evidence suggested it to be higher than 0.20. As we proceed 
with the examples in the chapter, we'll get more practice in formulating null and 
alternative hypotheses for both nondirectional and directional tests. Table 10.1 
offers general guidelines for proceeding from a verbal statement to typical null 
and alternative hypotheses. 



Whenever we reject a null hypothesis, there is a chance that we have made a 
mistake — i.e., that we have rejected a true statement. Rejecting a true null 
hypothesis is referred to as a Type I error, and our probability of making such 
an error is represented by the Greek letter alpha (a). This probability, which is 
referred to as the significance level of the test, is of primary concern in hypoth- 
esis testing. 

On the other hand, we can also make the mistake of failing to reject a false null 
hypothesis — this is a Type II error. Our probability of making it is represented by 
the Greek letter beta (j8). Naturally, if we either fail to reject a true null hypothesis 
or reject a false null hypothesis, we've acted correctly. The probability of rejecting 
a false null hypothesis is called the power of the test, and it will be discussed in 
Section 10.7. The four possibilities are shown in Table 10.2 (page 314). In hypoth- 
esis testing, there is a necessary trade-off between Type I and Type II errors: For a 
given sample size, reducing the probability of a Type I error increases the probabil- 
ity of a Type II error, and vice versa. The only sure way to avoid accepting false 
claims is to never accept any claims. Likewise, the only sure way to avoid rejecting 
true claims is to never reject any claims. Of course, each of these extreme 
approaches is impractical, and we must usually compromise by accepting a reason- 
able risk of committing either type of error. 
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TABLE 10.2 



A summary of the 
possibilities for mistakes 
and correct decisions in 
hypothesis testing. The 
probability of incorrectly 
rejecting a true null 
hypothesis is a, the 
significance level. The 
probability that the test will 
correctly reject a false null 
hypothesis is (1 - /3), the 
power of the test. 



'Do not reject H Q " 



Hypothesis tests says 



'Reject H„" 



THE NULL HYPOTHESIS (H 0 ) IS REALLY 
TRUE FALSE 



Correct decision. 


Incorrect decision 
(Type II error). 
Probability of making 
this error is /3. 


Incorrect decision 
(Type I error). Probability 
of making this error is a, 
the significance level. 


Correct decision. 
Probability (1 - /3) is 
the power of the test. 



10.1 What is the difference between a null hypothesis and 
an alternative hypothesis? Is the null hypothesis always the 
same as the verbal claim or assertion that led to the test? 
Why or why not? 

10.2 For each of the following pairs of null and alterna- 
tive hypotheses, determine whether the pair would be 
appropriate for a hypothesis test. If a pair is deemed 
inappropriate, explain why. 



a. 
b. 
c. 
d. 
c. 

f. 



H, 



M — 10, 
(i = 30, 
fjL > 90, 
M < 75, 
x > 15, 
x = 58, 



Hj: /x < 10 
Hj: /x + 30 
Hj: /x < 90 
Hj: /x < 85 
H t :x< 15 
H^x * 58 



10.3 For each of the following pairs of null and alterna- 
tive hypotheses, determine whether the pair would be 
appropriate for a hypothesis test. If a pair is deemed 
inappropriate, explain why. 



a. 
b. 
c. 
d. 

c. 

f. 



H 0 : it > 0.30, H l 
H, 



H 0 : it = 0.72, 
H 0 : 77 < 0.25, 
H 0 : 77 > 0.48, 
H 0 : 77 < 0.70, 
H Q : p > 0.65, 



77 < 0.35 
77 * 0.72 
77 > 0.25 
77 > 0.48 

77 > 0.70 

p < 0.65 



10.4 The president of a company that manufactures 
central home air conditioning units has told an 
investigative reporter that at least 85% of its 
homeowner customers claim to be "completely 
satisfied" with the overall purchase experience. If the 
reporter were to subject the president's statement to 
statistical scrutiny by questioning a sample of the com- 
pany's residential customers, would the test be one-tail 



or two-tail? What would be the appropriate null and 
alternative hypotheses? 

10.5 On CNN and other news networks, guests often 
express their opinions in rather strong, persuasive, and 
sometimes frightening terms. For example, a scientist 
who strongly believes that global warming is taking place 
will warn us of the dire consequences (such as rising sea 
levels, coastal flooding, and global climate change) she 
foresees if we do not take her arguments seriously. If the 
scientist is correct, and the world does not take her seri- 
ously, would this be a Type I error or a Type II error? 
Briefly explain your reasoning. 

10.6 Many law enforcement agencies use voice-stress 
analysis to help determine whether persons under interro- 
gation are lying. If the sound frequency of a person's 
voice changes when asked a question, the presumption is 
that the person is being untruthful. For this situation, 
state the null and alternative hypotheses in verbal terms, 
then identify what would constitute a Type I error and a 
Type II error in this situation. 

10.7 Following a major earthquake, the city engineer 
must determine whether the stadium is structurally sound 
for an upcoming athletic event. If the null hypothesis is 
"the stadium is structurally sound," and the alternative 
hypothesis is "the stadium is not structurally sound," 
which type of error (Type I or Type II) would the engineer 
least like to commit? 

10.8 A state representative is reported as saying that about 
10% of reported auto thefts involve owners whose cars 
have not really been stolen, but who are trying to defraud 
their insurance company. What null and alternative 
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hypotheses would be appropriate in evaluating the 
statement made by this legislator? 

10.9 In response to the assertion made in Exercise 10.8, 
suppose an insurance company executive were to claim 
the percentage of fraudulent auto theft reports to be "no 
more than 10%." What null and alternative hypotheses 
would be appropriate in evaluating the executive's 
statement? 

10.10 For each of the following statements, formulate 
appropriate null and alternative hypotheses. Indicate 
whether the appropriate test will be one-tail or two-tail, 
then sketch a diagram that shows the approximate loca- 
tion of the "rejection" region(s) for the test. 

a. "The average college student spends no more than 
$300 per semester at the university's bookstore." 

b. "The average adult drinks 1.5 cups of coffee per 
day. " 

c. "The average SAT score for entering freshmen is at 
least 1200." 

d. "The average employee put in 3.5 hours of overtime 
last week." 



10.11 In administering a "field sobriety" test to suspected 
drunks, officers may ask a person to walk in a straight 
line or close his eyes and touch his nose. Define the Type I 
and Type II errors in terms of this setting. Speculate on 
physiological variables (besides the drinking of alcoholic 
beverages) that might contribute to the chance of each 
type of error. 

10.12 In the judicial system, the defense attorney argues 
for the null hypothesis that the defendant is innocent. In 
general, what would be the result if judges instructed 
juries to 

a. never make a Type I error? 

b. never make a Type II error? 

c. compromise between Type I and Type II errors? 

10.13 Regarding the testing of pharmaceutical companies' 
claims that their drugs are safe, a U.S. Food and Drug 
Administration official has said that it's "better to turn 
down 1000 good drugs than to approve one that's 
unsafe." If the null hypothesis is H 0 : "The drug is not 
harmful," what type of error does the official appear to 
favor? 



HYPOTHESIS TESTING: BASIC PROCEDURES 1 0.2 



There are several basic steps in hypothesis testing. They are briefly presented here 
and will be further explained through examples that follow. 

1. Formulate the null and alternative hypotheses. As described in the preceding 
section, the null hypothesis asserts that a population parameter is equal to, no 
more than, or no less than some exact value, and it is evaluated in the face of 
numerical evidence. An appropriate alternative hypothesis covers other possi- 
ble values for the parameter. 

2. Select the significance level. If we end up rejecting the null hypothesis, there's 
a chance that we're wrong in doing so — i.e., that we've made a Type I error. 
The significance level is the maximum probability that we'll make such a mis- 
take. In Figure 10.1, the significance level is represented by the shaded area(s) 
beneath each curve. For two-tail tests, the level of significance is the sum of 
both tail areas. In conducting a hypothesis test, we can choose any signifi- 
cance level we desire. In practice, however, levels of 0.10, 0.05, and 0.01 tend 
to be most common — in other words, if we reject a null hypothesis, the max- 
imum chance of our being wrong would be 10%, 5%, or 1%, respectively. 
This significance level will be used to later identify the critical value(s). 

3. Select the test statistic and calculate its value. For the tests of this chapter, the 
test statistic will be either z or t, corresponding to the normal and t distribu- 
tions, respectively. Figure 10.2 (page 316) shows how the test statistic is selected. 
An important consideration in tests involving a sample mean is whether the 
population standard deviation (a) is known. As Figure 10.2 indicates, the z-test 
(normal distribution and test statistic, z) will be used for hypothesis tests 
involving a sample proportion. 
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FIGURE 10.2 



An overview of the process 
of selecting a test statistic 
for single-sample 
hypothesis testing. Key 
assumptions are reviewed 
in the figure notes. 
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Section 10.3 
Note 1 



Section 10.5 
Note 2 



Section 10.6 
Note 3 



'The z distribution: If the population is not normally distributed, n should be a30 for the central limit theorem to apply. 
The population u is usually not known. 

2 The t distribution: For an unknown a, and when the population is approximately normally distributed, the t-test 
is appropriate regardless of the sample size. As n increases, the normality assumption becomes less important. If 
n < 30 and the population is not approximately normal, nonparametric testing (e.g., the sign test for central tendency, 
in Chapter 14) may be applied. The f-test is "robust" in terms of not being adversely affected by slight departures from 
the population normality assumption. 

3 When mr > 5 and n(1 — it) > 5, the normal distribution is considered to be a good approximation to the binomial dis- 
tribution. If this condition is not met, the exact probabilities must be derived from the binomial distribution. Most prac- 
tical business settings involving proportions satisfy this condition, and the normal approximation is used in this chapter. 
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4. Identify critical value(s) for the test statistic and state the decision rule. The 
critical value(s) will bound rejection and nonrejection regions for the null 
hypothesis, H Q . Such regions are shown in Figure 10.1. They are determined 
from the significance level selected in step 2. In a one-tail test, there will 
be one critical value since H Q can be rejected by an extreme result in just one 
direction. Two-tail tests will require two critical values since H 0 can be 
rejected by an extreme result in either direction. If the null hypothesis 
were really true, there would still be some probability (the significance level, 
a) that the test statistic would be so extreme as to fall into a rejection region. 
The rejection and nonrejection regions can be stated as a decision rule speci- 
fying the conclusion to be reached for a given outcome of the test (e.g., 
"Reject H 0 if z > 1.645, otherwise do not reject"). 

5. Compare calculated and critical values and reach a conclusion about the null 
hypothesis. Depending on the calculated value of the test statistic, it will fall 
into either a rejection region or the nonrejection region. If the calculated 
value is in a rejection region, the null hypothesis will be rejected. Otherwise, 
the null hypothesis cannot be rejected. Failure to reject a null hypothesis does 
not constitute proof that it is true, but rather that we are unable to reject it 
at the level of significance being used for the test. 

6. Make the related business decision. After rejecting or failing to reject the null 
hypothesis, the results are applied to the business decision situation that pre- 
cipitated the test in the first place. For example, Jackson T. Backus may decide 
to return the entire shipment of beets to his distributor. 




10.14 A researcher wants to carry out a hypothesis test 
involving the mean for a sample of size n = 18. She does 
not know the true value of the population standard devi- 
ation, but is reasonably sure that the underlying popula- 
tion is approximately normally distributed. Should she 
use a z- test or a t-test in carrying out the analysis? Why? 

10.15 A research firm claims that 62% of women in the 
40-49 age group save in a 401 (k) or individual retirement 
account. If we wished to test whether this percentage 
could be the same for women in this age group living in 
New York City and selected a random sample of 300 
such individuals from New York, what would be the null 
and alternative hypotheses? Would the test be a z-test or a 
t-test? Why? 

10.16 In hypothesis testing, what is meant by the decision 
rule} What role does it play in the hypothesis-testing 
procedure? 

10.17 A manufacturer informs a customer's design engi- 
neers that the mean tensile strength of its rivets is at 



least 3000 pounds. A test is set up to measure the tensile 
strength of a sample of rivets, with the null and alterna- 
tive hypotheses, H Q : fju > 3000 and H t : /x < 3000. For 
each of the following individuals, indicate whether the 
person would tend to prefer a numerically very high 
(e.g., a = 0.20) or a numerically very low (e.g., 
a = 0.0001) level of significance to be specified for 
the test. 

a. The marketing director for a major competitor of the 
rivet manufacturer. 

b. The rivet manufacturer's advertising agency, which 
has already made the "at least 3000 pounds" claim in 
national ads. 

10.18 It has been claimed that no more than 5% of 
the units coming off an assembly line are defective. 
Formulate a null hypothesis and an alternative 
hypothesis for this situation. Will the test be one-tail 
or two-tail? Why? If the test is one-tail, will it be 
left- tail or right-tail? Why? 
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1 0.3 TESTING A MEAN, POPULATION STANDARD 

DEVIATION KNOWN 



Situations can occur where the population mean is unknown but past experience 
has provided us with a trustworthy value for the population standard deviation. 
Although this possibility is more likely in an industrial production setting, it can 
sometimes apply to employees, consumers, or other nonmechanical entities. 

In addition to the assumption that cr is known, the procedure of this section 
assumes either (1) that the sample size is large (n > 30), or (2) that, if n < 30, the 
underlying population is normally distributed. These assumptions are summarized 
in Figure 10.2. If the sample size is large, the central limit theorem assures us that 
the distribution of sample means will be approximately normally distributed, 
regardless of the shape of the underlying distribution. The larger the sample size, 
the better this approximation becomes. Because it is based on the normal distribu- 
tion, the test is known as the 2-test, and the test statistic is as follows: 



Test statistic, z-test for a sample mean: 




x — flQ 

Z = where a- = 


standard error for the 


a- 

X 


sample mean, = tr/Vn 


X = 


sample mean 


Mo = 


hypothesized population mean 


n = 


sample size 



The symbol /x 0 is the value of that is assumed for purposes of the hypothesis 
test. 

Two-Tail Testing of a Mean, o- Known 







example 





Two-Tail Test 



When a robot welder is in adjustment, its mean time to perform its task is 1.3250 
minutes. Past experience has found the standard deviation of the cycle time to be 
0.0396 minutes. An incorrect mean operating time can disrupt the efficiency of 
other activities along the production line. For a recent random sample of 80 jobs, 
the mean cycle time for the welder was 1.3229 minutes. The underlying data are 
in file CX10WELD. Does the machine appear to be in need of adjustment? 

SOLUTION 

Formulate the Null and Alternative Hypotheses 

H Q ; /a = 1.3250 minutes The machine is in adjustment. 
Jfji /x # 1.3250 minutes The machine is out of adjustment. 

In this test, we are concerned that the machine might be running at a mean speed 
that is either too fast or too slow. Accordingly, the null hypothesis could be 
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rejected by an extreme sample result in either direction. The hypothesized value 
for the population mean is fi Q = 1.3250 minutes, shown at the center of the dis- 
tribution in Figure 10.3. 

Select the Significance Level 

The significance level used will be a = 0.05. If the machine is running properly, 
there is only a 0.05 probability of our making the mistake of concluding that it 
requires adjustment. 

Select the Test Statistic and Calculate Its Value 

The population standard deviation (a) is known and the sample size is large, so 
the normal distribution is appropriate and the test statistic will be z, calculated as 

x - (jl 0 1.3229 - 1.3250 -0.0021 

z = = ^— = = -0.47 

a-* 0.0396/V80 0.00443 

Identify Critical Values for the Test Statistic and State the Decision Rule 

For a two-tail test using the normal distribution and a = 0.05, z = —1.96 and 
z = +1.96 will be the respective boundaries for lower and upper tails of 0.025 
each. These are the critical values for the test, and they identify the rejection and 
nonrejection regions shown in Figure 10.3. The decision rule can be stated as 
"Reject H Q if calculated z < —1.96 or > +1.96, otherwise do not reject." 

Compare Calculated and Critical Values and Reach a Conclusion for the 
Null Hypothesis 

The calculated value, z = —0.47, falls within the nonrejection region of Figure 10.3. 
At the 0.05 level of significance, the null hypothesis cannot be rejected. 

Make the Related Business Decision 

Based on these results, the robot welder is not in need of adjustment. The differ- 
ence between the hypothesized population mean, /jl 0 = 1.3250 minutes, and the 
observed sample mean, x = 1.3229, is judged to have been merely the result of 
chance variation. 
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If we had used the sample information and the techniques of Chapter 9 to construct 
a 95% confidence interval for /jl, the interval would have been 

o- 0.0396 
x ± z^ = 1.3229 ± 1.96 — or from 1.3142 to 1.3316 minutes 

V80 

Notice that the hypothesized value, /jl 0 = 1.3250 minutes, falls within the 95% 
confidence interval — that is, the confidence interval tells us that /jl could be 1.3250 
minutes. This is the same conclusion we get from the nondirectional hypothesis test 
using a = 0.05, and it is not a coincidence. A 100(1 — a)% confidence interval is 
equivalent to a nondirectional hypothesis test at the a level, a relationship that will be 
discussed further in Section 10.4. 



One-Tail Testing of a Mean, o- Known 



Example 



One-Tail Test 

The lightbulbs in an industrial warehouse have been found to have a mean life- 
time of 1030.0 hours, with a standard deviation of 90.0 hours. The warehouse 
manager has been approached by a representative of Extendabulb, a company 
that makes a device intended to increase bulb life. The manager is concerned that 
the average lifetime of Extendabulb-equipped bulbs might not be any greater than 
the 1030 hours historically experienced. In a subsequent test, the manager tests 
40 bulbs equipped with the device and finds their mean life to be 1061.6 hours. 
The underlying data are in file CX10BULB. Does Extendabulb really work? 

SOLUTION 

Formulate the Null and Alternative Hypotheses 

The warehouse manager's concern that Extendabulb-equipped bulbs might not be 
any better than those used in the past leads to a directional test. Accordingly, the 
null and alternative hypotheses are: 

H Q : /jl < 1030.0 hours Extendabulb is no better than the present system. 
H^. /jl > 1030.0 hours Extendabulb really does increase bulb life. 

At the center of the hypothesized distribution will be the highest possible value for 
which H Q could be true, /jl 0 = 1030.0 hours. 

Select the Significance Level 

The level chosen for the test will be a = 0.05. If Extendabulb really has no favor- 
able effect, the maximum probability of our mistakenly concluding that it does 
will be 0.05. 

Select the Test Statistic and Calculate Its Value 

As in the previous test, the population standard deviation (a) is known and the 
sample size is large, so the normal distribution is appropriate and the test statistic 
will be z. It is calculated as 

x - 1061.6 - 1030.0 „ _ 
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H 0 : fj,< 1030 hours 
ffjl ft > 1030 hours 



Do not reject H 0 



/j.q = 1030 hours 



Reject H 0 — >- 
r- Area = 0.05 



Z = +1.645 



Test statistic: 



n 



■-2.22 



FIGURE 10.4 



The warehouse manager 
is concerned that 
Extendabulb might not 
increase the lifetime of 
lightbulbs. This right-tail 
test at the 0.05 level 
suggests otherwise. 



Select the Critical Value for the Test Statistic and State the Decision Rule 

For a right-tail z-test in which a = 0.05, z = +1.645 will be the boundary separat- 
ing the nonrejection and rejection regions. This critical value for the test is included 
in Figure 10.4. The decision rule can be stated as "Reject H 0 if calculated 
z > +1.645, otherwise do not reject." 

Compare Calculated and Critical Values and Reach a Conclusion for the 
Null Hypothesis 

The calculated value, z = +2.22, falls within the rejection region of the diagram 
in Figure 10.4. At the 0.05 level of significance, the null hypothesis is rejected. 

Make the Related Business Decision 

The results suggest that Extendabulb does increase the mean lifetime of the bulbs. 
The difference between the mean of the hypothesized distribution, /jl 0 = 1030.0 
hours, and the observed sample mean, x = 1061.6, is judged too great to have 
occurred by chance. The firm may wish to incorporate Extendabulb into its ware- 
house lighting system. 

Other Levels of Significance 

This test was conducted at the 0.05 level, but would the conclusion have been dif- 
ferent if other levels of significance had been used instead? Consider the following 
possibilities: 

• For the 0.05 level of significance at which the test was conducted. The critical 
z is +1.645, and the calculated value, z = 2.22, exceeds it. The null hypothe- 
sis is rejected, and we conclude that Extendabulb does increase bulb life. 

• For the 0.025 level of significance. The critical z is +1.96, and the calculated 
value, z = 2.22, exceeds it. The null hypothesis is rejected, and we again con- 
clude that Extendabulb increases bulb life. 

• For the 0.005 level of significance. The critical z is +2.58, and the calculated 
value, z = 2.22, does not exceed it. The null hypothesis is not rejected, and we 
conclude that Extendabulb does not increase bulb life. 
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As these possibilities suggest, using different levels of significance can lead to 
quite different conclusions. Although the primary purpose of this exercise was to 
give you a little more practice in hypothesis testing, consider these two key ques- 
tions: (1) If you were the manufacturer of Extendabulb, which level of signifi- 
cance would you prefer to use in evaluating the test results? (2) On which level of 
significance might the manufacturer of a competing product wish to rely in dis- 
cussing the Extendabulb test? We will now examine these questions in the context 
of describing the p-value method for hypothesis testing. 

The p-value Approach to Hypothesis Testing 

There are two basic approaches to conducting a hypothesis test: 

• Using a predetermined level of significance, establish critical value(s), then see 
whether the calculated test statistic falls into a rejection region for the test. This 
is similar to placing a high-jump bar at a given height, then seeing whether you 
can clear it. 

• Determine the exact level of significance associated with the calculated value 
of the test statistic. In this case, we're identifying the most extreme critical 
value that the test statistic would be capable of exceeding. This is equivalent 
to your jumping as high as you can with no bar in place, then having 
the judges tell you how high you would have cleared if there had been a 
crossbar. 

In the two tests carried out previously, we used the first of these approaches, 
making the hypothesis test a "yes-no" decision. In the Extendabulb example, 
however, we did allude to what we're about to do here by trying several different 
significance levels in our one-tail test examining the ability of Extendabulb to 
increase the lifetime of lightbulbs. 

We saw that Extendabulb showed a significant improvement at the 0.05 and 
0.025 levels, but was not shown to be effective at the 0.005 level. In our high- 
jumping analogy, we might say that Extendabulb "cleared the bar" at the 0.05 
level, cleared it again when it was raised to the more demanding 0.025 level, but 
couldn't quite make the grade when the bar was raised to the very demanding 
0.005 level of significance. In summary: 

• 0.05 level Extendabulb significantly increases bulb life (e.g., "clears the 
high-jump bar"). 

• 0.025 level Extendabulb significantly increases bulb life ("clears the bar"). 

• p-value level Extendabulb just barely shows significant improvement in bulb 
life ("clears the bar, but lightly touches it on the way over"). 

• 0.005 level Extendabulb shows no significant improvement in bulb life 
("insufficient height, fails to clear"). 

As suggested by the preceding, and illustrated in part (a) of Figure 10.5, there 
is some level of significance (the /7-value) where the calculated value of the test 
statistic is exactly the same as the critical value. For a given set of data, the p-value 
is sometimes referred to as the observed level of significance. It is the lowest pos- 
sible level of significance at which the null hypothesis can be rejected. (Note: The 
lowercase p in "p-value" is not related to the symbol for the sample proportion.) 
For the Extendabulb test, the calculated value of the test statistic was z = 2.22. 
For a critical z = +2.22, the right-tail area can be found using the normal distri- 
bution table at the back of the book. 
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p-value = 0.0132 



/xq = 1030 hours 



(a) p-value for one-tail (Extendabulb) example of Figure 10.4 



Test statistic: 
z = 2.22 



FIGURE 10.5 



The p-value of a test is 
the level of significance 
where the observed value 
of the test statistic is 
exactly the same as a 
critical value for that level. 
These diagrams show the 
p-values, as calculated in 
the text, for two of the 
tests performed in this 
section. When the 
hypothesis test is two-tail, 
as in part (b), the p-value is 
the sum of two tail areas. 




p-value/2 = 0.3192 



p-value = 2(0.3192) = 0.6384 
p-value/2 = 0.3192 



/x 0 = 1.3250 minutes 
Test statistic: 



Z = +0.47 



z = -0.47 

(b) p-value for two-tail (robot welder) example of Figure 10.3 



Referring to the normal distribution table, we see that 2.22 standard error units 
to the right of the mean includes an area of 0.4868, leaving (0.5000 — 0.4868), or 
0.0132, in the right-tail area. This identifies the most demanding level of signifi- 
cance that Extendabulb could have achieved. If we had originally specified a 
significance level of 0.0132 for our test, the critical value for z would have been 
exactly the same as the value calculated. Thus, the p-value for the Extendabulb test 
is found to be 0.0132. 

The Extendabulb example was a one-tail test — accordingly, the p-value was 
the area in just one tail. For two-tail tests, such as the robot welder example of 
Figure 10.3, the p-value will be the sum of both tail areas, as shown in part (b) of 
Figure 10.5. The calculated test statistic was z = —0.47, resulting in an area 
of (0.5000 - 0.1808), or 0.3192, in the left tail of the distribution. Since the 
robot welder test was two-tail, the 0.3192 must be multiplied by 2 to get the 
p-value of 0.6384. 
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Computer-Assisted Hypothesis Tests and p-values 

When the hypothesis test is computer-assisted, the output will include a p-value 
for your interpretation. Regardless of whether a p-value has been approximated 
by your own calculations and table reference, or is a more exact value included in 
a computer printout, it can be interpreted as follows: 



Reject the null hypothesis. The sample result 
is more extreme than you would have been 
willing to attribute to chance. 



Do not reject the null hypothesis. The 

sample result is not more extreme than you 
would have been willing to attribute to chance. 



Interpreting the p-value in a computer printout: 

Yes — 




Is the p-value < your specified level 
of significance, a! 




No 



Computer Solutions 10.1 shows how we can use Excel or Minitab to carry out 
a hypothesis test for the mean when the population standard deviation is known or 
assumed. In this case, we are replicating the hypothesis test in Figure 10.4, using the 
40 data values in file CX10BULB. The printouts in Computer Solutions 10.1 show 
the p-value (0.0132) for the test. This p-value is essentially making the following 
statement: "If the population mean really is 1030 hours, there is only a 0.0132 
probability of getting a sample mean this large (1061.6 hours) just by chance." 
Because the p-value is less than the level of significance we are using to reach our 
conclusion (i.e., p-value = 0.0132 is < a = 0.05), H Q : /jl < 1030 is rejected. 



computer solutions 10.1 



Hypothesis Test for Population Mean, a Known 

These procedures show how to carry out a hypothesis test for the population mean when the population 
standard deviation is known. 



EXCEL 





A 


B 


C 


D 


1 


Z-Test: Mean 






2 








hours 


3 


Mean 






1061.61 


4 


Standard Deviation 




93.60 


5 


Observations 




40 


6 


Hypothesized Mean 




1030 


7 


SIGMA 






90 


8 


z Stat 






2.221 


9 


P(Z<=z) one-tail 




0.0132 


10 


z Critical one-tail 




1.645 


11 


P(Z<=z) two-tail 




0.0264 


12 


z Critical two-tail 




1.96 
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Excel hypothesis test for fi based on raw data and tr known 

1 . For example, for the 40 bulb lifetimes (file CX10BULB.XLS) on which Figure 1 0.4 is based, with the label and 40 data 
values in A1:A41: Click Tools. Click Data Analysis Plus. Click Z-Test: Mean. Click OK. 

2. Enter A1:A41 into the Input Range box. Enter the hypothesized mean (1030) into the Hypothesized Mean box. 
Enter the known population standard deviation (90.0) into the Standard Deviation (SIGMA) box. Click Labels, since 
the variable name is in the first cell within the field. Enter the level of significance for the test (0.05) into the Alpha 
box. Click OK. The printout includes the p-value for this one-tail test, 0.0132. 

Excel hypothesis test for fi based on summary statistics and tr known 

1. For example, with x = 1061.6, a = 90.0, and n = 40, as in Figure 10.4: Open the TEST STATISTICS.XLS workbook, 
supplied with the text. 

2. Using the arrows at the bottom left, select the z-Test_Mean worksheet. Enter the sample mean (1061 .6), the known sigma 
(90.0), the sample size (40), the hypothesized population mean (1030), and the level of significance for the test (0.05). 

(Note: As an alternative, you can use Excel worksheet template TMZTEST.XLS, supplied with the text. The steps are 
described within the template.) 

MINITAB 

Minitab hypothesis test for fi based on raw data and <r known 
One-Sample Z: hours 

Test of mu = 1030 vs > 1030 

The assumed standard deviation = 90 

95% 
Lower 

Variable N Mean StDev SE Mean Bound Z P 

hours 40 1061.6 93.6 14.2 1038.2 2.22 0.013 

1 . For example, using the data (file CX10BULB.MTW) on which Figure 10.4 is based, with the 40 data values in column C1: 
Click Stat. Select Basic Statistics. Click 1-Sample Z. 

2. Select Samples in Columns and enter C1 into the box. Enter the known population standard deviation (90.0) into 
the Standard deviation box. Select Perform hypothesis test and enter the hypothesized population mean (1030) 
into the Hypothesized mean: box. 

3. Click Options. Enter the desired confidence level as a percentage (95.0) into the Confidence Level box. Within the 
Alternative box, select greater than. Click OK. Click OK. By default, this test also provides the lower boundary of 
the 95% confidence interval (unless another confidence level has been specified). 

Minitab hypothesis test for fi based on summary statistics and tr known 

Follow the procedure in steps 1 through 3, above, but in step 2 select Summarized data and enter 40 and 1061.6 into 
the Sample size and Mean boxes, respectively. 







exercises 










w 






'Ms \ 


















_ 




f 






If r% 


__ 





10.19 What is the central limit theorem, and how is it necessary to use the z-statistic in carrying out a hypothesis 
applicable to hypothesis testing? test for the population mean? 

10.20 If the population standard deviation is known, 10.21 What is a p-value, and how is it relevant to 
but the sample size is less than 30, what assumption is hypothesis testing? 
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10.22 The p-value for a hypothesis test has been reported 
as 0.03. If the test result is interpreted using the a = 0.05 
level of significance as a criterion, will H 0 be rejected? 
Explain. 

10.23 The p-value for a hypothesis test has been reported 
as 0.04. If the test result is interpreted using the a = 0.01 
level of significance as a criterion, will H 0 be rejected? 
Explain. 

10.24 A hypothesis test is carried out using the a = 0.01 
level of significance, and H Q cannot be rejected. What 

is the most accurate statement we can make about the 
p-value for this test? 

10.25 For each of the following tests and z values, 
determine the p-value for the test: 

a. Right-tail test and z = 1.54 

b. Left-tail test and z = -1.03 

c. Two-tail test and z = —1.83 

10.26 For each of the following tests and z values, deter- 
mine the p-value for the test: 

a. Left-tail test and z = —1.62 

b. Right-tail test and z = 1.43 

c. Two-tail test and z = 1.27 

10.27 For a sample of 35 items from a population for 
which the standard deviation is <x = 20.5, the sample 
mean is 458.0. At the 0.05 level of significance, test 
H Q : /jl = 450 versus H^. fi + 450. Determine and 
interpret the p-value for the test. 

10.28 For a sample of 12 items from a normally 
distributed population for which the standard deviation 
is a = 17.0, the sample mean is 230.8. At the 0.05 level 
of significance, test H 0 : fju < 220 versus Hj: fj, > 220. 
Determine and interpret the p-value for the test. 

10.29 A quality-assurance inspector periodically 
examines the output of a machine to determine whether 
it is properly adjusted. When set properly, the machine 
produces nails having a mean length of 2.000 inches, 
with a standard deviation of 0.070 inches. For a sample 
of 35 nails, the mean length is 2.025 inches. Using the 
0.01 level of significance, examine the null hypothesis 
that the machine is adjusted properly. Determine and 
interpret the p-value for the test. 

10.30 In the past, patrons of a cinema complex have 
spent an average of $2.50 for popcorn and other snacks, 
with a standard deviation of $0.90. The amounts of these 
expenditures have been normally distributed. Following 
an intensive publicity campaign by a local medical 
society, the mean expenditure for a sample of 18 patrons 
is found to be $2.10. In a one-tail test at the 0.05 level 

of significance, does this recent experience suggest a 
decline in spending? Determine and interpret the p-value 
for the test. 



10.31 Following maintenance and calibration, an extru- 
sion machine produces aluminum tubing with a mean out- 
side diameter of 2.500 inches, with a standard deviation 
of 0.027 inches. As the machine functions over an 
extended number of work shifts, the standard deviation 
remains unchanged, but the combination of accumulated 
deposits and mechanical wear causes the mean diameter 
to "drift" away from the desired 2.500 inches. For a 
recent random sample of 34 tubes, the mean diameter was 
2.509 inches. At the 0.01 level of significance, does the 
machine appear to be in need of maintenance and calibra- 
tion? Determine and interpret the p-value for the test. 

10.32 A manufacturer of electronic kits has found that the 
mean time required for novices to assemble its new circuit 
tester is 3 hours, with a standard deviation of 0.20 hours. 
A consultant has developed a new instructional booklet 
intended to reduce the time an inexperienced kit builder 
will need to assemble the device. In a test of the effective- 
ness of the new booklet, 15 novices require a mean of 
2.90 hours to complete the job. Assuming the population 
of times is normally distributed, and using the 0.05 level 
of significance, should we conclude that the new booklet 

is effective? Determine and interpret the p-value for the 
test. 

/ data Set / Note: Exercises 10.33 and 10.34 require 
a computer and statistical software. 

10.33 According to Remodeling magazine, the average 
cost to convert an existing room into a home office with 
custom cabinetry and rewiring for electronic equipment is 
$5976. Assuming a population standard deviation of 
$1000 and the sample of home office conversion prices 
charged for 40 recent jobs performed by builders in a 
region of the United States, examine whether the mean 
price for home office conversions for builders in this 
region might be different from the average for the nation 
as a whole. The underlying data are in file XR10033. 
Identify and interpret the p-value for the test. Using the 
0.025 level of significance, what conclusion will be 
reached? source: National Association of Homebuilders, 1998 Housing 
Facts, Figures, and Trends, p. 38. 

10.34 A machine that fills shipping containers with drive- 
way filler mix is set to deliver a mean fill weight of 70.0 
pounds. The standard deviation of fill weights delivered 
by the machine is known to be 1.0 pounds. For a recent 
sample of 35 containers, the fill weights are listed in data 
file XR10034. Using the mean for this sample, and assum- 
ing that the population standard deviation has remained 
unchanged at 1.0 pounds, examine whether the mean fill 
weight delivered by the machine might now be something 
other than 70.0 pounds. Identify and interpret the p-value 
for the test. Using the 0.05 level of significance, what 
conclusion will be reached? 
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CONFIDENCE INTERVALS AND HYPOTHESIS TESTING 1 0.4 



In Chapter 9, we constructed confidence intervals for a population mean or pro- 
portion. In this chapter, we sometimes carry out nondirectional tests for the null 
hypothesis that the population mean or proportion could have a given value. 
Although the purposes may differ, the concepts are related. 

In the previous section, we briefly mentioned this relationship in the context of 
the nondirectional test summarized in Figure 10.3. Consider this nondirectional 
test, carried out at the a = 0.05 level: 

1. Null and alternative hypotheses: H Q : /i, = 1.3250 minutes and H^. /jl ¥= 1.3250 
minutes. 

2. The standard error of the mean: a- = a/Vn = 0.0396/V80, or 0.00443 
minutes. 

3. The critical z values for a two-tail test at the a = 0.05 level are z = —1.96 
and z = +1.96. 

4. Expressing these z values in terms of the sample mean, critical values for x 
would be calculated as 1.325 ± 1.96(0.00443), or 1.3163 minutes and 1.3337 
minutes. 

5. The observed sample mean was x = 1.3229 minutes. This fell within the 
acceptable limits and we were not able to reject H Q . 

Based on the a = 0.05 level, the nondirectional hypothesis test led us to con- 
clude that H Q : /x = 1.3250 minutes was believable. The observed sample mean 
(1.3229 minutes) was close enough to the 1.3250 hypothesized value that the dif- 
ference could have happened by chance. 

Now let's approach the same situation by using a 95% confidence interval. As 
noted previously, the standard error of the sample mean is 0.00443 minutes. Based on 
the sample results, the 95% confidence interval for /jl is 1.3229 ± 1.96(0.00443), or 
from 1.3142 minutes to 1.3316 minutes. In other words, we have 95% confidence 
that the population mean is somewhere between 1.3142 minutes and 1.3316 minutes. 
If someone were to suggest that the population mean were actually 1.3250 minutes, 
we would find this believable, since 1.3250 falls within the likely values for /jl that 
our confidence interval represents. 

The nondirectional hypothesis test was done at the a = 0.05 level, the confi- 
dence interval was for the 95% confidence level, and the conclusion was the same 
in each case. As a general rule, we can state that the conclusion from a nondirec- 
tional hypothesis test for a population mean at the a level of significance will be 
the same as the conclusion based on a confidence interval at the 100(1 — a)% 
confidence level. 

When a hypothesis test is nondirectional, this equivalence will be true. This 
exact statement cannot be made about confidence intervals and directional tests — 
although they can also be shown to be related, such a demonstration would take 
us beyond the purposes of this chapter. Suffice it to say that confidence intervals 
and hypothesis tests are both concerned with using sample information to make a 
statement about the (unknown) value of a population mean or proportion. Thus, 
it is not surprising that their results are related. 

By using Seeing Statistics Applet 12, at the end of the chapter, you can see how 
the confidence interval (and the hypothesis test conclusion) would change in 
response to various possible values for the sample mean. 
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10.35 Based on sample data, a confidence interval has 
been constructed such that we have 90% confidence that 
the population mean is between 120 and 180. Given this 
information, provide the conclusion that would be 
reached for each of the following hypothesis tests at 

the a = 0.10 level: 

a. H Q : /jl = 170 versus H^: /jl # 170 

b. H Q : /jl = 110 versus H^: /jl + 110 

c. Hq. /jl = 130 versus H^. (jl # 130 

d. H 0 : n = 200 versus H t : + 200 

10.36 Given the information in Exercise 10.27, construct 
a 95% confidence interval for the population mean, then 
reach a conclusion regarding whether /x could actually 



be equal to the value that has been hypothesized. How 
does this conclusion compare to that reached in 
Exercise 10.27? Why? 

10.37 Given the information in Exercise 10.29, construct 
a 99% confidence interval for the population mean, then 
reach a conclusion regarding whether (jl could actually 
be equal to the value that has been hypothesized. How 
does this conclusion compare to that reached in 
Exercise 10.29? Why? 

10.38 Use an appropriate confidence interval in reaching 
a conclusion regarding the problem situation and null 
hypothesis for Exercise 10.31. 



1 0.5 TESTING A MEAN, POPULATION STANDARD 

DEVIATION UNKNOWN 



The true standard deviation of a population will usually be unknown. As Fig- 
ure 10.2 shows, the t-test is appropriate for hypothesis tests in which the sample 
standard deviation (s) is used in estimating the value of the population standard 
deviation, o\ The i-test is based on the t distribution (with number of degrees of 
freedom, df = n — 1 ) and the assumption that the population is approximately 
normally distributed. As the sample size becomes larger, the assumption of popu- 
lation normality becomes less important. 

As we observed in Chapter 9, the t distribution is a family of distributions 
(one for each number of degrees of freedom, df). When dfis small, the t distribu- 
tion is flatter and more spread out than the normal distribution, but for larger 
degrees of freedom, successive members of the family more closely approach the 
normal distribution. As the number of degrees of freedom approaches infinity, the 
two distributions become identical. 

Like the z-test, the Mest depends on the sampling distribution for the sample 
mean. The appropriate test statistic is similar in appearance, but includes s instead 
of a, because s is being used to estimate the (unknown) value of a. The test statis- 
tic can be calculated as follows: 

Test statistic, £-test for a sample mean: 

where s- = estimated standard error for the 
sample mean, = slVn 
x = sample mean 
/i- 0 = hypothesized population mean 
n = sample size 
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Two-Tail Testing of a Mean, o- Unknown 



Example 




Two-Tail Test 

The credit manager of a large department store claims that the mean balance for 
the store's charge account customers is $410. An independent auditor selects a 
random sample of 18 accounts and finds a mean balance of x = $511.33 and 
a standard deviation of s = $183.75. The sample data are in file CX10CRED. If 
the manager's claim is not supported by these data, the auditor intends to exam- 
ine all charge account balances. If the population of account balances is 
assumed to be approximately normally distributed, what action should the 
auditor take? 



SOLUTION 

Formulate the Null and Alternative Hypotheses 

H Q : /jl = $410 The mean balance is actually $410. 
iL: jju # $410 The mean balance is some other value. 

In evaluating the manager's claim, a two-tail test is appropriate since it is a nondi- 
rectional statement that could be rejected by an extreme result in either direction. 
The center of the hypothesized distribution of sample means for samples of 
n = 18 will be /i 0 = $410. 

Select the Significance Level 

For this test, we will use the 0.05 level of significance. The sum of the two tail 
areas will be 0.05. 

Select the Test Statistic and Calculate Its Value 

The test statistic is t = (x — /jl 0 )/s-, and the t distribution will be used to describe 
the sampling distribution of the mean for samples of n = 18. The center of the 
distribution is /jl 0 = $410, which corresponds to t = 0.000. Since the population 
standard deviation is unknown, s is used to estimate a. The sampling distribution 
has an estimated standard error of 

$183 - 75 $43.31 



'n V18 
and the calculated value of t will be 



_ x - fjL 0 _ $511.33 - $410.00 _ 
f ~ s- ~ $43.31 



Identify Critical Values for the Test Statistic and State the Decision Rule 

For this test, a = 0.05, and the number of degrees of freedom will be 
df = (n — 1), or (18 — 1) = 17. The t distribution table at the back of the book 
provides one-tail areas, so we must identify the boundaries where each tail area is 
one-half of a, or 0.025. Referring to the 0.025 column and 17th row of the table, 
the critical values for the test statistic are found to be t = —2.110 and 
t = +2.110. (Although the "—2.110" is not shown in the table, we can identify 
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this as the left-tail boundary because the distribution is symmetrical.) The rejec- 
tion and nonrejection areas are shown in Figure 10.6, and the decision rule can be 
stated as "Reject H Q if the calculated t is either < —2.110 or > +2.110, other- 
wise do not reject." 

Compare the Calculated and Critical Values and Reach a Conclusion 
for the Null Hypothesis 

The calculated test statistic, t = 2.340, exceeds the upper boundary and falls into 
this rejection region. H Q is rejected. 

Make the Related Business Decision 

The results suggest that the mean charge account balance is some value other than 
$410. The auditor should proceed to examine all charge account balances. 



One-Tail Testing of a Mean, <r Unknown 



example 



One-Tail Test 

The Chekzar Rubber Company, in financial difficulties because of a poor rep- 
utation for product quality, has come out with an ad campaign claiming that the 
mean lifetime for Chekzar tires is at least 60,000 miles in highway driving. Skep- 
tical, the editors of a consumer magazine purchase 36 of the tires and test 
them in highway use. The mean tire life in the sample is x = 58,341.69 miles, 
with a sample standard deviation of s = 3632.53 miles. The sample data are in 
file CX10CHEK. 
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SOLUTION 

Formulate the Null and Alternative Hypotheses 

Because of the directional nature of the ad claim and the editors' skepticism regard- 
ing its truthfulness, the null and alternative hypotheses are 

H Q : /jl > 60,000 miles The mean tire life is at least 60,000 miles. 
H^. jju < 60,000 miles The mean tire life is under 60,000 miles. 

Select the Significance Level 

For this test, the significance level will be specified as 0.01. 
Select the Test Statistic and Calculate Its Value 

The test statistic is t = (x — /i 0 )/s-, and the t distribution will be used to describe 
the sampling distribution of the mean for samples of n = 36. The center of the 
distribution is the lowest possible value for which H Q could be true, or 
/jl q = 60,000 miles. Since the population standard deviation is unknown, s is used 
to estimate o\ The sampling distribution has an estimated standard error of 

s 3632.53 miles .„ ., 

— = 605.42 miles 



-2.739 



f n V36 

and the calculated value of t will be 

_ x - fi 0 _ 58,341.69 - 60,000.00 
*~ s- ~ 605.42 



Identify the Critical Value for the Test Statistic and State the Decision Rule 

For this test, a has been specified as 0.01. The number of degrees of freedom is 
df = (n — 1), or (36 — 1) = 35. The t distribution table is now used in finding 
the value of t that corresponds to a one-tail area of 0.01 and df = 35 degrees 
of freedom. Referring to the 0.01 column and 35th row of the table, this critical 
value is found to be t = —2.438. (Although the value listed is positive, remem- 
ber that the distribution is symmetrical, and we are looking for the left-tail 
boundary.) The rejection and nonrejection regions are shown in Figure 10.7, and 



FIGURE 10.7 




The Chekzar Rubber 
Company has claimed that, 
in highway use, the mean 
lifetime of its tires is at least 
60,000 miles. At the 0.01 
level in this left-tail test, the 
claim is not supported. 



1 



,1*0= 60,000 miles 

t = -2.438 



Test statistic: 
t = -2.739 
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the decision rule can be stated as "Reject H Q if the calculated t is less than 
—2.438, otherwise do not reject." 

Compare the Calculated and Critical Values and Reach a Conclusion 
for the Null Hypothesis 

The calculated test statistic, t = —2.739, is less than the critical value, t = —2.438, 
and falls into the rejection region of the test. The null hypothesis, H Q : /jl > 60,000 
miles, must be rejected. 

Make the Related Business Decision 

The test results support the editors' doubts regarding Chekzar's ad claim. The 
magazine may wish to exert either readership or legal pressure on Chekzar to 
modify its claim. 



Compared to the i-test, the z-test is a little easier to apply if the analysis is carried 
out by pocket calculator and references to a statistical table. (There are lesser 
"gaps" between areas listed in the normal distribution table compared to values 
provided in the t table.) Also, courtesy of the central limit theorem, results can be 
fairly satisfactory when n is large and s is a close estimate of cr. 

Nevertheless, the i-test remains the appropriate procedure whenever cr is 
unknown and is being estimated by s. In addition, this is the method you will 
either use or come into contact with when dealing with computer statistical pack- 
ages handling the kinds of analyses in this section. For example, with Excel, 
Minitab, SYSTAT, SPSS, SAS, and others, we can routinely (and correctly) apply 
the £-test whenever s has been used to estimate cr. 

An important note when using statistical tables to determine p-values: For 
i-tests, the p-value can't be determined as exactly as with the z-test, because the 
t table areas include greater "gaps" (e.g., the 0.005, 0.01, 0.025 columns, and so 
on). However, we can narrow down the £-test p-value to a range, such as 
"between 0.01 and 0.025." 

For example, in the Chekzar Rubber Company Mest of Figure 10.7, the cal- 
culated t statistic was t = —2.739. We were able to reject the null hypothesis at 
the 0.01 level (critical value, t = —2.438), and would also have been able to reject 
Hq at the 0.005 level (critical value, t = —2.724). Based on the t table, the most 
accurate conclusion we can reach is that the p-value for the Chekzar test is less 
than 0.005. Had we used the computer in performing this test, we would have 
found the actual p-value to be 0.0048. 

Computer Solutions 10.2 shows how we can use Excel or Minitab to carry out 
a hypothesis test for the mean when the population standard deviation is unknown. 
In this case, we are replicating the hypothesis test shown in Figure 10.6, using the 
18 data values in file CX10CRED. The printouts in Computer Solutions 10.2 show 
the p-value (0.032) for the test. This p-value represents the following statement: 
"If the population mean really is $410, there is only a 0.032 probability of getting 
a sample mean this far away from $410 just by chance." Because the p-value is less 
than the level of significance we are using to reach a conclusion (i.e., p-value = 
0.032 is < a = 0.05), H Q : /jl = $410 is rejected. 

In the Minitab portion of Computer Solutions 10.2, the 95% confidence inter- 
val is shown as $420.0 to $602.7. The hypothesized population mean ($410) does 
not fall within the 95% confidence interval; thus, at this confidence level, the 
results suggest that the population mean is some value other than $410. This same 
conclusion was reached in our two-tail test at the 0.05 level of significance. 
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Hypothesis Test for Population Mean, o- Unknown 

These procedures show how to carry out a hypothesis test for the population mean when the population 
standard deviation is unknown. 



EXCEL 





A 


B 


C 


D 


1 


t-Test: Mean 






2 










3 








balance 


4 


Mean 






511.33 


5 


Standard Deviation 




183.750 


6 


Hypothesized Mean 




410 


7 


df 






17 


8 


t Stat 






2.3396 


9 


P(T<=t) one-tail 




0.0159 


10 


t Critical one-tail 




1 .7396 


11 


P(T<=t) two-tail 




0.0318 


12 


t Critical two-tail 




2.1098 



Excel hypothesis test for fi based on raw data and tr unknown 

1. For example, for the credit balances (file CX10CRED.XLS) on which Figure 10.6 is based, with the label and 18 data 
values in A1:A19: Click Tools. Click Data Analysis Plus. Click t-Test: Mean. Click OK. 

2. Enter A1:A19 into the Input Range box. Enter the hypothesized mean (410) into the Hypothesized Mean box. Click 
Labels. Enter the level of significance for the test (0.05) into the Alpha box. Click OK. The printout shows the p-value 
for this two-tail test, 0.0318. 

Excel hypothesis test for fi based on summary statistics and tr unknown 

1. For example, with x = 51 1 .33, s = 183.75, and n = 18, as in Figure 10.6: Open the TEST STATISTICS.XLS workbook, 
supplied with the text. 

2. Using the arrows at the bottom left, select the t-Test_Mean worksheet. Enter the sample mean (511.33), the sample 
standard deviation (183.75), the sample size (18), the hypothesized population mean (410), and the level of signifi- 
cance for the test (0.05). 

(Note: As an alternative, you can use Excel worksheet template TMTTEST.XLS, supplied with the text. The steps are 
described within the template.) 

MINITAB 

Minitab hypothesis test for fi based on raw data and <r unknown 
One-Sample T: balance 

Test of mu = 410 vs not = 410 

Variable N Mean StDev SE Mean 95% CI TP 

balance 18 511.3 183.8 43.3 (420.0, 602.7) 2.34 0.032 

1 . For example, using the data (file CX10CRED.MTW) on which Figure 10.6 is based, with the 18 data values in column C1: 
Click Stat. Select Basic Statistics. Click 1-Sample t. 

2. Select Samples in Columns and enter C1 into the box. Select Perform hypothesis test and enter the hypothesized 
population mean (410) into the Hypothesized mean: box. 



(continued) 
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3. Click Options. Enter the desired confidence level as a percentage (95.0) into the Confidence Level box. Within the 
Alternative box, select not equal. Click OK. Click OK. 

Minitab hypothesis test for ft based on summary statistics and <t unknown 

Follow the procedure in steps 1 through 3, above, but in step 1 select Summarized data and enter 18, 511.33, and 
183.75 into the Sample size. Mean, and Standard deviation boxes, respectively. 



exercises 




p i 


















10.39 Under what circumstances should the f-statistic be 
used in carrying out a hypothesis test for the population 
mean? 

10.40 For a simple random sample of 40 items, x = 25.9 
and s = 4.2. At the 0.01 level of significance, test 

H Q : /x = 24.0 versus H t : ^ + 24.0. 

10.41 For a simple random sample of 15 items from a 
population that is approximately normally distributed, 
x = 82.0 and s = 20.5. At the 0.05 level of significance, 
test H Q : fji > 90.0 versus Hy fjt < 90.0. 

10.42 The average age of passenger cars in use in the 
United States is 9.0 years. For a simple random sample 
of 34 vehicles observed in the employee parking area of a 
large manufacturing plant, the average age is 10.3 years, 
with a standard deviation of 3.1 years. At the 0.01 level of 
significance, can we conclude that the average age of cars 
driven to work by the plant's employees is greater than the 
national average? source: www.polk.com, August 9, 2006. 

10.43 The average length of a flight by regional airlines 
in the United States has been reported as 299 miles. If a 
simple random sample of 30 flights by regional airlines 
were to have x = 314.6 miles and s = 42.8 miles, would 
this tend to cast doubt on the reported average of 

299 miles? Use a two-tail test and the 0.05 level of 
significance in arriving at your answer, source: Bureau of 
the Census, Statistical Abstract of the United States 2002, p. 665. 

10.44 The International Coffee Association has reported 
the mean daily coffee consumption for U.S. residents as 
1.65 cups. Assume that a sample of 38 people from a 
North Carolina city consumed a mean of 1.84 cups of 
coffee per day, with a standard deviation of 0.85 cups. In a 
two-tail test at the 0.05 level, could the residents of this 
city be said to be significantly different from their counter- 
parts across the nation? source: www.coffeeresearch.org, 
August 8, 2006. 

10.45 Taxco, a firm specializing in the preparation of 
income tax returns, claims the mean refund for customers 
who received refunds last year was $150. For a random 
sample of 12 customers who received refunds last year, the 



mean amount was found to be $125, with a standard 
deviation of $43. Assuming that the population is approx- 
imately normally distributed, and using the 0.10 level in a 
two-tail test, do these results suggest that Taxco's 
assertion may be accurate? 

10.46 The new director of a local YMCA has been 
told by his predecessors that the average member has 
belonged for 8.7 years. Examining a random sample of 
15 membership files, he finds the mean length of 
membership to be 7.2 years, with a standard deviation 
of 2.5 years. Assuming the population is approximately 
normally distributed, and using the 0.05 level, does this 
result suggest that the actual mean length of membership 
may be some value other than 8.7 years? 

10.47 A scrap metal dealer claims that the mean of his 
cash sales is "no more than $80," but an Internal 
Revenue Service agent believes the dealer is untruthful. 
Observing a sample of 20 cash customers, the agent finds 
the mean purchase to be $91, with a standard deviation 
of $21. Assuming the population is approximately 
normally distributed, and using the 0.05 level of 
significance, is the agent's suspicion confirmed? 

10.48 During 2002, college work-study students earned a 
mean of $1252. Assume that a sample consisting of 45 of 
the work-study students at a large university was found 
to have earned a mean of $1277 during that year, with a 
standard deviation of $210. Would a one-tail test at the 
0.05 level suggest the average earnings of this university's 
work-study students were significantly higher than the 
national mean? source: Bureau of the Census, Statistical Abstract of 
the United States 2002, p. 172. 

10.49 According to the New York Stock Exchange, the 
mean portfolio value for U.S. senior citizens who are 
shareholders is $183,000. Suppose a simple random sam- 
ple of 50 senior citizen shareholders in a certain region of 
the United States is found to have a mean portfolio value 
of $198,700, with a standard deviation of $65,000. From 
these sample results, and using the 0.05 level of 
significance in a two-tail test, comment on whether the 
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mean portfolio value for all senior citizen shareholders in 
this region might not be the same as the mean value 
reported for their counterparts across the nation, source: 
New York Stock Exchange, Fact Book 1998, p. 57. 

10.50 Using the sample results in Exercise 10.49, 
construct and interpret the 95% confidence interval for 
the population mean. Is the hypothesized population 
mean ($183,000) within the interval? Given the presence 
or absence of the $183,000 value within the interval, is 
this consistent with the findings of the hypothesis test 
conducted in Exercise 10.49? 

10.51 It has been reported that the average life for 
halogen lightbulbs is 4000 hours. Learning of this figure, 
a plant manager would like to find out whether the 
vibration and temperature conditions that the facility's 
bulbs encounter might be having an adverse effect on 
the service life of bulbs in her plant. In a test involving 
15 halogen bulbs installed in various locations around 
the plant, she finds the average life for bulbs in the 
sample is 3882 hours, with a standard deviation of 

200 hours. Assuming the population of halogen bulb 
lifetimes to be approximately normally distributed, and 
using the 0.025 level of significance, do the test results 
tend to support the manager's suspicion that adverse con- 
ditions might be detrimental to the operating lifespan of 
halogen lightbulbs used in her plant? source: Cindy Hall and 
Gary Visgaitis, "Bulbs Lasting Longer," USA Today, March 9, 2000, p. 1D. 

10.52 In response to an inquiry from its national office, 
the manager of a local bank has stated that her bank's 
average service time for a drive-through customer is 

93 seconds. A student intern working at the bank 
happens to be taking a statistics course and is curious as 
to whether the true average might be some value other 
than 93 seconds. The intern observes a simple random 
sample of 50 drive-through customers whose average 
service time is 89.5 seconds, with a standard deviation 
of 11.3 seconds. From these sample results, and using 
the 0.05 level of significance, what conclusion would the 
student reach with regard to the bank manager's claim? 

10.53 Using the sample results in Exercise 10.52, 
construct and interpret the 95% confidence interval for 
the population mean. Is the hypothesized population 
mean (93 seconds) within the interval? Given the 
presence or absence of the 93 minutes value within the 
interval, is this consistent with the findings of the hypoth- 
esis test conducted in Exercise 10.52? 

10.54 The U.S. Census Bureau says the 52-question 
"long form" received by 1 in 6 households during the 
2000 census takes a mean of 38 minutes to complete. 
Suppose a simple random sample of 35 persons is 
given the form, and their mean time to complete it is 
36.8 minutes, with a standard deviation of 4.0 minutes. 
From these sample results, and using the 0.10 level of 



significance, would it seem that the actual population 
mean time for completion might be some value other 
than 38 minutes? source: Haya El Nasser, "Census Forms Can Be 
Filed by Computer," USA Today, February 10, 2000, p. 4A. 

10.55 Using the sample results in Exercise 10.54, 
construct and interpret the 90% confidence interval for 
the population mean. Is the hypothesized population 
mean (38 minutes) within the interval? Given the 
presence or absence of the 38 minutes value within the 
interval, is this consistent with the findings of the 
hypothesis test conducted in Exercise 10.54? 

/ data set/ Note: Exercises 10.56-10.58 require a 
computer and statistical software. 

10.56 The International Council of Shopping Centers 
reports that the average teenager spends $39 during a 
shopping trip to the mall. The promotions director of a 
local mall has used a variety of strategies to attract 
area teens to his mall, including live bands and "teen- 
appreciation days" that feature special bargains for this 
age group. He believes teen shoppers at his mall respond 
to his promotional efforts by shopping there more often 
and spending more when they do. Mall management 
decides to evaluate the promotions director's success 

by surveying a simple random sample of 45 local teens 
and finding out how much they spent on their most 
recent shopping visit to the mall. The results are listed in 
data file XR10056. Use a suitable hypothesis test in exam- 
ining whether the mean mall shopping expenditure for 
teens in this area might be higher than for U.S. teens as a 
whole. Identify and interpret the p-value for the test. 
Using the 0.025 level of significance, what conclusion do 
you reach? source: Anne R. Carey and Suzan Deo, "Mall Denizens 
Compared," USA Today 1 999 Snapshot Calendar, September 18-19. 

10.57 According to the Insurance Information Institute, 
the mean annual expenditure for automobile insurance for 
U.S. motorists is $706. Suppose that a government official 
in North Carolina has surveyed a simple random sample 
of 80 residents of her state, and that their auto insurance 
expenditures for the most recent year are in data file 
XR10057. Based on these data, examine whether the mean 
annual auto insurance expenditure for motorists in North 
Carolina might be different from the $706 for the country 
as a whole. Identify and interpret the p-value for the test. 
Using the 0.05 level of significance, what conclusion do 
you reach? source: Insurance Information Institute, Insurance Fact 
Book 2000, p. 2.6. 

10.58 Using the sample data in Exercise 10.57, construct 
and interpret the 95% confidence interval for the popula- 
tion mean. Is the hypothesized population mean ($706) 
within the interval? Given the presence or absence of 

the $706 value within the interval, is this consistent 
with the findings of the hypothesis test conducted in 
Exercise 10.57? 
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10.6 



TESTING A PROPORTION 



Occasions may arise when we wish to compare a sample proportion, p, with a 
value that has been hypothesized for the population proportion, it. As we noted 
in Figure 10.2, the theoretically correct distribution for dealing with proportions 
is the binomial distribution. However, the normal distribution is a good approxi- 
mation when 7? 77 > 5 and «(1 — 77) > 5. The larger the sample size, the better this 
approximation becomes, and for most practical settings, this condition is satis- 
fied. When using the normal distribution for hypothesis tests of a sample propor- 
tion, the test statistic is as follows: 



Test statistic, z-test for a sample proportion 

p ~ 77, 



where p = sample proportion 

77 0 = hypothesized population proportion 
n = sample size 
(Tp = standard error of the distribution of the 
sample proportion 



^(i ~ go) 



Two-Tail Testing of a Proportion 

Whenever the null hypothesis involves a proportion and is nondirectional, this 
technique is appropriate. To demonstrate how it works, consider the following 
situation. 



example 



Two-Tail Test 

The career services director of Hobart University has said that 70% of the school's 
seniors enter the job market in a position directly related to their undergraduate 
field of study. In a sample consisting of 200 of the graduates from last year's class, 
66% have entered jobs related to their field of study. The underlying data are in file 
CX10GRAD, with values coded as 1 = no job in field, 2 = job in field. 

SOLUTION 

Formulate the Null and Alternative Hypotheses 

The director's statement is nondirectional and leads to null and alternative hy- 
potheses of 

H 0 : 77 = 0.70 The proportion of graduates entering jobs in their field is 0.70. 
H^. 77 # 0.70 The proportion is some value other than 0.70. 

Select the Significance Level 

For this test, the 0.05 level will be used. The sum of the two tail areas will be 0.05. 




Chapter 10: Hypothesis Tests Involving a Sample Mean or Proportion 



337 





that 70% of a university's 
seniors enter jobs related 
to their field of study. 



a sample proportion, the 
sample result leads to 



nonrejection of the career 
services director's claim 



In this two-tail test involving 



77n = 0.70 



z = -1.96 



z = +1.96 



Test statistic: 
z = -1.23 



Select the Test Statistic and Calculate Its Value 

The test statistic will be z, the number of standard error units from the hypothe- 
sized population proportion, tt q = 0.70, to the sample proportion, p = 0.66. The 
standard error of the sample proportion is 



Identify Critical Values for the Test Statistic and State the Decision Rule 

Since the test is two-tail and the selected level of significance is 0.05, the critical 
values will be z = —1.96 and z = +1.96. The decision rule can be stated as "Reject 
H Q if calculated z is either <— 1.96 or > +1.96, otherwise do not reject." 

Compare the Calculated and Critical Values and Reach a Conclusion for the 
Null Hypothesis 

The calculated value of the test statistic, z = —1.23, falls between the two critical 
values, placing it in the nonrejection region of the distribution shown in Figure 10.8. 
The null hypothesis is not rejected. 

Make the Related Decision 

Failure to reject the null hypothesis leads us to conclude that the proportion of 
graduates who enter the job market in careers related to their field of study could 
indeed be equal to the claimed value of 0.70. If the career services director has 
been making this claim to her students or their parents, this analysis would sug- 
gest that her assertion not be challenged. 




and the calculated value of z will be 



z = 



p - 77-q _ 0.66 - 0.70 
ov ~ 0.0324 



= -1.23 
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One-Tail Testing of a Proportion 

Directional tests for a proportion are similar to the preceding example, but have 
only one tail area in which the null hypothesis can be rejected. Consider the fol- 
lowing actual case. 



Example 



One-Tail Test 

In an administrative decision, the U.S. Veterans Administration (VA) closed the 
cardiac surgery units of several VA hospitals that either performed fewer than 150 
operations per year or had mortality rates higher than 5.0%. a In one of the closed 
surgery units, 100 operations had been performed during the preceding year, with a 
mortality rate of 7.0%. The underlying data are in file CX10HOSP, with values 
coded as 1 = nonfatality, 2 = fatality. At the 0.01 level of significance, was the mor- 
tality rate of this hospital significantly greater than the 5.0% cutoff point? Consider 
the hospital's performance as representing a sample from the population of possible 
operations it might have performed if the patients had been available. 

SOLUTION 

Formulate the Null and Alternative Hypotheses 

The null hypothesis makes the assumption that the "population" mortality rate 
for the hospital cardiac surgery unit is really no greater than 0.05, and that the 
observed proportion, p = 0.07, was simply due to chance variation. 



H 0 : 



it < 0.05 The true mortality rate for the unit is no 

more than 0.05. 
it > 0.05 The true mortality rate is greater than 0.05. 



The center of the hypothesized distribution, ir 0 = 0.05, is the highest possible 
value for which the null hypothesis could be true. 

Select the Significance Level 

The significance level has been specified as a = 0.01. If the null hypothesis 
were really true, there would be no more than a 0.01 probability of incorrectly 
rejecting it. 

Select the Test Statistic and Calculate Its Value 

The test statistic will be z, calculated as z = (p — 7T 0 )/cr The standard error of 
the sample proportion and the calculated value of the test statistic are 



7r 0 (l-7r 0 ) , / 0.05(1 - 0.05 



100 



0.02179 



anc 



p - 77-p _ 0.07 - 0.05 
ov " 0.02179 



0.92 



Source: "VA Halts Some Heart Operations," Indiana Gazette, January 13, 1987. 
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H 0 : 77 < 0.05 

Hy TT > 0.05 



Do not reject H 0 - 



-0.05 



Reject H Q —>- 
- Area = 0.01 



z = +2.33 



Test statistic: 
Z = 0.92 



FIGURE 10.9 



The U.S. Veterans 
Administration closed a 
number of cardiac surgery 
units because they either 
performed fewer than 
1 50 operations or had a 
mortality rate over 5.0% 
during the previous year. 
For one of the hospitals, 
there was a mortality rate 
of 7.0% in 100 operations, 
but this relatively high 
mortality rate could have 
been due to chance 
variation. 



Identify the Critical Value for the Test Statistic and State the Decision Rule 

For the 0.01 level, the critical value of z is z = +2.33. The decision rule can be 
stated as "Reject H 0 if calculated z > +2.33, otherwise do not reject." 

Compare the Calculated and Critical Values and Reach a Conclusion 
for the Null Hypothesis 

Since the calculated value, z = 0.92, is less than the critical value, it falls into the 
nonrejection region of Figure 10.9 and the null hypothesis, H Q : tt < 0.05, cannot 
be rejected. 

Make the Related Business Decision 

The cardiac surgery mortality rate for this hospital could have been as high as 
0.07 merely by chance, and closing it could not be justified strictly on the basis of 
a "significantly greater than 0.05" guideline. [Notes: (1) The VA may have been 
striving for some lower population proportion not mentioned in the article, and 
(2) because the cardiac unit did not meet the minimum requirement of 150 oper- 
ations per year, the VA would have closed it anyway.] 



Computer Solutions 10.3 shows how we can use Excel or Minitab to carry 
out a hypothesis test for a proportion. In this case, we are replicating the hypoth- 
esis test shown in Figure 10.8, using summary information. The printouts in 
Computer Solutions 10.3 show the p-value (0.217) for the test. This p-value rep- 
resents the following statement: "If the population proportion really is 0.70, there 
is a 0.217 probability of getting a sample proportion this far away from 0.70 just 
by chance." The p-value is not less than the level of significance we are using 
to reach a conclusion (i.e., p-value = 0.217 is not less than a = 0.05), and 
H Q : tt = 0.70 cannot be rejected. 

In the Minitab portion of Computer Solutions 10.3, the 95% confidence 
interval is shown as (0.594349 to 0.725651). The hypothesized population pro- 
portion (0.70) falls within the 95% confidence interval, and, at this confidence 
level, it appears that the population proportion could be 0.70. This relationship 
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computer solutions 10.3 




I 





Hypothesis Test for a Population Proportion 

These procedures show how to carry out a hypothesis test for the population proportion. 

EXCEL 





A 


B 


C 


D 


1 


z-Test of a Proportion 








2 










3 


Sample proportion 


0.66 


zStat 


-1.23 


4 


Sample size 


200 


P(Z<=z) one-tail 


0.1085 


5 


Hypothesized proportion 


0.70 


z Critical one-tail 


1 .6449 


6 


Alpha 


0.05 


P(Z<=z) two-tail 


0.2170 


7 






z Critical two-tail 


1.9600 



Excel hypothesis test for tt based on summary statistics 

1. For example, with n = 200 and p = 0.66, as in Figure 10.8: Open the TEST STATISTICS.XLS workbook, supplied with 
the text. 

2. Using the arrows at the bottom left, select the z-Test_Proportion worksheet. Enter the sample proportion (0.66), the 
sample size (200), the hypothesized population proportion (0.70), and the level of significance for the test (0.05). The 
p-value for this two-tail test is shown as 0.2170. 

(Note: As an alternative, you can use Excel worksheet template TMPTEST.XLS, supplied with the text. The steps are 
described within the template.) 

Excel hypothesis test for tt based on raw data 

1. For example, using data file CX10GRAD.XLS, with the label and 200 data values in A1:A201 and data coded as 
1 = no job in field, 2 = job in field: Click Tools. Click Data Analysis Plus. Click Z-Test: Proportion. Click OK. 

2. Enter A1:A201 into the Input Range box. Enter 2 into the Code for Success box. Enter 0.70 into the Hypothesized 
Proportion box. Click Labels. Enter the level of significance for the test (0.05) into the Alpha box. Click OK. 

MINI TAB 

Minitab hypothesis test for tt based on summary statistics 
Test and CI for One Proportion 

Test of p = 0.7 vs p not =0.7 

Sample X N Sample p 95% CI Z-Value P-Value 

1 132 200 0.660000 (0.594349, 0.725651) -1.23 0.217 

Using the normal approximation. 

1 . This interval is based on the summary statistics for Figure 10.8: Click Stat. Select Basic Statistics. Click 1 Proportion. 
Select Summarized Data. Enter the sample size (200) into the Number of Trials box. Multiply the sample proportion 
(0.66) times the sample size (200) to get the number of "events" or "successes" (0.66)(200) = 132. (Had this not been 
an integer, it would have been necessary to round to the nearest integer.) Enter the number of "successes" (132) into 
the Number of events box. Select Perform hypothesis test and enter the hypothesized population proportion 
(0.70) into the Hypothesized proportion box. 
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2. Click Options. Enter the desired confidence level as a percentage (95.0) into the Confidence Level box. Within the 
Alternative box, select not equal. Click to select Use test and interval based on normal distribution. Click OK. 
Click OK. 

Minitab hypothesis test for tt based on raw data 

1. For example, using file CX10GRAD.MTW, with column C1 containing the 200 assumed data values (coded as 1 = no 
job in field, 2 = job in field): Click Stat. Select Basic Statistics. Click 1 Proportion. Select Samples in columns and 
enter C1 into the dialog box. Select Perform hypothesis test and enter the hypothesized proportion (0.70) into the 
Hypothesized Proportion box. 

2. Follow step 2 in the summary-information procedure, above. Note: Minitab will select the larger of the two codes (i.e., 
2 = job in field) as the "success" and provide the sample proportion and the confidence interval for the population pro- 
portion of graduates having jobs in their fields. To obtain the results for those not having jobs in their fields, just recode 
the data so graduates without jobs in their fields will have the higher code number: Click Data. Select Code. Click 
Numeric to Numeric. Enter C1 into both the Code data from columns box and the Into columns box. Enter 1 into 
the Original values box. Enter 3 into the New box. Click OK. The new codes will be 3 = no job in field, 2 = job in field. 



between confidence intervals and two-tail hypothesis tests was discussed in 
Section 10.4. 

Had we used the computer to perform the test summarized in Figure 10.9, we 
would have found the p-value for this one-tail test to be 0.179. The p-value for the 
test says, "If the population proportion really is 0.05, there is a 0.179 probability of 
getting a sample proportion this large (0.07) just by chance." The p-value (0.179) is 
not less than the level of significance we are using as a criterion (a = 0.01), so we 
are unable to reject H Q : tt < 0.05. 



exercises 



10.59 When carrying out a hypothesis test for a popula- 
tion proportion, under what conditions is it appropriate 
to use the normal distribution as an approximation to the 
(theoretically correct) binomial distribution? 

10.60 For a simple random sample, n = 200 and p = 0.34. 
At the 0.01 level, test H Q : tt = 0.40 versus H t : tt + 0.40. 

10.61 For a simple random sample, n = 1000 and 

p = 0.47. At the 0.05 level, test H Q : tt > 0.50 versus 
H t : tt < 0.50. 

10.62 For a simple random sample, n = 700 and 

p = 0.63. At the 0.025 level, test H Q : tt < 0.60 versus 
H t : tt > 0.60. 

10.63 A simple random sample of 300 items is selected 
from a large shipment, and testing reveals that 4% of the 
sampled items are defective. The supplier claims that no 
more than 2% of the items in the shipment are defective. 
Carry out an appropriate hypothesis test and comment 
on the credibility of the supplier's claim. 



10.64 The director of admissions at a large university says 
that 15% of high school juniors to whom she sends univer- 
sity literature eventually apply for admission. In a sample 
of 300 persons to whom materials were sent, 30 students 
applied for admission. In a two-tail test at the 0.05 level 

of significance, should we reject the director's claim? 

10.65 According to the human resources director of a 
plant, no more than 5% of employees hired in the past year 
have violated their preemployment agreement not to use 
any of five illegal drugs. The agreement specified that ran- 
dom urine checks could be carried out to ascertain compli- 
ance. In a random sample of 400 employees, screening 
detected at least one of these drugs in the systems of 8% of 
those tested. At the 0.025 level, is the human resources 
director's claim credible? Determine and interpret the 
p-value for the test. 

10.66 It has been claimed that 65% of homeowners 
would prefer to heat with electricity instead of gas. A 
study finds that 60% of 200 homeowners prefer electric 
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heating to gas. In a two-tail test at the 0.05 level of signif- 
icance, can we conclude that the percentage who prefer 
electric heating may differ from 65%? Determine and 
interpret the p-value for the test. 

10.67 In the past, 44% of those taking a public account- 
ing qualifying exam have passed the exam on their first 
try. Lately, the availability of exam preparation books 
and tutoring sessions may have improved the likelihood 
of an individual's passing on his or her first try. In a sam- 
ple of 250 recent applicants, 130 passed on their first 
attempt. At the 0.05 level of significance, can we 
conclude that the proportion passing on the first try 

has increased? Determine and interpret the p- value for 
the test. 

10.68 Opinion Research has said that 49% of U.S. adults 
have purchased life insurance. Suppose that for a random 
sample of 50 adults from a given U.S. city, a researcher 
finds that only 38% of them have purchased life 
insurance. At the 0.05 level in a one-tail test, is this sam- 
ple finding significantly lower than the 49% reported by 
Opinion Research? Determine and interpret the p-value 
for the test. source: Cindy Hall and Web Bryant, "Life Insurance 
Prospects," USA Today, November 11, 1996, p. 1B. 

10.69 According to the National Association of Home 
Builders, 62% of new single-family homes built during 
1996 had a fireplace. Suppose a nationwide homebuilder 
has claimed that its homes are "a cross section of 
America," but a simple random sample of 600 of its 
single-family homes built during that year included only 
57.5% that had a fireplace. Using the 0.05 level of signifi- 
cance in a two-tail test, examine whether the percentage 
of sample homes having a fireplace could have differed 
from 62% simply by chance. Determine and interpret the 
p-value for the test, source: National Association of Homebuilders, 
7998 Housing Facts, Figures, and Trends, p. 7. 

10.70 Based on the sample results in Exercise 10.69, con- 
struct and interpret the 95% confidence interval for the 
population proportion. Is the hypothesized proportion 
(0.62) within the interval? Given the presence or absence 
of the 0.62 value within the interval, is this consistent 
with the findings of the hypothesis test conducted in 
Exercise 10.69? 

10.71 According to the U.S. Bureau of Labor Statistics, 
9.0% of working women who are 16 to 24 years old 
are being paid minimum wage or less. (Note that some 
workers in some industries are exempt from the 
minimum wage requirement of the Fair Labor 
Standards Act and, thus, could be legally earning less 
than the "minimum" wage.) A prominent politician is 
interested in how young working women within her 
county compare to this national percentage, and selects 
a simple random sample of 500 working women who 
are 16 to 24 years old. Of the women in the sample, 
55 are being paid minimum wage or less. From these 



sample results, and using the 0.10 level of significance, 
could the politician conclude that the percentage of 
young working women who are low-paid in her county 
might be the same as the percentage of young women 
who are low-paid in the nation as a whole? Determine 
and interpret the p-value for the test, source: Encyclopaedia 
Britannica Almanac 2003, p. 887. 

10.72 Using the sample results in Exercise 10.71, 
construct and interpret the 90% confidence interval 

for the population proportion. Is the hypothesized popu- 
lation proportion (0.09) within the interval? Given the 
presence or absence of the 0.09 value within the interval, 
is this consistent with the findings of the hypothesis test 
conducted in Exercise 10.71? 

10.73 Brad Davenport, a consumer reporter for a national 
cable TV channel, is working on a story evaluating generic 
food products and comparing them to their brand-name 
counterparts. According to Brad, consumers claim to like 
the brand-name products better than the generics, but they 
can't even tell which is which. To test his theory, Brad gives 
each of 200 consumers two potato chips — one generic, 
the other a brand name — and asks them which one is 

the brand-name chip. Fifty-five percent of the subjects cor- 
rectly identify the brand-name chip. At the 0.025 level, 
is this significantly greater than the 50% that could be 
expected simply by chance? Determine and interpret the 
p-value for the test. 

10.74 It has been reported that 80% of taxpayers who 
are audited by the Internal Revenue Service end up 
paying more money in taxes. Assume that auditors are 
randomly assigned to cases, and that one of the ways 
the IRS oversees its auditors is to monitor the 
percentage of cases that result in the taxpayer paying 
more taxes. If a sample of 400 cases handled by an 
individual auditor has 77.0% of those she audited pay- 
ing more taxes, is there reason to believe her overall 
"pay more" percentage might be some value other than 
80%? Use the 0.10 level of significance in reaching a 
conclusion. Determine and interpret the p-value for the 
test. source: Sandra Block, "Audit Red Flags You Don't Want to 
Wave," USA Today, April 11, 2000, p. 3B. 

10.75 Based on the sample results in Exercise 10.74, 
construct and interpret the 90% confidence interval for 
the population proportion. Is the hypothesized proportion 
(0.80) within the interval? Given the presence or absence 
of the 0.80 value within the interval, is this consistent 
with the findings of the hypothesis test conducted in 
Exercise 10.74? 

/ data set/ Note: Exercises 10.76-10.78 require a 
computer and statistical software. 

10.76 According to the National Collegiate Athletic 
Association (NCAA), 41% of male basketball players 
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graduate within 6 years of enrolling in their college or 
university, compared to 56% for the student body as a 
whole. Assume that data file XR10076 shows the current 
status for a sample of 200 male basketball players who 
enrolled in New England colleges and universities 6 years 
ago. The data codes are 1 = left school, 2 = still in 
school, 3 = graduated. Using these data and the 0.10 
level of significance, does the graduation rate for male 
basketball players from schools in this region differ 
significantly from the 41% for male basketball players 
across the nation? Identify and interpret the p-value for 
the test. source: "NCAA Basketball 'Reforms' Come Up Short," 
USA Today, April 1 , 2000, p. 1 7 A. 

10.77 Using the sample results in Exercise 10.76, 
construct and interpret the 90% confidence interval 
for the population proportion. Is the hypothesized 
proportion (0.41) within the interval? Given the presence 
or absence of the 0.41 value within the interval, is this 



consistent with the findings of the hypothesis test 
conducted in Exercise 10.76? 

10.78 Website administrators sometimes use analysis 
tools or service providers to "track" the movements of 
visitors to the various portions of their site. Overall, the 
administrator of a political action website has found that 
35% of the visitors who visit the "Environmental Issues" 
page go on to visit the "Here's What You Can Do" page. 
In an effort to increase this rate, the administrator places 
a photograph of an oil-covered sea otter on the Issues 
page. Of the next 300 visitors to the Issues page, 40% 
also visit the Can Do page. The data are in file XR10078, 
coded as 1 = did not go on to visit Can Do page and 
2 = went on to visit Can Do page. At the 0.05 level, is 
the 40% rate significantly greater than the 35% that had 
been occurring in the past, or might this higher rate be 
simply due to chance variation? Identify and interpret the 
p-value for the test. 



THE POWER OF A HYPOTHESIS TEST 10.7 



Hypothesis Testing Errors and the Power of a Test 

As discussed previously in the chapter, incorrect conclusions can result from 
hypothesis testing. As a quick review, the mistakes are of two kinds: 

• Type I error, rejecting a true hypothesis: 

a = probability of rejecting H Q when H Q is true 

or 

a = P( reject H Q \H 0 true) 

a = the level of significance of a test 

• Type II error, failing to reject a false hypothesis: 

;8 = probability of failing to reject H Q when H 0 is false 

or 

f3 = P(fail to reject H Q \H Q false) 
1 — /3 = probability of rejecting H 0 when H Q is false 
1 — j8 = the power of a test 



In this section, our focus will be on (1 — (3), the power of a test. As mentioned 
previously, there is a trade-off between a and /3: For a given sample size, reducing 
a tends to increase /3, and vice versa; with larger sample sizes, however, both a 
and /3 can be decreased for a given test. 

In wishing people luck, we sometimes tell them, "Don't take any wooden 
nickels." As an analogy, the power of a hypothesis test is the probability that the 
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test will correctly reject the "wooden nickel" represented by a false null hypothe- 
sis. In other words, (1 — /3), the power of a test, is the probability that the test will 
respond correctly by rejecting a false null hypothesis. 

The Power of a Test: An Example 

As an example, consider the Extendabulb test, presented in Section 10.3 and illus- 
trated in Figure 10.4. The test can be summarized as follows: 

• Null and alternative hypotheses: 

H Q : /jl < 1030 hours Extendabulb is no better than the 

previous system. 

Hji /x > 1030 hours Extendabulb does increase bulb life. 

• Significance level selected: 0.05 

• Calculated value of test statistic: 

x - fi 0 1061.6 - 1030.0 

Z = - "_ u = — — = 2.22 

cr/Vn 90/V40 

• Critical value for test statistic: z = +1.645 

• Decision rule: Reject H 0 if calculated z > +1.645, otherwise do not reject. 

For purposes of determining the power of the test, we will first convert the 
critical value, z = +1.645, into the equivalent mean bulb life for a sample of this 
size. This will be 1.645 standard error units to the right of the mean of the 
hypothesized distribution (1030 hours). The standard error for the distribution of 
sample means is a- = a/\n = 90/V40, or 14.23 hours. The critical z value can 
now be converted into a critical sample mean: 

Sample mean, x 

corresponding to = 1030.00 + 1.645(14.23) = 1053.41 hours 
critical z = +1.645 

and the decision rule, "Reject H 0 if calculated test statistic is greater than z = +1.645" 
can be restated as "Reject H Q if sample mean is greater than 1053.41 hours." 

The power of a test to correctly reject a false hypothesis depends on the true 
value of the population mean, a quantity that we do not know. At this point, we 
will assume that the true mean has a value that would cause the null hypothesis 
to be false, then the decision rule of the test will be applied to see whether this 
"wooden nickel" is rejected, as it should be. 

As an arbitrary choice, the true mean life of Extendabulb-equipped bulbs will 
be assumed to be /jl = 1040 hours. The next step is to see how the decision rule, 
"Reject H 0 if the sample mean is greater than 1053.41 hours," is likely to react. In 
particular, interest is focused on the probability that the decision rule will correctly 
reject the false null hypothesis that the mean is no more than 1030 hours. 

As part (a) of Figure 10.10 shows, the distribution of sample means is centered 
on fjL — 1040 hours, the true value assumed for bulb life. The standard error of the 
distribution of sample means remains the same, so the spread of the sampling dis- 
tribution is unchanged compared to that in Figure 10.4. In part (a) of Figure 10.10, 
however, the entire distribution has been "shifted" 10 hours to the right. 

If the true mean is 1040 hours, the shaded portion of the curve in part (a) of 
Figure 10.10 represents the power of the hypothesis test — that is, the probability 
that it will correctly reject the false null hypothesis. Using the standard error of 
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FIGURE 10.10 



Decision rule 

Reject H 0 if x > 1053.41 hours 

(a) If actual mean is 1040 hours 

_ 1053.41 - 1040.00 
Z " 14.23 



P = 0.8264 = probability 
of not rejecting 
the false Hq 



H 0 : ft < 1030 hours H { : /ll > 1030 hours 
ov= 14.23 hours 



+0.94 



1053.41 




1000 

(b) If actual mean is 1060 hours 

_ 1053.41 - 1060.00 
z " 14.23 



1040 





Power of the test: 




1 -/3 = 0.1736, 




probability of 






rejecting the 




false H 0 


1 1 1 


1080 


1120 



1053.41 



-0.46 



P = 0.3228 = probability 
of not rejecting 
the false Hq 









1000 

(c) If actual mean is 1080 hours 

1053.41 - 1080.00 



P = 0.0307 = probability 
of not rejecting 
the false Hq 






Power of the test: 




1 - P = 0.6772, 




probability of 






rejecting the 




false H 0 



1053.41 



-1.87 



1120 



Power of the test: 
1 -/3 = 0.9693, 
probability of 
rejecting the 
false H 0 




The power of the test (1 — p) is the probability that the decision rule will correctly reject a false null hypothesis. For example, if the 
population mean were really 1 040 hours (part a), there would be a 0. 1 736 probability that the decision rule would correctly reject the 
null hypothesis that jj. £ 1 030. 
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the sample mean, a- = 14.23 hours, we can calculate the number of standard 
error units from 1040 to 1053.41 hours as 

x - fi 1053.41 - 1040.00 13.41 nnA 

Z = = = = 0.94 standard error units to the 

a- 14.23 14.23 • (t , , ,■ 

x right of the population mean 

From the normal distribution table, we find the area between z = 0.00 and 
Z = +0.94 to be 0.3264. Knowing that the total area to the right of z = 0.00 is 
0.5000, we can calculate the shaded area as 0.5000 - 0.3264, or 0.1736. Thus, 
if the true mean life of Extendabulb-equipped bulbs is 1040 hours, there is a 
0.1736 probability that a sample of 40 bulbs will have a mean in the "reject H 0 " 
region of our test and that we will correctly reject the false null hypothesis that fi 
is no more than 1030 hours. For a true mean of 1040 hours, the power of the test 
is 0.1736. 



The Power Curve for a Hypothesis Test 
One-Tail Test 

In the preceding example, we arbitrarily selected one value (/x = 1040 hours) that 
would make the null hypothesis false, then found the probability that the deci- 
sion rule of the test would correctly reject the false null hypothesis. In other 
words, we calculated the power of the test ( 1 — /3) for just one possible value of 
the actual population mean. If we were to select many such values (e.g., 
fi = 1060, /j- = 1080, /jl = 1100, and so on) for which H Q is false, we could cal- 
culate a corresponding value of (1 — (5) for each of them. 

For example, part (b) of Figure 10.10 illustrates the power of the test when- 
ever the Extendabulb-equipped bulbs are assumed to have a true mean life of 
1060 hours. In part (b), the power of the test is 0.6772. This is obtained by the 
same approach used when the true mean life was assumed to be 1040 hours, but 
we are now using = 1060 hours instead of 1040. 

The diagram in part (c) of Figure 10.10 repeats this process for an assumed value 
of 1080 hours for the true population mean. Notice how the shape of the distribu- 
tion is the same in diagrams (a), (b), and (c), but that the distribution itself shifts from 
one diagram to the next, reflecting the new true value being assumed for /j,. 

Calculating the power of the test (1 — /3) for several more possible values for 
the population mean, we arrive at the power curve shown by the lower line in Fig- 
ure 10.11. (The upper line in the figure will be discussed shortly.) As Figure 10.11 
shows, the power of the test becomes stronger as the true population mean 
exceeds 1030 by a greater margin. For example, our test is almost certain (prob- 
ability = 0.9949) to reject the null hypothesis whenever the true population mean 
is 1090 hours. In Figure 10.11, the power of the test drops to zero whenever the 
true population mean is 1030 hours. This would also be true for all values lower 
than 1030 as well, because such actual values for the mean would result in the 
null hypothesis actually being true — in such cases, it would not be possible to 
reject a false null hypothesis, since the null hypothesis would not be false. 

A complement to the power curve is known as the operating characteristic 
(OC) curve. Its horizontal axis would be the same as that in Figure 10.11, but the 
vertical axis would be identified as (3 instead of (1 — fi). In other words, the oper- 
ating characteristic curve plots the probability that the hypothesis test will not reject 
the null hypothesis for each of the selected values for the population mean. 
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For n = 40 and decision rule: 
"Reject # 0 if i> 1053.41." 
For n = 60 and decision rule: 
"Reject H 0 if x> 1049.11." 



0.00 



1030 1040 1050 1060 1070 1080 

Assumed actual values for jx = mean hours of bulb life 



1090 



Note: As the specified actual value for the mean becomes smaller and approaches 1 030 hours, the power of the test 
approaches 0.05. This occurs because (1) the mean of the hypothesized distribution for the test was set at the high- 
est possible value for which the null hypothesis would still be true (i.e., 1030 hours), and (2) the level of significance 
selected in performing the test was a = 0.05. 



FIGURE 10.11 



The power curve for the 
Extendabulb test of 
Figure 1 0.4 shows the 
probability of correctly 
rejecting H 0 : fi < 1030 
hours for a range of actual 
population means for which 
the null hypothesis would 
be false. If the actual 
population mean were 
1 030 hours or less, the 
power of the test would be 
zero because the null 
hypothesis is no longer 
false. The lower line 
represents the power of 
the test for the original 
sample size, n = 40. The 
upper line shows the 
increased power if the 
hypothesis test had been 
for a sample size of 60. 



Two-Tail Test 

In two-tail tests, the power curve will have a zero value when the assumed popu- 
lation mean equals the hypothesized value, then will increase toward 1.0 in both 
directions from that assumed value for the mean. In appearance, it will somewhat 
resemble an upside-down normal curve. The basic principle for power curve con- 
struction will be the same as for the one-tail test: Assume different population 
mean values for which the null hypothesis would be false, then determine the 
probability that an observed sample mean would fall into a rejection region orig- 
inally specified by the decision rule of the test. 



The Effect of Increased Sample Size on Type I 
and Type II Errors 

For a given sample size, we can change the decision rule so as to decrease /3, the 
probability of making a Type II error. However, this will increase a, the probabil- 
ity of making a Type I error. Likewise, for a given sample size, changing the deci- 
sion rule so as to decrease a will increase (i. In either of these cases, we are 
involved in a trade-off between a and /8. On the other hand, we can decrease both 
a and /3 by using a larger sample size. With the larger sample size, ( 1 ) the sam- 
pling distribution of the mean or the proportion will be narrower, and (2) the 
resulting decision rule will be more likely to lead us to the correct conclusion 
regarding the null hypothesis. 

If a test is carried out at a specified significance level (e.g., a = 0.05), using a 
larger sample size will change the decision rule but will not change a. This is 
because a has been decided upon in advance. However, in this situation the larger 
sample size will reduce the value of /3, the probability of making a Type II error. 
As an example, suppose that the Extendabulb test of Figure 10.4 had involved a 



348 



Part 4: Hypothesis Testing 



sample consisting of n = 60 bulbs instead of just 40. With the greater sample size, 
the test would now appear as follows: 

• The test is unchanged with regard to the following: 

Null hypothesis: H 0 : fx < 1030 hours 
Alternative hypothesis: Hf. jx > 1030 hours 
Population standard deviation: a = 90 hours 
Level of significance specified: a = 0.05 

• The following are changed as the result of n = 60 instead of n = 40: 

The standard error of the sample mean, a-, is now 

— = = — = = 11.62 hours 
Vn V60 

The critical z of +1.645 now corresponds to a sample mean of 

1030.00 + 1.645(11.62) = 1049.11 hours 

The decision rule becomes, "Reject H Q if x > 1049.11 hours." 

With the larger sample size and this new decision rule, if we were to repeat 
the process that led to Figure 10.10, we would find the following values for the 
power of the test. In the accompanying table, they are compared with those 
reported in Figure 10.10, with each test using its own decision rule for the 0.05 
level of significance. 

Power of the test 



With n = 60 With n = 40 

prob(x> 1049.11) prob(x> 1053.41) 



True Value of 1040 0.2177 0.1736 

Population 1060 0.8264 0.6772 

Mean 1080 r-»-0.9961 0.9693 



r 



For example, for n = 60 and the decision rule shown, 
x - Li 1049.11 - 1080.00 

* = ^f = IUi — = " 2 - 66 

with the normal curve area to the right of z = —2.66 equal to 0.9961. 



As these figures indicate, the same test with n = 60 (and the corresponding 
decision rule) would have a greater probability of correctly rejecting the false 
null hypothesis for each of the possible population means in Figure 10.10. If the 
Extendabulb example had remained the same, but only the sample size were 
changed, the n = 60 sample size would result in much higher (1 — /3) values than 
the ones previously calculated for n = 40. This curve is shown by the upper line in 
Figure 10.11. Combining two or more power curves into a display similar to Fig- 
ure 10.11 can reveal the effect of various sample sizes on the susceptibility of a test 
to Type II error. Seeing Statistics Applet 13, at the end of the chapter, allows you to 
do "what-if" analyses involving a nondirectional test and its power curve values. 
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Computer Solutions 10.4 shows how we can use Excel or Minitab to determine 
the power of the test and generate a power curve like the ones in Figure 10.11. With 
Excel, we specify the assumed population means one at a time, while Minitab offers 
the capability of finding power values for an entire set of assumed population 
means at the same time. In either case, the procedure is much more convenient than 
using a pocket calculator and statistical tables, as was done in generating the values 
shown in Figure 10.10. 



computer solutions 1 0.4 




The Power Curve for a Hypothesis Test 

These procedures show how to construct the power curve for a hypothesis test. The example used is the 
Extendabulb test described in Figures 10.4, 10.10, and 10.11. 

EXCEL 



D 



Type II Error 



HO: MU 



1030 



Critical value 



1053.41 



SIGMA 



90 



Prob(Type II error) 



0.8269 



Sample size 



40 



Power of the test 



0.1731 



ALPHA 



0.05 



H1: MU 



1040 



10 



11 



12 



13 



14 



15 



16 



17 



19 



20 



21 



22 



23 




0.00 



1030 1040 1050 1060 1070 
Assumed Actual Mu 



1080 



1090 



Excel procedure for obtaining the power curve for a hypothesis test 

1 . Open the BETA-MEAN.XLS workbook, supplied with the text. Using the arrows at the bottom left, select the Right-tail 
Test worksheet. 

2. Enter 1030 as the hypothesized population mean, HO: MU. Enter 90 as SIGMA, 40 as the Sample size, and 0.05 as 

ALPHA. Enter 1040 as the value for H1:MU, the assumed population mean for which we want the power of the test. 



(continued) 
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Excel automatically calculates the critical value for the sample mean as 1053.41 hours, and the power of the 
test appears in row 5 as 0.1731 . This is comparable to part (a) of Figure 10.10, but is slightly more accurate because 
its calculation does not rely upon printed tables and the inherent gaps within them. We now have one point on the 
power curve for n = 40: Power = 0.1731 on the vertical axis for an Assumed Actual /jl of 1 040 hours on the horizon- 
tal axis. This result is shown above, in the upper portion of the Excel printout. 
3. Repeat step 2 to obtain the power of the test for each of several other assumed actual values for fi — for example, 
1 030, 1 050, 1 060, 1 070, 1 080, and 1 090. The resulting points can then be plotted as shown in the lower portion of 
the Excel printout. 



MINI TAB 

Minitab procedure for obtaining the power curve for a hypothesis test 
Power and Sample Size 

1-Sample Z Test 

Testing mean = null (versus > null) 
Calculating power for mean = null + difference 
Alpha = 0.05 Assumed standard deviation = 90 





Sample 






ice 


Size 




Power 


10 


40 


0. 


173064 


20 


40 


0. 


405399 


30 


40 


0. 


678437 


40 


40 


0. 


878205 


50 


40 


0. 


969174 


60 


40 


0. 


994937 



1 . Click Stat. Select Power and Sample Size. Click 1-Sample Z. Enter 40 into the Sample Sizes box. Enter 10 20 30 40 
50 60 (separated by spaces) into the Differences box. These are the differences between the hypothesized popula- 
tion mean (1030) and the selected values for the assumed actual mean (i.e., 1040, 1050, 1060, 1070, 1080, and 
1090). Enter 90 into the Standard deviation box. 

2. Click Options. Within the Alternative Hypothesis box, select Greater than. Enter 0.05 into the Significance level 
box. Click OK. Click OK. The printout shows the power of the test for sample sizes of 40 and for assumed actual pop- 
ulation means of 1 040 (difference from 1 030 is 1 0), 1 050, 1 060, 1 070, 1 080, and 1 090. It corresponds to the values 
plotted in Figure 10.1 1 . Minitab will also generate a power curve graph (not shown here). 



exercises 





10.79 What is a power curve and how is it applicable to 
hypothesis testing? 

10.80 What is an operating characteristic curve and how 
is it related to the power curve for a test? 

10.81 A hypothesis test has been set up and is to be con- 
ducted at the a = 0.05 level of significance. If the sample 
size is doubled, what will be the effect on a? On /3? 



10.82 For the test described in Exercise 10.31, if the true 
population mean is really 2.520 inches, what is the prob- 
ability that the inspector will correctly reject the false null 
hypothesis that [x, = 2.500 inches? 

10.83 For the test described in Exercise 10.32, assume 
that the true population mean for the new booklet is 
/x = 2.80 hours. Under this assumption, what is the 
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probability that the false null hypothesis, H 0 : /jl > 3.00 
hours, will be rejected? 

10.84 Using assumed true population means of 2.485, 
2.490, 2.495, 2.500, 2.505, 2.510, and 2.515 inches, plot 
the power curve for the test in Exercise 10.31. 

10.85 Using assumed true population means of 2.80, 
2.85, 2.90, 2.95, and 3.00 hours, plot the power curve 
for the test in Exercise 10.32. 

10.86 Using assumed true population percentages of 2%, 
3%, 4%, 5%, 6%, and 7%, plot the power curve for the 
test in Exercise 10.63. 

10.87 When a thread-cutting machine is operating prop- 
erly, only 2% of the units produced are defective. Since 
the machine was bumped by a forklift truck, however, the 
quality-control manager is concerned that it may require 
expensive downtime and a readjustment. The manager 
would like to set up a right-tail test at the 0.01 level of 
significance, then identify the proportion of defectives in 

a sample of 400 units produced. His null hypothesis is 
H Q : tt < 0.02, with H t : tt > 0.02. 



a. What critical value of z will be associated with 
this test? 

b. For the critical z determined in part (a), identify 
the sample proportion that this z value represents, 
then use this value of p in stating a decision rule 
for the test. 

c. What is the probability that the quality-control man- 
ager will fail to reject a false H 0 if the actual popula- 
tion proportion of defectives is tt- = 0.02? If 

tt = 0.03? If tt = 0.04? If tt = 0.05? If tt = 0.06? 

d. Making use of the probabilities calculated in part (c), 
plot the power and operating characteristic curves for 
the test. 

10.88 Plot the operating characteristic curve that 
corresponds to the power curve constructed for 
Exercise 10.84. 

10.89 Plot the operating characteristic curve that 
corresponds to the power curve constructed for 
Exercise 10.86. 



SUMMARY 1 0.8 



• Hypothesis testing: null and alternative hypotheses 

Hypothesis testing is a method of using sample information to evaluate a claim 
that has been converted into a set of statements called the null and alternative 
hypotheses. The null hypothesis can be either rejected or not rejected; if it is not 
rejected, the alternative hypothesis, which asserts the null hypothesis to be wrong, 
must be rejected. The null hypothesis is given the benefit of the doubt and is 
assumed to be true unless we are faced with statistically overwhelming evidence 
to the contrary. 

• Directional and nondirectional hypothesis tests 

A verbal statement from which a hypothesis test originates may be either direc- 
tional or nondirectional. Accordingly, the appropriate hypothesis test will also be 
either directional (one-tail test) or nondirectional (two-tail test). The object of the 
hypothesis test may be a single population mean or proportion, as discussed in 
this chapter, or the difference between two population means or proportions (cov- 
ered in the next chapter). 

• Type I and Type II errors 

When we reject or fail to reject the null hypothesis, there is the possibility 
that our conclusion may be mistaken. Rejecting a true null hypothesis is known 
as a Type I error, while failing to reject a false null hypothesis results in a Type II 
error. Alpha (a), the probability of incorrectly rejecting a true null hypothesis, is 
also known as the level of significance of the test. Beta (/3) is the probability of 
incorrectly failing to reject a false null hypothesis. With sample size and type of 
test (one-tail versus two-tail) being equal, reducing a tends to increase /3, and vice 
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versa, though random samples of larger size can simultaneously reduce both a 
and )3. 

• The hypothesis testing procedure 

Hypothesis testing involves (1) formulating the null and alternative hypotheses, 
(2) selecting the significance level (a) to be used in the test, (3) selecting the test 
statistic and calculating its value, (4) identifying the critical value(s) for the 
test statistic and stating the decision rule, (5) comparing calculated and 
critical value(s) for the test statistic and reaching a conclusion about the null 
hypothesis, and (6) making the related business decision for the situation 
that precipitated the hypothesis test in the first place. In a two-tail test, there 
will be two critical values for the test statistic; in a one-tail test, a single critical 
value. 

• Tests involving a sample mean or proportion 

When testing a sample mean, an important consideration in the selection of 
the test statistic (z or t) is whether the population standard deviation (a) is 
known. If other assumptions are met, the £-test is appropriate whenever a is 
unknown. Figure 10.2 summarizes the test statistic selection process and related 
assumptions. 

If the calculated test statistic falls into a rejection region defined by the critical 
value(s), the difference between the sample mean or proportion and its hypothe- 
sized population value is judged to be too great to have occurred by chance. In 
such a case, the null hypothesis is rejected. 

• The p-value for a hypothesis test 

The p-value for a hypothesis test is the level of significance where the calculated 
value of the test statistic is exactly the same as a critical value, and is another way 
of expressing the level of significance (a) of the test. When the hypothesis test is 
computer assisted, the computer output will include a p-value for use in interpret- 
ing the results. The p-value is the lowest level of significance at which the null 
hypothesis can be rejected. 

• Confidence intervals and nondirectional tests 

Although confidence intervals and hypothesis tests have different purposes, the 
concepts are actually related. In the case of a nondirectional test for a population 
mean, the conclusion at the a level of significance will be the same as that reached 
on the basis of a confidence interval constructed for the 100(1 — a)% confidence 
level. 

• Power and operating characteristic curves for a test 

Another measure of the performance of a hypothesis test is the power of the 
test. This is the probability (1 — /3) that the test will avoid a Type II error by cor- 
rectly rejecting a false null hypothesis. When the power of the test (vertical axis) 
is plotted against a range of population parameter values for which the null 
hypothesis would be false (horizontal axis), the result is known as the power 
curve for the test. A complement to the power curve is the operating character- 
istic (OC) curve, which has the same horizontal axis, but uses /3 instead of 
(1 — jS) for the vertical axis. The OC curve expresses the probability that the 
test will make a Type II error by incorrectly failing to reject the null hypothesis 
for a range of population parameter values for which the null hypothesis would 
be false. 
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Testing a Mean, Population Standard Deviation Known 

Z-test, with test statistic: 

_ X ~ flQ 



where a- = standard error for the 
°* sample mean, = cr/Vn 

x = sample mean 
(i 0 = hypothesized population mean 
n = sample size 

If the population is not normally distributed, n should be &30 for the central limit theo- 
rem to apply. The population cr is usually not known. 



Testing a Mean, Population Standard Deviation Unknown 

£-test, with test statistic: 



where s- = estimated standard error for 
the sample mean, = s/Vn 
x = sample mean 
Mo = hypothesized population mean 
n = sample size and df = n — 1 

For an unknown a, and when the population is approximately normally distributed, the 
f-test is appropriate regardless of the sample size. As n increases, the normality assumption 
becomes less important. If n < 30 and the population is not approximately normal, non- 
parametric testing (e.g., the sign test for central tendency, in Chapter 14) may be applied. 
The f-test is "robust" in terms of not being adversely affected by slight departures from the 
population normality assumption. 

Testing a Proportion 

Z-test, with test statistic: 

z = — where p = sample proportion 

°P tt q = hypothesized population proportion 

n = sample size 
(Tp = standard error of the distribution 
of the sample proportion 



When hit > 5 and n(\ — tt) > 5, the normal distribution is considered to be a good 
approximation to the binomial distribution. If this condition is not met, the exact prob- 
abilities must be derived from the binomial table. Most practical business settings involv- 
ing proportions satisfy this condition, and the normal approximation is used in the 
chapter. 



Confidence Intervals and Hypothesis Testing 

Confidence intervals and hypothesis testing are related. For example, in the case of a 
nondirectional test for a population mean, the conclusion at the a level of significance will 
be the same as that reached on the basis of a confidence interval constructed for the 
100(1 — a)% confidence level. 
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chapter exercises 




10.90 For each of the following situations, determine 
whether a one-tail test or a two-tail test would be 
appropriate. Describe the test, including the null and 
alternative hypotheses, then explain your reasoning in 
selecting it. 

a. A machine that has not been serviced for several 
months is producing output in which 5 % of the items 
are defective. The machine has just been serviced and 
quality should now be improved. 

b. In a speech during her campaign for reelection, a 
Republican candidate claims that 55% of registered 
Democrats in her county intend to vote for her. 

c. Of those who have bought a new car in the past, a 
dealer has found that 70% experience three or more 
mechanical problems in the first four months of own- 
ership. Unhappy with this percentage, the dealer has 
heavily revised the procedure by which predelivery 
mechanical checks are carried out. 

10.91 Strands of human hair absorb elements from the 
bloodstream and provide a historical record of both health 
and the use or nonuse of chemical substances. Hair grows 
at the rate of about one-half inch per month, and a person 
with long hair might be accused or absolved on the basis of 
a segment of hair that sprouted many months or years ago. 
By separately analyzing sections of the hair strand, 
scientists can even approximate the periods of time during 
which drug use was heavy, moderate, light, or altogether 
absent. If a transit employee provides a strand of hair for 
drug testing, state the null and alternative hypotheses in 
verbal terms, then identify what would constitute a Type I 
error and a Type II error in this situation, source: David 
Wessel, "Hair as History: Advantages Seen in New Methods for Drug 
Testing," Wall Street Journal, May 22, 1987, p. 19. 

10.92 Before accepting a large shipment of bolts, the direc- 
tor of an elevator construction project checks the tensile 
strength of a simple random sample consisting of 20 bolts. 
She is concerned that the bolts may be counterfeits, which 
bear the proper markings for this grade of bolt, but are 
made from inferior materials. For this application, the 
genuine bolts are known to have a tensile strength that is 
normally distributed with a mean of 1400 pounds and a 
standard deviation of 30 pounds. The mean tensile strength 
for the bolts tested is 1385 pounds. Formulate and carry 
out a hypothesis test to examine the possibility that the 
bolts in the shipment might not be genuine. 

10.93 Before the hiring of an efficiency expert, the mean 
productivity of a firm's employees was 45.4 units per 
hour, with a standard deviation of 4.5 units per hour. 
After incorporating the changes recommended by the 
expert, it was found that a sample of 30 workers 
produced a mean of 47.5 units per hour. Using the 



0.01 level of significance, can we conclude that the mean 
productivity has increased? 

10.94 The average U.S. family includes 3.17 persons. To 
determine whether families in her city tend to be representa- 
tive in size compared to those across the United States, a city 
council member selects a simple random sample of 40 fami- 
lies. She finds the average number of persons in a family to 
be 3.44, with a standard deviation of 1.10. source: Bureau of 
the Census, Statistical Abstract of the United States 2002, p. 52. 

a. Using a nondirectional hypothesis test and a level of 
significance of your choice, what conclusion would 
be reached? 

b. For the significance level used in part (a), construct 
the appropriate confidence interval and verify the 
conclusion reached in part (a). 

10.95 The average U.S. household has $178,600 in life 
insurance. A local insurance agent would like to see how 
households in his city compare to the national average, 
and selects a simple random sample of 30 households 
from the city. For households in the sample, the average 
amount of life insurance is x = $188,800, with s = 
$25,500. source: American Council of Life Insurance, Life Insurance 
Fact Book 1999, p. 12. 

a. Using a nondirectional hypothesis test and a level of 
significance of your choice, what conclusion would 
be reached? 

b. For the significance level used in part (a), construct 
the appropriate confidence interval and verify the 
conclusion reached in part (a). 

10.96 Technical data provided with a room dehumidifier 
explain that units tested from the production line consume 
a mean of 800 watts of power, with a standard deviation of 
12 watts. The superintendent of a large office complex, 
upon evaluating 30 of the units recently purchased and find- 
ing their mean power consumption to be 805 watts, claims 
that her shipment does not meet company specifications. At 
the 0.05 level of significance, is the superintendent's 
complaint justified? 

10.97 A regional office of the Internal Revenue Service ran- 
domly distributes returns to be audited to the pool of audi- 
tors. Over the thousands of returns audited last year, the 
average amount of extra taxes collected was $356 per 
audited return. One of the auditors, Jeffrey Jones, is 
suspected of being too lenient with persons whose returns 
are being audited. For a simple random sample of 30 of the 
returns audited by Mr. Jones last year, an average of $322 
in extra taxes was collected, with a standard deviation of 
$90. Based on this information and a hypothesis test of 
your choice, do the suspicions regarding Mr. Jones appear 
to be justified? 
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10.98 A state transportation official claims that the mean 
waiting time at exit booths from a toll road near the capi- 
tol is no more than 0.40 minutes. For a sample of 35 
motorists exiting the toll road, it was found that the mean 
waiting time was 0.46 minutes, with a standard deviation 
of 0.16 minutes. At the 0.05 level of significance, can we 
reject the official's claim? 

10.99 During 2000, 3.2% of all U.S. households were 
burglary victims. For a simple random sample of 300 
households from a certain region, suppose that 18 house- 
holds were victimized by burglary during that year. Apply 
an appropriate hypothesis test and the 0.05 level of 
significance in determining whether the region should be 
considered as having a burglary problem greater than 
that for the nation as a whole, source: Bureau of the Census, 
Statistical Abstract of the United States 2002, p. 190. 

10.100 A 1931 issue of Time magazine contained an 
advertisement for Liberty magazine. According to a study 
cited by Liberty, it was found that 15% of Liberty families 
had a "mechanical refrigerator," compared to just 8% for 
all U.S. families. Assuming that the study included a sam- 
ple of 120 Liberty families, and using the 0.01 level of sig- 
nificance, could the percentage of Liberty families owning 
this modern (for 1931) convenience be higher than for the 
nation overall? source: Time, August 10, 1931, p. 49. 

10.101 The administrator of a local hospital has told the 
governing board that 30% of its emergency room patients 
are not really in need of emergency treatment (i.e., the 
problems could just as easily have been handled by an 
appointment with their family physician). In checking a 
random sample of 400 emergency room patients, a board 
member finds that 35% of those treated were not true 
emergency cases. Using an appropriate hypothesis test and 
the 0.05 level, evaluate the administrator's statement. 

10.102 In a taste comparison test, it was found that 58 
of 100 persons preferred the chunky version of a peanut 
butter over the creamy type. An interested observer 
would like to determine whether this proportion (0.58) 
is significantly greater than the (0.50) proportion that 
would tend to result from chance. Using the 0.05 level of 
significance, what conclusion will be reached? 

10.103 An exterminator claims that no more than 10% 
of the homes he treats have termite problems within 

1 year after treatment. In a sample of 100 homes, local 
officials find that 14 had termites less than 1 year after 
being treated. At the 0.05 level of significance, evaluate 
the credibility of the exterminator's statement. 

10.104 A national chain of health clubs says the mean 
amount of weight lost by members during the past month 
was at least 5 pounds. Skeptical of this claim, a consumer 
advocate believes the chain's assertion is an exaggeration. 
She interviews a random sample of 40 members, finding 
their mean weight loss to be 4.6 pounds, with a standard 



deviation of 1.5 pounds. At the 0.01 level of significance, 
evaluate the health club's contention. 

10.105 In an interview with a local newspaper, a 
respected trial lawyer claims that he wins at least 75% of 
his court cases. Bert, a skeptical statistics student, sets up 
a one-tail test at the 0.05 level of significance to evaluate 
the attorney's claim. The student plans to examine a ran- 
dom sample of 40 cases tried by the attorney and 
determine the proportion of these cases that were won. 
The null and alternative hypotheses are H 0 : it > 0.75 and 
Hj: tt < 0.75. Using the techniques of this chapter, Bert 
sets up a hypothesis test in which the decision rule is 
"Reject H 0 if z < —1.645, otherwise do not reject." 
What is the probability that Bert will make a Type II 
error (fail to reject a false null hypothesis) if the 
attorney's true population proportion of wins is 
actually 

a. 7r — 0.75? b. 77 = 0.70? c. 77 = 0.65? 
d. 77 = 0.60? e. 77 = 0.55? 

f. Making use of the probabilities calculated in parts (a) 
through (e), describe and plot the power curve for 
Bert's hypothesis test. 

10.106 A consumer agency suspects that a pet food com- 
pany may be underfilling packages for one of its brands. 
The package label states "1600 grams net weight," and 
the president of the company claims the average weight 
is at least 1600 grams. For a simple random sample of 
35 boxes collected by the consumer agency, the mean and 
standard deviation were 1591.7 grams and 18.5 grams, 
respectively. 

a. At the 0.05 level of significance, what conclusion 
should be reached by the consumer agency? Would 
the president of the company prefer to use a different 
level of significance in reaching a conclusion? 
Explain. 

b. Use the decision rule associated with part (a) and a 
range of selected assumed values for /jl in construct- 
ing the power curve for the test. 

10.107 Historically, Shop-Mart has gotten an average of 
2000 hours of use from its G&E fluorescent lightbulbs. 
Because its fixtures are attached to the ceiling, the bulbs 
are rather cumbersome to replace, and Shop-Mart is look- 
ing into the possibility of switching to Phipps bulbs, which 
cost the same. A sample of Phipps bulbs lasted an average 
of 2080 hours, and the p-value in a right-tail test 

(H Q : [l < 2000 versus H 1 : 1 l> 2000) is 0.012. Shop-Mart 
has shared these results with both G&E and Phipps. 

a. Give Shop-Mart a recommendation. In doing so, use 
"0.012" in a sentence that would be understood by a 
Shop-Mart executive who has had no statistical train- 
ing but is comfortable with probabilities as used by 
weather forecasters. 

b. In interpreting these results, what level of significance 
might G&E like to use in reaching a conclusion? 
Explain. 



356 



Part 4: Hypothesis Testing 



c. In interpreting these results, what level of significance 
might Phipps like to use in reaching a conclusion? 
Explain. 

d. If the test had been two-tail (i.e., H 0 : /jl = 2000 ver- 
sus £L: ju, ~f~ 2000) instead of right-tail, would the 
p-value still be 0.012? If not, what would the p-value 
be? Explain. 

/ data Set / Note: Exercises 10.108-10.113 require 
a computer and statistical software. 

10.108 Before installing a high-tech laser exhibit near the 
exit area, a museum of American technology found the 
average contribution by patrons was $2.75 per person. 
For a sample of 30 patrons following installation of the 
new exhibit, the contributions are as listed in data file 
XR10108. Based on the sample data, and using the 
0.025 level of significance, can we conclude that the new 
exhibit has increased the average contribution of exhibit 
patrons? Identify and interpret the p-value for the test. 

10.109 According to the U.S. Department of Justice, 
27% of violent crimes involve the use of a weapon. We 
will assume that data file XR10109 contains a sample of 
the crime information for a given city, with data for the 
400 crimes coded as 1 = crime involved a weapon and 

2 = crime did not involve a weapon. Using the 0.05 level 
in a two-tail test, was the percentage of violent crimes 
involving a weapon in this city significantly different from 
the 27% in the nation as a whole? Identify and interpret 
the p-value for the test. source: Bureau of the Census, Statistical 
Abstract of the United States 2002, p. 190. 



10.110 Based on the sample results in Exercise 10.109, 
construct and interpret the 95% confidence interval for 
the population proportion. Is the hypothesized 
proportion (0.27) within the interval? Given the presence 
or absence of the 0.27 value within the interval, is this 
consistent with the findings of the hypothesis test 
conducted in Exercise 10.109? 

10.111 In making aluminum castings into alternator 
housings, an average of 3.5 ounces per casting must be 
trimmed off and recycled as a raw material. A new manu- 
facturing procedure has been proposed to reduce the 
amount of aluminum that must be recycled in this way. 
For a sample of 35 castings made with the new process, 
data file XR10111 lists the weights of aluminum trimmed 
and recycled. Based on the sample data, and using the 
0.01 level, has the new procedure significantly reduced 
the average amount of aluminum trimmed and recycled? 
Identify and interpret the p-value for the test. 

10.112 Use the decision rule associated with Exercise 
10.111 and a range of selected assumed values for (i to 
construct the power curve for the test. 

10.113 In the past, the mean lifetime of diesel engine 
injection pumps has been 12,000 operating hours. A new 
injection pump is available that is promoted as lasting 
longer than the old version. In a test of 50 of the new 
pumps, the lifetimes are as listed in data file XR10113. 
Based on the sample data, and using the 0.025 level of 
significance, examine the possibility that the new pumps 
might have a mean life that is no more than that of the 
old design. Identify and interpret the p-value for the test. 



integrated cases 




Thorndike Sports Equipment 

The first item on Ted Thorndike's agenda for today is a 
meeting with Martha Scott, a technical sales representative 
with Cromwell Industries. Cromwell produces a racquet- 
stringing machine that is promoted as being the fastest in 
the industry. In their recent telephone conversation, 
Martha offered Ted the chance to try the machine for a full 
week, then return it to Cromwell if he decides it doesn't 
offer a worthwhile improvement. 

Currently, the Thorndike racquetball and tennis rac- 
quets are strung using a type of machine that has been 
around for about 10 years, and the Thorndikes have been 
very pleased with the results. When efficiency experts vis- 
ited the plant last year, they found it took an average of 
3.25 minutes to string a racquetball racquet and an aver- 
age of 4.13 minutes for a tennis racquet. Both distributions 
were found to be approximately normal. 



Ms. Scott comes by, explains the operation of the 
machine, and assists in its installation. Because its controls 
are similar in function and layout to the older model, no 
operator training is necessary. During the 1-week trial, the 
new machine will be working side-by-side with older mod- 
els in the racquet-stringing department. 

After Ms. Scott leaves, Luke and Ted Thorndike dis- 
cuss the machine further. Luke indicates that the oldest of 
the current machines is about due for replacement and, if 
the Cromwell model is really faster, he will buy the 
Cromwell to replace it. It's possible that Cromwell models 
would also be purchased as other stringing machines 
approach the end of their operating lifetimes and are 
retired. Luke cautions Ted that the Cromwell must be pur- 
chased only if it is indeed faster than the current models — 
although Ms. Scott seems to have an honest face, he's never 
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trusted anybody whose briefcase contains brochures. The 
Cromwell must be quicker in stringing both racquetball 
racquets and tennis racquets. Otherwise, the firm will con- 
tinue purchasing and using the current model. 

Evaluating the Cromwell model, Ted measures the exact 
time required for each racquet in separate samples consisting 
of 40 racquetball racquets and 40 tennis racquets. The times 
are given here and are also in data file THORN10. 

40 racquetball racquets: 40 tennis racquets: 

2.97 3.43 3.10 2.79 3.41 3.22 3.36 3.46 
3.11 3.26 3.36 3.48 3.09 3.06 2.67 3.13 
2.97 3.09 3.18 2.71 3.41 2.83 3.20 3.32 
2.70 3.17 3.16 3.16 3.24 2.91 3.34 2.97 
3.14 3.36 3.89 3.21 3.13 2.92 2.94 3.18 

minutes 



Springdale Shopping Survey 

The Springdale computer database exercise of Chapter 2 
listed 30 shopping-related questions asked of 150 respon- 
dents in the community of Springdale. The data are in file 
SHOPPING. The coding key for these responses was also 
provided in the Chapter 2 exercise. In this section, hypoth- 
esis testing will be used to examine the significance of some 
of the survey results. 

From a managerial perspective, the information gained 
from this database exercise would be useful in determining 
whether the mean attitude toward a given area differed sig- 
nificantly from the "neutral" level, in observing and com- 
paring the mean attitude scores exhibited toward the three 
shopping areas, and in examining the strengths and weak- 
nesses of a manager's shopping area relative to the other 
two areas. If such a study were repeated from time to time, 
management could observe the extent to which the overall 
attitude toward their shopping area was becoming stronger 
or weaker in the eyes of the consumer, thus helping them 
select business strategies that could retain and take advan- 
tage of their area's strengths and either correct or minimize 
the effect of its weaknesses. 

1. Item C in the description of the data collection instru- 
ment lists variables 7, 8, and 9, which represent the 
respondent's general attitude toward each of the three 
shopping areas. Each of these variables has numeri- 
cally equal distances between the possible responses, 
and for purposes of analysis they may be considered to 
be of the interval scale of measurement, 
a. Calculate the mean attitude score toward the 
Springdale Mall shopping area and compute the 
sample standard deviation. Does this area seem to 
be well regarded by the respondents? 



Since Luke seems quite adamant against buying some- 
thing that isn't any faster than the current models, Ted 
must be very confident that any increased speeds measured 
are not simply due to chance. Using a computer statistical 
package to analyze the sample times, do the respective 
tests appear to warrant purchase of the Cromwell 
machine, or should it be returned to Cromwell at the end 
of the week? 



b. In a two-tail test at the 0.10 level, compare the 
sample mean calculated in part (a) to the 3.0 "neu- 
tral" value. Use the null hypothesis H 0 : /jl = 3.0 
and the alternative hypothesis H^. fj, + 3.0. 

c. Generate the 90% confidence interval for the pop- 
ulation mean and use this interval in verifying the 
conclusion obtained in part (b). 

d. What is the p-value associated with the hypothesis 
test? 

e. Repeat parts (a) through (d) for the Downtown 
shopping area. 

f. Repeat parts (a) through (d) for the West Mall 
shopping area. 

2. If Springdale Mall offered exactly the same benefits as 
the other two shopping areas, we would expect 
exactly one-third of those who expressed an opinion 
to select it as the area best fitting the description 
for variable number 10 ("Easy to return/exchange 
goods"). In testing whether Springdale Mall differs 
significantly from this expected proportion, the null 
and alternative hypotheses will be H 0 : it = 0.333 and 
Hy it + 0.333. Carry out the following analyses for 
Springdale Mall: 

a. Analyzing the data for variable 10, use the preced- 
ing null and alternative hypotheses in testing the 
null hypothesis at the 0.05 level. 

b. Determine the p-value for the test conducted in 
part (a). 

c. Repeat parts (a) and (b) for variables 11-17. 

d. Based on the preceding analyses, identify the 
principal strengths and weaknesses exhibited by 
Springdale Mall. 



3.77 3.97 4.08 4.36 

3.51 3.43 4.31 3.84 

3.91 4.47 4.27 3.70 

4.49 3.74 4.17 3.68 

3.71 3.77 3.90 4.14 



3.85 4.65 3.89 3.99 

3.86 3.50 4.23 3.88 
3.80 4.54 4.46 3.97 
4.09 4.27 3.84 3.51 
4.03 4.61 3.63 4.62 

minutes 
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Pronto Pizza (A) 




Pronto Pizza is a family-owned pizza restaurant in 
Vinemont, a small town of 20,000 people in upstate New 
York. Antonio Scapelli started the business 30 years ago as 
Antonio's Restaurant with just a few thousand dollars. 
Antonio, his wife, and their children, most of whom are 
now grown, operate the business. Several years ago, one of 
Antonio's sons, Tony Jr., graduated from NYU with an 
undergraduate degree in business administration. After 
graduation, he came back to manage the family business. 
Pronto Pizza was one of the earliest pizza restaurants to 
offer pizza delivery to homes. Fortunately, Tony had the 
foresight to make this business decision a few years ago. At 
the same time, he changed the restaurant's name from 
Antonio's to Pronto Pizza to emphasize the pizza delivery 
service. The restaurant has thrived since then and has 
become one of the leading businesses in the area. While 
many of their customers still "dine in" at the restaurant, 
nearly 90% of Pronto's current business is derived from 
the pizza delivery service. 

Recently, one of the national chains of fast-food pizza 
delivery services found its way to Vinemont, New York. In 
order to attract business, this new competitor guarantees 
delivery of its pizzas within 30 minutes after the order is 
placed. If the delivery is not made within 30 minutes, the 
customer receives the order without charge. Before long, 
there were signs that this new pizza restaurant was taking 
business away from Pronto Pizza. Tony realized that 
Pronto Pizza would have to offer a similar guarantee in 
order to remain competitive. 

After a careful cost analysis, Tony determined that to 
offer a guarantee of 29 minutes or less, Pronto's average 
delivery time would have to be 25 minutes or less. Tony 
thought that this would limit the percentage of "free piz- 
zas" under the guarantee to about 5% of all deliveries, 
which he had figured to be the break-even point for such a 
promotion. To be sure of Pronto's ability to deliver on a 
promise of 29 minutes or less, Tony knew that he needed 
to collect data on Pronto's pizza deliveries. 

Pronto Pizza's delivery service operates from 4:00 p.m. 
to midnight every day of the week. After an order for a 
pizza is phoned in, one of the two cooks is given the order 
for preparation. When the crust is prepared and the ingre- 
dients have been added, the pizza is placed on the belt of 
the conveyor oven. The speed of the conveyor is set so that 
pizzas come out perfectly, time after time. Once the pizza 
is ready and one of Pronto's drivers is available to make 
the delivery, the pizza is taken in a heat-insulated bag to 
the customer. Pronto uses approximately five to six drivers 




each night for deliveries. « 
Most of the drivers hired ? 
by Pronto Pizza are juniors | 
and seniors at the local i= 
high school. | 

Given the large num- J 
ber of deliveries made each i 
evening, Tony knew that he i 
could not possibly monitor £ 
every single delivery. He * 
had thought of the possibil- 
ity of having someone else collect the data, but given the 
importance of accurate data, he decided to make all of the 
measurements himself. This, of course, meant taking a 
random sample of, rather than all, deliveries over some 
time period. Tony decided to monitor deliveries over the 
course of a full month. During each hour of delivery ser- 
vice operation, he randomly selected a phoned-in order. 
He then carefully measured the time required to prepare 
the order and the amount of time that the order had to 
wait for a delivery person to become available. Tony 
would then go with the delivery person to accurately mea- 
sure the delivery time. After returning, Tony randomly 
selected an order placed during the next hour and repeated 
the process. At the end of the month, Tony had collected 
data on 240 deliveries. 

Once the data were available, Tony knew there were 
several issues that should be addressed. He was committed 
to going with the 29-minute delivery guarantee unless the 
data strongly indicated that the true average delivery time 
was greater than 25 minutes. How would he make this 
decision? Tony also realized that there were three compo- 
nents that could affect pizza delivery times: the prepara- 
tion time, the waiting time for an available driver, and the 
travel time to deliver the pizza to the customer. Tony 
hoped that he had collected sufficient data to allow him to 
determine how he might improve the delivery operation by 
reducing the overall delivery time. 

Assignment 

Tony has asked you for some assistance in interpreting the 
data he has collected. In particular, he needs to know 
whether the true average delivery time for Pronto Pizza 
might be more than 25 minutes. Use the data in the file 
PRONTO on the text CD in answering his question. A 
description of this data set is given in the accompanying 
Data Description section. Based on your examination of 
the data, provide Tony with some suggestions that would 
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seeing statistics: applet 1 



z-lnterval and Hypothesis Testing 

When this applet loads, it shows the hypothesized popula- 
tion mean for the nondirectional z-test discussed in this sec- 
tion as "Null Hyp 1.325." It also shows the sample mean 
("Sample Mean 1 .3229") along with the resulting 95% con- 
fidence interval for the population mean. In this example, we 
took only one sample. However, by moving the slider at the 
bottom of the graph, we can see what the confidence inter- 
val would have been if our sample mean had been lower or 
higher than the 1 .3229 minutes we actually obtained. 




1.305 1.31 1.316 1.32 1.3(25 1.33 1.335 1.34 1545 

1.3142 1.3318 
I I ±\ 




Note that the confidence interval limits shown in the 
graph may sometimes differ slightly from those we would 
calculate using the pocket calculator and our standard nor- 
mal distribution table. This is because the applet is using 
more exact values for z than we are able to show within 
printed tables like the one in the text. 

Applet Exercises 

12.1 When the applet initially loads, identify the sample 
mean and describe the 95% confidence interval for /x. 
Based on this confidence interval, would it seem believable 
that the true population mean might be 1.325 minutes? 

12.2 Using the slider, increase the sample mean to 
approximately 1.330 minutes. What is the 95% confi- 
dence interval for /jl7 Based on this confidence interval, 
would it seem believable that the true population mean 
might be 1 .325 minutes? 

12.3 Using the slider, decrease the sample mean to approx- 
imately 1.310 minutes. Based on this confidence interval, 
would it seem believable that the true population mean 
might be 1.325 minutes? 



help him in making his decision about the 29-minute deliv- 
ery guarantee and in improving his pizza delivery service. 
The case questions will assist you in your analysis of the 
data. Use important details from your analysis to support 
your recommendations. 

Data Description 

The PRONTO file on the text CD contains the data collected 
by Tony Scapelli over the past month on pizza deliveries. 
Data recorded for each delivery order are as shown in the 
next table. 

The variables are defined as follows: 



DAY: 



Day of the week ( 1 
7 = Sunday) 



Monday, 



HOUR: 
PREPJTIME: 

WAIT TIME: 



TRAVEL TIME: 



DISTANCE: 



Hour of the day (4-11 p.m.) 

Time required (in minutes) to prepare 

the order 

Time (in minutes) from completing 
preparation of the order until a delivery 
person was available to deliver the 
order 

Time (in minutes) it took the car to 
reach the delivery location 
Distance (in miles) from Pronto Pizza 
to the delivery location 



(Note: You will first have to help Tony by creating a new 
variable called TOT_TIME, which represents the total 
amount of time from the call being received to the delivery 



DAY 



HOUR 



PREP TIME 



WAIT TIME 



TRAVEL TIME 



DISTANCE 



14.86 
14.84 
15.41 
16.34 
15.19 



3.08 
13.81 
9.91 
2.08 
2.69 



6.02 
5.47 
8.99 
7.98 
9.01 



2.5 
3.3 
4.9 
3.8 
4.9 



r. s*~T 
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Statistical Power of a Test 

This applet allows "what-if" analyses of the power of a 
nondirectional test with null and alternative hypotheses H 0 : 
H = 1 0 versus H, : /x ^ 10. There are three sliders, allowing 
us to choose the a level for the test (left slider), the sample 
size (right slider), and the actual value of ti (identified as 
"Alt Hyp" and controlled by the bottom slider). As we 
move any of the sliders, we can immediately see the power 
of the test in the display at the top of the graph. Recall that 



alpha = 0.05 Power = 0.41 

Null Hyp 1G 




the power of the test is (1 - /3), and that it is the probabil- 
ity of rejecting a false null hypothesis. 

The distribution in the upper portion of the chart shows 
the hypothesized population mean (H 0 : /jl = 10) as well as 
the lower and upper critical values for the sample mean in 
a nondirectional test for the combination of a and n being 
used. The sum of the blue areas in the lower distribution 
show, for a given a, actual fi, and n, the probability that X 
will fall outside the critical values and cause us to reject the 
null hypothesis. 

Applet Exercises 

13.1 Using the left and right sliders, set up the test so that 
a = 0.10 and n = 20. Now move the bottom slider so that 
the actual /x is as close as possible to 10 without being 
equal to 10 (e.g., 10.01). What is the value for the power 
of the test? Is this the value you would expect? 

13.2 Set the left, right, and bottom sliders so that a = 0.05, 
n = 15, and ti = the same value you selected in Applet 
Exercise 13.1 . Now move the left slider upward and down- 
ward to change the a level for the test. In what way(s) do 
these changes in a affect the power of the test? 

13.3 Set the left and bottom sliders so that a = 0.05 and 
Ijl = 1 1.2. Now move the right slider upward and down- 
ward to change the sample size for the test. Observe the 
power of the test for each of the following values for n: 2, 
10, 20, 40, 60, 80, and 100. 

13.4 Set the right and bottom sliders so that n = 20 and 
/jl = 1 1 .2. Now move the left slider upward and downward 
to change the a level for the test. Observe the power of the 
test for each of the following values for a: 0.01, 0.02, 0.05, 
0.10, 0.20, 0.30, 0.40, and 0.50. 



being made. TOT_TIME is the total of PREP_TIME, 
WAITTIME, and TRAVEL_TIME. It is the time to which 
the guarantee would be applied and is the time of interest 
in the questions below.) 

1. Is there sufficient evidence in the data to conclude that 
the average time to deliver a pizza, from the time of 
the phone call to the time of delivery, is greater than 
25 minutes? Using a level of significance of your 
choice, perform an appropriate statistical analysis and 
attach any supporting computer output. What is the 
p-value for the test? 

2. Based on the data Tony has already collected, approx- 
imately what percentage of the time will Pronto Pizza 
fail to meet its guaranteed time of 29 minutes or less? 
Explain how you arrived at your answer and discuss 
any assumptions that you are making. Does it appear 



that they will meet their requirement of failing to meet 
the guarantee 5% of the time or less? 

3. Compare the average delivery times for different days 
of the week. Does the day of the week seem to have an 
effect on the average time a customer will have to wait 
for his or her pizza? Attach any supporting output for 
your answer. 

4. Compare the average delivery times for different hours 
of the day. Does the time of day seem to have an effect 
on the average time a customer will have to wait for 
his or her pizza? Attach any supporting output for 
your answer. 

5. Based on your analysis of the data, what action (or 
actions) would you recommend to the owners of 
Pronto Pizza to improve their operation? Attach any 
supporting output that led to your conclusion. 




Gender Stereotypes and Asking For Directions 



Many of us have heard the stereotypical observation that men absolutely refuse to 
stop and ask for directions, preferring instead to wander about until they "find their 
own way." Is this just a stereotype, or is there really something to it? When Lincoln 
Mercury was carrying out research prior to the design and introduction of its in- 
vehicle navigational systems, the company surveyed men and women with regard to 
their driving characteristics and navigational habits. 

According to the survey, 61 % of the female respondents said they would stop 
and ask for directions once they figured out they were lost. On the other hand, only 
42% of the male respondents said they would stop and ask for directions under 
the same circumstances. For purposes of our discussion, we'll 



assume that Lincoln Mercury surveyed 200 per- 
sons of each gender. 

Is it possible that the lower rate of 
direction-asking for men — a sample proportion 
of just 0.42 (84 out of 200) versus 0.61 (1 22 
out of 200) — may have occurred simply by 
chance variation? Actually, if the population pro- 
portions were really the same, there would be 
only a 0.0001 probability of the direction-asking 
proportion for men being this much lower just 
by chance. Section 1 1 .6 of this chapter will 
provide some guidance regarding a statistical test 
with which you can compare these gender 
results and verify the conclusion for yourself. 
There will be no need to stop at Section 1 1 .5 to 
ask for directions. 



the 




E 
S3 



Source: Jamie LaReau, "Lincoln Uses Gender Stereotypes to Sell Navigation 
System," Automotive News, May 30, 2005, p. 32. 
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learning 
objectives 

After reading this 
chapter, you should 
be able to: 



• Select and use the appropriate hypothesis test in comparing the means of two 
independent samples. 

• Test the difference between sample means when the samples are not 
independent. 

• Test the difference between proportions for two independent samples. 

• Determine whether two independent samples could have come from populations 
having the same standard deviation. 



11.1 INTRODUCTION 



One of the most useful applications of business statistics involves comparing two 
samples to examine whether a difference between them is (1) significant or 
(2) likely to have been due to chance. This lends itself quite well to the analysis of 
data from experiments such as the following: 



example 



Comparing Samples 

A local YMCA, in the early stages of telephone solicitation to raise funds for ex- 
panding its gymnasium, is testing two appeals. Of 100 residents approached with 
appeal A, 21% pledged a donation. Of 150 presented with appeal B, 28% 
pledged a donation. 

SOLUTION 

Is B really a superior appeal, or could its advantage have been merely the result of 
chance variation from one sample to the next? Using the techniques in this chap- 
ter, we can reach a conclusion on this and similar questions. The approach will be 
very similar to that in Chapter 10, where we dealt with one sample statistic (either 
x or p), its standard error, and the level of significance at which it differed from its 
hypothesized value. 



Sections 11.2-11.4 and 11.6 deal with the comparison of two means or two 
proportions from independent samples. Independent samples are those for which 
the selection process for one is not related to that for the other. For example, in an 
experiment, independent samples occur when persons are randomly assigned to the 
experimental and control groups. In these sections, a hypothesis-testing procedure 
very similar to that in Chapter 10 will be followed. As before, either one or two crit- 
ical value(s) will be identified, and then a decision rule will be applied to see if the 
calculated value of the test statistic falls into a rejection region specified by the rule. 

When comparing two independent samples, the null and alternative hypothe- 
ses can be expressed in terms of either the population parameters and /x 2 , or tt 1 
and 7r 2 ) or the sampling distribution of the difference between the sample statistics 
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Null and Alternative 
Hypotheses Expressed 
in Terms of the 
Population Means 



Two-tail test: 



Left-tail test: 



Hy (l,¥= fl 2 

t: 

H Q . /i 1 > (L 2 



Hy /i 1 < fl 2 

Right-tail test: 

H Q . /i 1 < (L 2 

Hy /i 1 > /i 2 



Hypotheses Expressed in 
Terms of the Sampling 
Distribution of the Difference 
Between the Sample Means 



or 



or 



or 



H 0 : M(x 1 -x 2 ) = ° 



H 0- M(x,-x 2 ) a 0 



H V M (jfl - 3 y<0 



H v ^,-x 2 )>0 



TABLE 11.1 



When comparing means 
from independent samples, 
null and alternative 
hypotheses can be 
expressed in terms of the 
population parameters (on 
the left) or described by the 
mean of the sampling 
distribution of the 
difference between the 
sample statistics (on the 
right). This also applies to 
testing two sample propor- 
tions. For example, 
H 0 : tt, = tt 2 
is the same as 

H o- ^(p,-p 2 ) = °- 



(x 1 and x 2 , or p 1 and p 2 ). These approaches to describing the null and alternative 
hypotheses are equivalent and are demonstrated in Table 11.1. 

Section 11.5 is concerned with the comparison of means for two dependent 
samples. Samples are dependent when the selection process for one is related to 
the selection process for the other. A typical example of dependent samples occurs 
when we have before-and-after measures of the same individuals or objects. In 
this case we are interested in only one variable: the difference between measure- 
ments for each person or object. 

In this chapter, we present three different methods for comparing the means 
of two independent samples: the pooled-variances £-test (Section 11.2), the 
unequal-variances i-test (Section 11.3), and the z-test (Section 11.4). Figure 11.1 
(page 364) summarizes the procedure for selecting which test to use in comparing 
the sample means. 

As shown in Figure 11.1, an important factor in choosing between the pooled- 
variances f-test and the unequal-variances f-test is whether we can assume the 
population standard deviations (and, hence, the variances) might be equal. 
Section 11.7 provides a hypothesis-testing procedure by which we can actually 
test this possibility. However, for the time being, we will use a less rigorous 
standard — that is, based on the sample standard deviations, whether it appears 
that the population standard deviations might be equal. 



THE POOLED-VARIANCES t-TEST FOR COMPARING 11.2 
THE MEANS OF TWO INDEPENDENT SAMPLES 



Situations can arise where we'd like to examine whether the difference between 
the means of two independent samples is large enough to warrant rejecting the 
possibility that their population means are the same. In this type of setting, the al- 
ternative conclusion is that the difference between the sample means is small 
enough to have occurred by chance, and that the population means really could 
be equal. 
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FIGURE 11.1 



Selecting the test statistic 
for hypothesis tests 
comparing the means of 
two independent samples 



Hypothesis test 



Are the population standard 
deviations equal? 
Does o - ! = 02? 




Yes 



For any sample sizes 



Only if n l and n 2 both > 30 



Compute the pooled estimate 
of the common variance as 

2 _ («i - D^i + (w 2 - 1)4 
' p ~ ra,+n 2 -2 

and perform a pooled-variances 
Mest where 

(■*l-^)-(/*l-/*2)o 



(#■=«! + n 2 - 2 

and (p, l - fj, 2 )o is from H 0 

Test assumes samples are from 
normal populations with equal 
standard deviations. 

Section 11.2 

See Note 1 



Perform an unequal-variances 
Mest where 

(xi-X2)-Ol-/i 2 )o 



2 2 

4 4 

I- — 

"1 "2 



and 



[(4/ ni ) + (4/«2)] 2 



V 4/ Ml ) 2 (4 I '"if 

+ 

«1 — 1 n 2 —\ 

and fjtj - /i2)o is from // 0 



Test assumes samples are from 
normal populations. When either 
test can be applied to the same 
data, the unequal-variances Mest is 
preferable to the z-test — especially 
when doing the test with computer 
assistance. 

Section 11.3 

See Note 2 



A z-test approximation can be 
performed where 

(*l-^)-Oil-/i 2 )o 



Si s 2 

I- — 

"I "2 



a\ and a\ 



and - fj, 2 ) 0 is from H 0 

The central limit theorem 
prevails and there are no 
limitations on the population 
distributions. This test may also 
be more convenient for solutions 
based on pocket calculators. 

Section 11.4 

See Note 3 



1. Section 11.7 describes a procedure for testing the null hypothesis that <Tj = a 2 - 
However, when using a computer and statistical software, it may be more convenient to 
simply bypass this assumption and apply the unequal-variances f-test described in Sec- 
tion 11.3. 

2. This test involves a corrected df value that is smaller than if <T] = u 2 had been assumed. 
When using a computer and statistical software, this test can be used routinely instead of 
the other two tests shown here. The nature of the df expression makes hand calculations 
somewhat cumbersome. The normality assumption becomes less important for larger sample 
sizes. 

3. When samples sizes are large (each n a 30), the z-test is a useful alternative to the unequal 
variances Mest, and may be more convenient when hand calculations are involved. 

4. For each test, Computer Solutions within the chapter describe the Excel and Minitab procedures 
for carrying out the test. Procedures based on data files and those based on summary statistics are 
included. 



Chapter 11: Hypothesis Tests Involving Two Sample Means or Proportions 



365 



Our use of the £-test assumes that (1) the (unknown) population standard devia- 
tions are equal, and (2) the populations are at least approximately normally 
distributed. Because of the central limit theorem, the assumption of population 
normality becomes less important for larger sample sizes. Although it is often as- 
sociated only with small-sample tests, the t distribution is appropriate when the 
population standard deviations are unknown, regardless of how large or small the 
samples happen to be. 



The West used here is known as the pooled-variances Mest because it involves 
the calculation of an estimated value for the variance that both populations are as- 
sumed to share. This pooled estimate is shown in Figure 11.1 as s2. The number 
of degrees of freedom associated with the test will be df = n 1 + n 2 — 2, and the test 
statistic is calculated as follows: 



Test statistic for comparing the means of two independent samples, a 1 and a 2 as ~ 
sumed to be equal: 



with 



— + — 



where x^ and x 2 = means of samples 1 and 2 
(/ij — /jl 2 ) = ^ e hyP otnes rz e cl difference 
between the population 
means 

n 1 and n 2 = sizes of samples 1 and 2 
s, and s 2 = standard deviations of 
samples 1 and 2 
Sp = pooled estimate of the 

common standard deviation 



l)sf + («, 



n-y + n 2 — 2 



Ds 



and df = n x + n 2 — 2 



Confidence interval for — /jl 2 : 



with 



a = (1 — confidence coefficient) 



The numerator of the i-statistic includes (p, 1 — /J. 2 ) 0 , the hypothesized value 
of the difference between the population means. The hypothesized difference is 
generally zero in tests like those in this chapter. The term in the denominator of 
the f-statistic is the estimated standard error of the difference between the sample 
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means. It is comparable to the standard error of the sampling distribution for the 
sample mean discussed in Chapter 10. Also shown is the confidence interval for 
the difference between the population means. 



example 



Pooled-Variances t-Test 

Entrepreneurs developing an accounting review program for persons preparing to 
take the Certified Public Accountant (CPA) examination are considering two pos- 
sible formats for conducting the review sessions. A random sample of 10 students 
are trained using format 1, and then their number of errors is recorded for a pro- 
totype examination. Another random sample of 12 individuals are trained 
according to format 2, and their errors are similarly recorded for the same exami- 
nation. For the 10 students trained with format 1, the individual performances are 
11, 8, 8, 3, 7, 5, 9, 5, 1, and 3 errors. For the 12 students trained with format 2, 
the individual performances are 10, 11, 9, 7, 2, 11, 12, 3, 6, 7, 8, and 12 errors. 
These data are in file CX11CPA. 



SOLUTION 

Since the study was not conducted with directionality in mind, the appropriate 
test will be two-tail. The null hypothesis is H Q : = /jl 2 , and the alternative 
hypothesis is H^: /Aj + /jl 2 - The null and alternative hypotheses may also be 
expressed as follows: 

• Null hypothesis 

H Q : p^-x ) = 0 The two review formats are equally effective. 

• Alternative hypothesis 

Hf. |U^_3^) ¥= 0 The two review formats are not equally effective. 

In comparing the performances of the two groups, the 0.10 level of signifi- 
cance will be used. Based on these data, the 10 members of group 1 made an 
average of 6.000 errors, with a sample standard deviation of 3.127. The 12 stu- 
dents trained with format 2 made an average of 8.167 errors, with a standard 
deviation of 3.326. The sample standard deviations do not appear to be very dif- 
ferent, and we will assume that the population standard deviations could be 
equal. (As noted previously, this rather informal inference can be replaced by the 
separate hypothesis test in Section 11.7.) In applying the pooled-variances £-test, 
the pooled estimate of the common variance, s 2 , and the test statistic, t, can be 
calculated as 

2 (10 - 1)(3.127) 2 + (12 - 1)(3.326) 2 

si = = 10.484 

P 10 + 12-2 

and 



-1.563 
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The two training formats are equally effective. 
The two training formats are not equally effective. 



X\ and x 2 are the mean numbers of errors for group 1 and group 2. 



FIGURE 11.2 



In this two-tail pooled- 
variances f-test, we are not 
able to reject the null 
hypothesis that the two 
accounting review formats 
could be equally effective. 



-<— Reject H 0 - 
Area = 0.05 -, 



Do not reject H 0 



Reject H 0 —>■ 
r- Area = 0.05 



-1.725 



Test statistic: 
t = -1.563 



t = +1.725 



For the 0.10 level of significance, the critical values of the test statistic will be 
t = —1.725 and t = +1.725. These are based on the number of degrees of free- 
dom, df = (n 1 + n 2 — 2), or (10 + 12 — 2) = 20, and the specification that the 
two tail areas must add up to 0.10. The decision rule is to reject the null hypoth- 
esis (i.e., conclude that the population means are not equal for the two review 
formats) if the calculated test statistic is either less than t = —1.725 or greater 
than t = +1.725. 

As Figure 11.2 shows, the calculated test statistic, t = —1.563, falls into the 
nonrejection region of the test. At the 0.10 level, we must conclude that the re- 
view formats are equally effective in training individuals for the CPA examina- 
tion. For this level of significance, the observed difference between the groups' 
mean errors is judged to have been due to chance. 

Based on the sample data, we will also determine the 90% confidence interval 
for (jitj — ju 2 ). For df = 20 and a = 0.10, this is 



t al2 



4 


1 


1 




+ — 




n \ 





(6.000 - 8.167) ± 1.725 A/10.484 



J_ J_ 
10 12 



= -2.167 ± 2.392, or from -4.559 to +0.225. 

The hypothesized difference (zero) is contained within the 90% confidence 
interval, so we are 90% confident the population means could be the same. As 
discussed in Chapter 10, a nondirectional test at the a level of significance and a 
100(1 — a)% confidence interval will lead to the same conclusion. Computer 
Solutions 11.1 shows Excel and Minitab procedures for the pooled-variances 
f-test. 



computer solutions 1 1.1 
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Pooled-Variances t-Test for (/«.., - /u 2 ). Population Variances Unknown but 
Assumed Equal 

These procedures show how to use the pooled-variances f-test to compare the means of two independent 
samples. The population variances are unknown, but assumed to be equal. 



EXCEL 





D 


E 


F 


1 


t-Test: Two-Sample Assuming Equal Variances 


2 








3 




format 1 


format2 


4 


Mean 


6.000 


8.167 


5 


Variance 


9.778 


11.061 


6 


Observations 


10 


12 


7 


Pooled Variance 


10.48333 




8 


Hypothesized Mean Difference 


0 




9 


df 


20 




10 


tStat 


-1.563 




11 


P(T<=t) one-tail 


0.067 




12 


t Critical one-tail 


1.325 




13 


P(T<=t) two-tail 


0.134 




14 


t Critical two-tail 


1.725 





Excel pooled-variances f-test for - /jl 2 ), based on raw data 

1 . For the sample data (file CX11CPA.XLS) on which Figure 1 1 .2 is based, with the label and 10 data values for format 1 
in column A, and the label and 1 2 data values for format 2 in column B: Click Tools. Click Data Analysis. Click 
t-Test: Two-Sample Assuming Equal Variances. Click OK. 

2. Enter A1:A11 into the Variable 1 Range box and B1:B13 into the Variable 2 Range box. Enter 0 into the Hypoth- 
esized Mean Difference box. Click to select Labels. Specify the significance level for the test by entering 0.10 into 
the Alpha box. Select Output Range and enter D1 into the box. Click OK. The results will be as shown above. This 
is a two-tail test, so we refer to the 0. 1 34 p-value. (Excel doesn't ask whether our test is one-tail or two-tail, but it pro- 
vides critical values and p-values for both.) 

3. To obtain a confidence interval for - /x 2 ), it will be necessary to refer to the procedure described below. It is based 
on summary statistics for the samples. 

Excel Unstacking Note: When this analysis is based on raw data, Excel requires that the two samples be in two distinctly 
separate fields (e.g., two separate columns). If the data are in one column and the subscripts identifying group member- 
ship are in another column, it will be necessary to "unstack" the data. For example, if the data were in column A and 
the subscripts (1 and 2, to denote prep formats 1 and 2) were in column B, and each column had a label at the top: Click 
and drag to select the numbers and labels in columns A and B. Click Data. Click Sort. Enter format into the first sort 
by box. Select Ascending. Select My list has header row. Click OK. The data for sample 1 will now be listed directly 
above the data for sample 2 — from here, we need only select each grouping of data (e.g., scores) and either move or copy 
it to its own column. The result will be one column for the scores of group 1 and an adjacent column for the scores of 
group 2. 

Excel pooled-variances t-test and confidence interval for - fi 2 ), based on summary statistics 

1 . Using the summary statistics associated with CX11CPA.XLS: Open the Test Statistics.XLS workbook, supplied with the 
text. 

2. Using the arrows at the bottom left, select the f-Test_2 Means (Eq-Var) worksheet. 
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3. Enter the sample means, variances, and sizes into the appropriate cells. Enter the hypothesized difference (0, in this 
case) and the desired alpha level for the test. The calculated f-statistic and a two-tail p-value will be shown at the right. 

4. To obtain a confidence interval for - n 2 ), follow steps 1-3, but open the Estimators.XLS workbook and select the 
f-Estimate_2 Means (Eq-Var) worksheet. Enter the desired confidence level as a decimal fraction (e.g., 0.90). 

Note: As an alternative, you can use Excel worksheet template TMT2POOL.XLS, supplied with the text. It simultaneously 
conducts the test and reports a confidence interval. The steps are described within the template. 



MINITAB 

Minitab pooled-variances t-test and confidence interval for (ju, 1 - fi 2 ), based on raw data 

Two-Sample T-Test and CI: formatl, format2 

Two-sample T for formatl vs format2 

N Mean StDev SE Mean 
formatl 10 6.00 3.13 0.99 
format2 12 8.17 3.33 0.96 

Difference = mu (formatl) - mu (format2) 

Estimate for difference: -2.17 

90% CI for difference: (-4.56, 0.22) 

T-Test of difference = 0 (vs not =) : T-Value = -1.56 P-Value = 0.134 DF = 20 
Both use Pooled StDev = 3.2378 

1 . For example, using the data (file CX11CPA.MTW) on which Figure 1 1 .2 is based, with the data values for format 1 in 
column C1 and the data values for format 2 in column C2: Click Stat. Select Basic Statistics. Click 2-Sample t. 

2. Select Samples in different columns. Enter C1 into the First box and C2 into the Second box. Click to select As- 
sume equal variances. (Note: If all the data had been in a single column [i.e., "stacked"], it would have been nec- 
essary to select the Samples in one column option, then to specify the column containing the data and the column 
containing the subscripts identifying group membership.) 

3. Click Options. Enter the desired confidence level as a percentage (e.g., 90.0) into the Confidence Level box. Enter 
the hypothesized difference (0) into the Test difference box. Within the Alternative box, select not equal. Click OK. 
Click OK. 

Minitab pooled-variances t-test and confidence interval for (ju, 1 - fi 2 ), based on summary data 

Follow steps 1 through 3 above, but select Summarized data in step 2 and insert the appropriate summary statistics 
into the Sample size. Mean, and Standard deviation boxes for each sample. 





11.1 "When comparing two sample means, the f-test 
should be used only when the sample sizes are less than 
30." Comment. 

11.2 An educator is considering two different videotapes 
for use in a half-day session designed to introduce 
students to the basics of economics. Students have been 
randomly assigned to two groups, and they all take the 
same written examination after viewing the videotape. 
The scores are summarized here. Assuming normal popu- 
lations with equal standard deviations, does it appear 



that the two videotapes could be equally effective? What 
is the most accurate statement that could be made about 
the p-value for the test? 



Videotape 1: 
Videotape 2: 



x x = 77.1 
x 2 = 80.0 



Sj = 7.8 
s 2 = 8.1 



«j = 25 
« 2 = 25 



11.3 Using independent random samples, a researcher is 
comparing the number of hours of television viewed last 
week for high school seniors versus sophomores. The re- 
sults are shown here. Assuming normal populations with 
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equal standard deviations, does it appear that the average 
number of television hours per week could be equal for 
these two populations? What is the most accurate state- 
ment that could be made about the p-value for the test? 

Seniors: x 1 = 3.9 hours s 1 = 1.2 hours »j = 32 

Sophomores: x 2 = 3.5 hours s 2 = 1.4 hours » 2 = 30 

1 1 .4 An ambulance service located at the edge of town is 
responsible for serving a large office building in the down- 
town area. Testing different routes for getting from the 
ambulance station to the office building, a driver finds that 
five trips using route A take an average of 5.9 minutes, 
with a standard deviation of 1.4 minutes; six trips using 
route B take an average of 4.2 minutes, with a standard 
deviation of 1.8 minutes. Assuming normal populations 
with equal standard deviations, and using the 0.10 level, is 
there a significant difference between the two routes? Con- 
struct and interpret the 90% confidence interval for the 
difference between the population means. 

11.5 A maintenance supervisor is comparing the standard 
version of an instructional booklet with one that has been 
claimed to be superior. An experiment is conducted in 
which 26 technicians are divided into two groups, 
provided with one of the booklets, then given a test a 
week later. For the 13 using the standard version, the 
average exam score was 72.0, with a standard deviation 
of 9.3. For the 13 given the new version, the average 
score was 80.2, with a standard deviation of 10.1. Assum- 
ing normal populations with equal standard deviations, 
and using the 0.05 level of significance, does the new 
booklet appear to be better than the standard version? 

1 1 .6 A sample of 40 investment customers serviced by an 
account manager are found to have had an average of 
$23,000 in transactions during the past year, with a stan- 
dard deviation of $8500. A sample of 30 customers 
serviced by another account manager averaged $28,000 in 
transactions, with a standard deviation of $11,000. 
Assuming the population standard deviations are equal, 
use the 0.05 level of significance in testing whether the 
population means could be equal for customers serviced by 
the two account managers. Using the appropriate statisti- 
cal table, what is the most accurate statement we can make 
about the p-value for this test? Construct and interpret the 
95% confidence interval for the difference between the 
population means. 

11.7 Comparing dexterity-test scores of workers on the 
day shift versus those on the night shift, the production 
manager of a large electronics plant finds that a sample of 
37 workers from the day shift have an average score of 
73.1, with a standard deviation of 12.3. For 42 workers 
from the night shift, the average score was 77.3, with a 
standard deviation of 8.4. Assuming the population 
standard deviations are equal, use the 0.05 level of 
significance in comparing the average scores for the two 
shifts. Using the appropriate statistical table, what is 



the most accurate statement we can make about the 
p-value for this test? Construct and interpret the 95% 
confidence interval for the difference between the 
population means. 

1 1 .8 Sheila Smith, the manager of a large resort's main 
hotel, has been receiving complaints from some guests 
that they are not being provided with prompt service upon 
approaching the front desk. In particular, she is concerned 
that desk staff might be providing female guests with less 
prompt service than their male counterparts. In observing 
a sample of 34 male guests, she finds it takes an average of 
15.2 seconds, with a standard deviation of 5.9 seconds, 
for them to be greeted after their arrival at the front desk. 
For a sample of 39 female guests, the mean and standard 
deviation are 17.4 seconds and 6.4 seconds, respectively. 
Assuming the population standard deviations to be equal, 
use the 0.05 level of significance in examining whether the 
population mean time for serving female guests might 
actually be no greater than that for serving male guests. 
Using the appropriate statistical table, what is the most 
accurate statement we can make about the p-value for 

the test? 

11.9 Media observers have been examining the 
number of minutes devoted to business and financial 
news during the half-hour evening news broadcasts of 
two local television channels. For each channel, they 
have randomly selected 10 weekday broadcasts and 
observed the number of minutes spent on business 
and financial news during that broadcast. The times 
measured in these independent samples are shown 
here. Assuming normal populations with equal 
standard deviations, use the 0.10 level of significance 
in testing whether the population means might actually 
be the same. Using the appropriate statistical table, 
what is the most accurate statement we can make 
about the p-value for the test? Construct and interpret 
the 90% confidence interval for the difference between 
the population means. 

Channel 2: 3.8 2.7 4.9 3.4 3.7 4.5 4.2 2.8 3.5 4.6 minutes 
Channel 4: 3.6 4.0 4.5 5.2 4.8 4.3 5.7 3.5 3.7 5.8 minutes 

11.10 In a test of the effectiveness of a new battery design, 
16 battery-powered music boxes are randomly provided 
with either the old design or the new version. Hours of 
playing time before battery failure were as follows: 

8 boxes, 3.3, 6.4, 3.9, 5.4, 

new battery type: 5.1, 4.6, 4.9, 7.2 hrs 

8 boxes, 4.2, 2.9, 4.5, 4.9, 

old battery type: 5.0, 5.1, 3.2, 4.0 hrs 

Assuming normal populations with equal standard 
deviations, use the 0.05 level to determine whether the 
new battery could be better than the old design. Using the 
appropriate statistical table, what is the most accurate 
statement we can make about the p-value for this test? 



Chapter 11: Hypothesis Tests Involving Two Sample Means or Proportions 



371 



11.11 A nutritionist has noticed a FoodFarm ad stating 
the company's peanut butter contains less fat than that 
produced by a major competitor. She purchases 11 8-ounce 
jars of each brand and measures the fat content of each. 
The 11 FoodFarm jars had an average of 31.3 grams of fat, 
with a standard deviation of 2.1 grams. The 11 jars from 
the other company had an average of 33.2 grams of fat, 
with a standard deviation of 1.8 grams. Assuming normal 
populations with equal standard deviations, use the 0.05 
level of significance in examining whether FoodFarm 's ad 
claim could be valid. What is the most accurate statement 
that could be made about the p-value for this test? 

/ data set/ Note: Exercises 11.12—11.17 require a 
computer and statistical software. 

11.12 In a study by WebCriteria, it took an average of 
2.1 minutes to locate information and buy products on 
the Delta Airlines website, compared to an average of 
2.7 minutes on the British Airways site. Assume that data 
file XR11012 contains the times (in minutes) required dur- 
ing visits to the two websites. In a suitable one-tail test at 
the 0.01 level, was the mean time for the Delta visits sig- 
nificantly less than that for the British Airway visits? 
Identify and interpret the p-value for the test, source: 
Marcy E. Mullins, "Travel," USA Today, March 20, 2000, p. 1B. 

11.13 In Los Angeles, doctors carried out an experi- 
ment in which 105 sedentary people whose arteries were 
photographed were given a photograph of the inside of 
their arteries to carry around. Another 105 people for 
whom the same medical procedure was performed 
were not given a photograph to take with them. After 

6 months, the photo group had lost an average of 
17.6 pounds, while the nonphoto group had lost an 
average of 11.0 pounds. Assume that data file XR11013 
contains the weight loss for members of each group. In a 
suitable one-tail test at the 0.05 level, was the mean 
weight loss for the photo group significantly greater 
than for the nonphoto group? Identify and interpret the 



p-value for the test, source: Nanci Hellmich, "Artery Images 
Inspire Pictures of Health," USA Today, May 3, 1999, p. 10D. 

11.14 The New York Stock Exchange has reported that 
the average number of stocks directly held by senior citi- 
zens is 4.3, compared to 2.0 stocks for "Baby Boomers." 
Assume that data file XR11014 contains the number of 
stocks held by persons sampled from each age group. 
Based on the results of a two-tail test at the 0.10 level, 
comment on whether the difference between the sample 
means could have simply occurred by chance. Identify 
and interpret the p-value for the test, source: New York Stock 
Exchange, Fact Book 1998, p. 57. 

11.15 Using the sample results in Exercise 11.14, 
construct and interpret the 90% confidence interval for 
the difference between the population means. Is the 
hypothesized difference (0.00) within the interval? Given 
the presence or absence of the 0.00 value within the inter- 
val, is this consistent with the findings of the hypothesis 
test conducted in Exercise 11.14? 

11.16 An engineer has measured the hardness scores for a 
sample of conveyor-belt support bearings that have been 
hardened by two different methods. The first method is 
used by her company, and the second method is known to 
be used by a number of other companies in the industry. 
With the resulting data in file XR11016, use the 0.05 level 
of significance in comparing the mean hardness scores of 
the two samples, and comment on the possibility that the 
difference between the sample means could have occurred 
by chance. Identify and interpret the p-value for the test. 

11.17 Using the sample results in Exercise 11.16, 
construct and interpret the 95% confidence interval for 
the difference between the population means. Is the 
hypothesized difference (0.00) within the interval? Given 
the presence or absence of the 0.00 value within the inter- 
val, is this consistent with the findings of the hypothesis 
test conducted in Exercise 11.16? 



THE UNEQUAL-VARIANCES t-TEST FOR COMPARING 11.3 
THE MEANS OF TWO INDEPENDENT SAMPLES 



When the population standard deviations are unknown and are not assumed to 
be equal, pooling the sample standard deviations into a single estimate of their 
common population value is no longer applicable. As a result, s x and s 2 must be 
used to estimate their respective population standard deviations, o~ 1 and <x 2 . The 
test assumes the populations to be at least approximately normally distributed, an 
assumption that becomes less important for larger sample sizes. 

In the unequal-variances £-test, the ^-statistic expression is straightforward, but 
the df formula is a little more complex — it is a correction formula that provides a 
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df value that is smaller than its counterpart in the preceding section. For accuracy, it's 
best to maintain a lot of decimal places if you are computing off with a pocket calcu- 
lator. If we are using the computer and statistical software, the unequal-variances 
i-test presents no computational difficulties, and is the preferred method for 
comparing the means of two independent samples, regardless of the sample sizes. 
The test statistic, df, and confidence interval expressions for this test are shown here: 

Unequal-variances i-test for comparing the means of two independent samples, a 1 
and tr 2 unknown and not assumed to be equal: 




where x 1 and x 2 = means of samples 1 and 2 
(yu-j — yu. 7 ) 0 = hypothesized difference 
between the population 



means 

n 1 and n 2 = sizes of samples 1 and 2 
Sj and s 2 = standard deviations of 
samples 1 and 2 

with 

[(5?/ Wl ) + (s\ln 2 f 
f ~ (sj/n,) 2 | (s 2 2 /n 2 ) 2 
— 1 n 2 — 1 

Confidence interval for /x 1 — /j, 2 : 




with 

a = (1 — confidence coefficient) 



example .4 



Unequal-Variances t-Test 

The makers of Graphlex, a graphite additive for engine oil, have conducted an 
experimental study to determine the effectiveness of their product in improving 
the fuel efficiency of automobiles. In cooperation with the Metropolitan Cab 
Company, they've randomly divided the company's cabs into two groups of equal 
size. Graphlex was added to the engine oil of the 45 cabs in the experimental 
group, while the 45 cabs in the control group continued to operate with the usual 
lubricant. Drivers were not informed of the experiment. After 1 month, fuel effi- 
ciency records were examined. For the 45 cabs using Graphlex, the average cab 
achieved 18.94 miles per gallon (mpg), with a standard deviation of 3.90 mpg. 
For the 45 cabs not using Graphlex, the average mpg was 17.51, with a standard 
deviation of 2.87 mpg. The underlying data are in file CX11MPG. Graphlex is 
preparing a national advertising campaign to promote its ability to improve fuel 
efficiency. 
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H 0 : mpg with Graphlex is no higher than for ordinary oil. 



H 



1- ^(xi-xr) 



r._^>0 



Graphlex improves fuel economy. 



and x 2 are mpg averages for cabs with and without Graphlex. 



Do not reject H Q - 



Reject H 0 — *- 
Area = 0.05 



t = +1.664 



Test statistic: 
f = +1.98 



FIGURE 11.3 



The makers of Graphlex 
claim their graphite oil 
additive improves the fuel 
efficiency of automobiles. 
In this right-tail unequal- 
variances test at the 0.05 
level, the results indicate 
they may be correct. 



SOLUTION 

The results will be evaluated at the 0.05 level of significance. Since the purpose of 
the study was to determine whether cabs using Graphlex (group 1) get better fuel 
economy than cabs without it (group 2), the null and alternative hypotheses are 
H 0 : < /jl 2 , and H^. > fi 2 . Expressed in terms of the sampling distribution of 
the difference between sample means, the null and alternative hypotheses can be 
stated as follows: 

• Null hypothesis 

H Q : fji^- _ - j < 0 mpg with Graphlex is no higher than 
with conventional oil. 

• Alternative hypothesis 

H^: j j >0 mpg with Graphlex is higher. 

For these data, the values for the test statistic (t) and the number of degrees of 
freedom (df) are calculated as 

(x 1 - * ) - ( Ml - fi 2 ) Q (18.94 - 17.51) - 0 



»« ru V 45 45 



1.98 



and 

[(«?/«,) + (s> 7 )F [(3.90 2 /45) + (2.87 2 /45)P 

df = 2 in = l - = -. — -. — % = 80.85, rounded to 81 

1 (sj/ntf (s 2 2 /n 2 f (3.90 2 /45) 2 | (2.87 2 /45) 2 

n x -\ n 2 -\ 45 - 1 45 -1 

For the 0.05 level of significance, df = 81, and a right-tail test, the critical 
value of the test statistic is t = +1.664. The decision rule is, "Reject H 0 if the 
calculated test statistic is greater than t = +1.664, otherwise do not reject." As 
Figure 11.3 shows, the calculated test statistic exceeds the critical value, the null 
hypothesis is rejected, and we conclude that Graphlex really works. 
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As we saw in Chapter 10, different levels of significance can lead to different 
conclusions. For example, at the 0.01 level (critical t = +2.373), the null hypoth- 
esis would not have been rejected. While the Graphlex Company would likely 
stress the additive's effectiveness by relying on the 0.05 level of significance, the 
manufacturer of a competing brand would tend to prefer a more demanding test 
(e.g., a = 0.01) in order to boast that Graphlex has no effect. 

You may have noticed that the degrees of freedom value for this test, df = 81, 
was near the upper part of the df range of the t distribution table. Should you 
encounter a test in which J/happens to be over 100, just use the "infinity" row of 
the table to determine the critical value(s) of the test statistic. This row represents 
the normal distribution, toward which the t distribution converges as df becomes 
larger. 

Based on the sample data, we will also determine the 90% confidence interval 
for (/iij — ix 2 ). For df = 81 and a = 0.10, this will be 

= 1.43 ± 1.20 , or from 0.23 to 2.63. 

Computer Solutions 11.2 shows Excel and Minitab procedures for the unequal- 
variances f-test. 




Wgcpmputer solutions 1 1 . 


2 







Unequal-Variances t-Test for - fx 2 ), Population Variances Unknown 
and Not Equal 

These procedures show how to use the unequal-variances f-test to compare the means of two independent 
samples. The population variances are unknown and not assumed to be equal. 

EXCEL 





D E 


F 


1 


t-Test: Two-Sample Assuming Unequal Variances 


2 








3 




Expgrp 


Ctrlgrp 


4 


Mean 


18.94 


17.51 


5 


Variance 


15.22 


8.24 


6 


Observations 


45 


45 


7 


Hypothesized Mean Difference 


0 




8 


df 


81 




9 


tStat 


1.98 




10 


P(T<=t) one-tail 


0.026 




11 


t Critical one-tail 


1.664 




12 


P(T<=t) two-tail 


0.051 




13 


t Critical two-tail 


1.990 
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Excel unequal variances t-test for — fi 2 ), based on raw data 

1. For the sample data (file CX11MPG.XLS) on which Figure 11.3 is based, with the label and 45 mpg values for the 
Graphlex cabs in column A, and the label and 45 data values for the standard cabs in column B: Click Tools. Click Data 
Analysis. Click t-Test: Two-Sample Assuming Unequal Variances. Click OK. 

2. Enter A1:A46 into the Variable 1 Range box and B1:B46 into the Variable 2 Range box. Enter 0 into the Hypoth- 
esized Mean Difference box. Click to select Labels. Specify the significance level by entering 0.05 into the Alpha 
box. Select Output Range and enter D1 into the box. Click OK. This is a one-tail test, so we refer to the 0.026 
p-value. 

3. To obtain a confidence interval for (fj, ] - /jl 2 ), it will be necessary to refer to the procedure described next. It is based on 
summary statistics for the samples. 

Note: For an analysis based on raw data, Excel requires the two samples to be in two distinctly separate fields (e.g., two 
separate columns). If the data are in one column and the subscripts identifying group membership are in another column, 
see the Excel Unstacking Note in Computer Solutions 11.1. 

Excel unequal-variances t-test and confidence interval for — fi 2 ), based on summary statistics 

1. Using the summary statistics associated with CX11MPG.XLS (the test is summarized in Figure 11.3): Open the TEST 
STATISTICS.XLS workbook, supplied with the text. 

2. Using the arrows at the bottom left, select the f-Test_2 Means (Uneq-Var) worksheet. 

3. Enter the sample means, variances, and sizes into the appropriate cells. Enter the hypothesized difference (0, in this 
case) and the desired alpha level for the test. The calculated f-statistic and a one-tail p-value will be shown at the 
right. 

4. To get a confidence interval for - fi 2 ), follow steps 1-3, above, but open the ESTlMATORS.XLS workbook and 
select the f-Estimate_2 Means (Uneq-Var) worksheet. Enter the desired confidence level as a decimal fraction 
(e.g., 0.90). 

Note: As an alternative, you can use Excel worksheet template TMT2UNEQ.XLS, supplied with the text. It simultaneously 
conducts the test and reports a confidence interval. The steps are described within the template. 

MINI TAB 

Minitab unequal-variances t-test and confidence interval for - fi 2 ), based on raw data 

Two-Sample T-Test and CI: Expgrp, Ctrlgrp 

Two-sample T for Expgrp vs Ctrlgrp 

N Mean StDev SE Mean 
Expgrp 45 18.94 3.90 0.58 
Ctrlgrp 45 17.51 2.87 0.43 

Difference = mu (Expgrp) - mu (Ctrlgrp) 

Estimate for difference: 1.429 

90% lower bound for difference: 0.496 

T-Test of difference = 0 (vs >) : T-Value = 1.98 P-Value = 0.026 DF = 80 

1 . For example, using the data (file CX11MPG.MTW) on which Figure 1 1 .3 is based, with the data values for the Graphlex 
cabs in column C1 and the data values for the standard cabs in column C2: Click Stat. Select Basic Statistics. Click 
2-Sample t. 

2. Select Samples in different columns. Enter C1 into the First box and C2 into the Second box. Do NOT select the 
"Assume equal variances" option. (Note: If all the data had been in a single column [i.e., "stacked"], it would have 
been necessary to select the Samples in one column option, then to specify the column containing the data and the 
column containing the subscripts identifying group membership.) 
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3. Click Options. Enter the desired confidence level as a percentage (e.g., 90.0) into the Confidence Level box. Enter 
the hypothesized difference (0) into the Test difference box. Within the Alternative box, select greater than. Click 
OK. Click OK. 

Minitab unequal-variances t-test and confidence interval for (/u 1 - /n 2 ), based on summary data 

Follow steps 1 through 3 above, but select Summarized data in step 2 and insert the appropriate summary statistics into 
the Sample size. Mean, and Standard deviation boxes for each sample. 




11.18 In two independent samples from populations that 
are normally distributed, x x = 35.0, s 1 = 5.8, n 1 = 12 
and x 2 = 42.5, s 2 = 9.3, « 2 = 14. Using the 0.05 level of 
significance, test H Q : = /x, versus Hj: ¥= fi 2 . 



11.19 In two independent samples, x 1 = 165.0, s 1 = 21.5, 
n x = 40 and x 2 = 172.9, s 2 = 31.3, » 2 = 32. Using the 
0.10 level of significance, test H Q : > /x 2 versus Hy 

Mi < 

11.20 In two independent samples, x 1 = 125.0, s 1 = 
21.5, n x = 40, andx 2 = 116.4, s 2 = 10.8, n 2 = 35. 
Using the 0.025 level of significance, test H 0 : < /jl 2 
versus Hy > /jl 2 . 

11.21 Fred and Martina, senior agents at an airline secu- 
rity checkpoint, carry out advanced screening procedures 
for hundreds of randomly selected passengers per day. 
For a random sample of 30 passengers recently processed 
by Fred, the mean processing time was 124.5 seconds, 
with a standard deviation of 20.4 seconds. For a random 
sample of 36 passengers recently processed by Martina, 
the corresponding mean and standard deviation were 
133.0 seconds and 38.7 seconds, respectively. Using the 
0.05 level of significance, can we conclude that the popu- 
lation mean processing times for Fred and Martina could 
be the same? Using the appropriate statistical table, what 
is the most accurate statement we can make about the 
p-value for the test? Construct and interpret the 95% 
confidence interval for the difference between the popula- 
tion means. 

11.22 A tire company is considering switching to a new 
type of adhesive designed to improve tire reliability in 
high-temperature and overload conditions. In laboratory 
"torture" tests with temperatures and loads 90% higher 
than the maximum normally encountered in the field, 

15 tires constructed with the new adhesive run an 



average of 65 miles before failure, with a standard 
deviation of 14 miles. For 18 tires constructed with the 
conventional adhesive, the mean mileage before failure 
was 53 miles, with a standard deviation of 22 miles. 
Assuming normal populations and using the 0.05 level 
of significance, can we conclude that the new adhesive is 
superior to the old under such test conditions? What is 
the most accurate statement that could be made about 
the p-value for this test? 

1 1 .23 The credit manager for Braxton's Department 
Store, in examining the accounts for various types of cus- 
tomers served by the establishment, has noticed that the 
mean outstanding balance for a sample of 20 customers 
from the local zip code is $375, with a standard deviation 
of $75. For a sample of 25 customers from a nearby zip 
code, the mean and standard deviation were $425 and 
$143, respectively. Assuming normal populations and 
using the 0.05 level of significance, examine whether 
credit customers from the two zip codes might have the 
same mean outstanding balance. What is the most accu- 
rate statement that could be made about the p-value for 
this test? Construct and interpret the 95% confidence in- 
terval for the difference between the population means. Is 
the hypothesized difference (0.00) within the interval? 
Given the presence or absence of the 0.00 value within 
the interval, is this consistent with the findings of the 
hypothesis test? 

11.24 Safety engineers at a manufacturing plant are eval- 
uating two brands of 50-ampere electrical fuses for possi- 
ble purchase and use in the plant. One of the performance 
characteristics they are considering is how long the fuse 
will carry 50 amperes before it blows. In a sample of 

35 fuses from the Shockley Fuse Company, the mean 
time was found to be 240 milliseconds, with a standard 
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deviation of 50 milliseconds. In comparable tests of a 
sample of 30 fuses from the Fusemaster Corporation, 
the mean time was 221 milliseconds, with a standard 
deviation of 28 milliseconds. Using the 0.10 level of 
significance, examine whether the population mean times 
for fuses from the two companies might be the same. 
What is the most accurate statement that could be made 
about the p-value for this test? Construct and interpret 
the 90% confidence interval for the difference between 
the population means. Is the hypothesized difference 
(0.00) within the interval? Given the presence or absence 
of the 0.00 value within the interval, is this consistent 
with the findings of the hypothesis test? 

11.25 According to a national Gallup poll, men visit the 
doctor's office an average of 3.8 times per year, while 
women visit an average of 5.8 times per year. In a similar 
poll conducted in a Midwest county, a sample of 50 men 
visited the doctor an average of 2.2 times in the past year, 
with a standard deviation of 0.6 visits. For a sample of 
40 women from the same county, the mean and standard 
deviation were 3.9 and 0.9, respectively. In a two-tail test 
at the 0.05 level, test whether the observed difference 
between and x 2 is significantly different from the 

(3.8 — 5.8)= —2.0 visits per year that was found for the 
nation as a whole. Construct and interpret the 95% con- 
fidence interval for (n 1 — /x 2 ) for the county. Is the 
hypothesized difference (—2.0) within the interval? Given 
the presence or absence of the —2.0 value within the 
interval, is this consistent with the findings of the hypoth- 
esis test? source: Cindy Hall and Marcy E. Mullins, "Doctors See 
Women More Often," USA Today, May 2, 2000, p. 7D. 

11.26 One of the measures of the effectiveness of a 
stimulus is how much the viewer's pulse rate increases on 
exposure to it. In testing a lively new music theme for its 
television commercials, an advertising agency shows ads 
with the new music to a sample of 25 viewers. Their 
mean pulse rate increase is 20.5 beats per minute, with a 
standard deviation of 7.4. For a comparable sample of 
25 viewers seeing the same ads with the previous music 
theme, the mean pulse rate increase is 16.4 beats per 
minute, with a standard deviation of 4.9. Assuming nor- 
mal populations and using the 0.025 level of significance, 
can we conclude that the new music theme is better than 
the old in terms of increasing the pulse rate of viewers? 
What is the most accurate statement that could be made 
about the p-value for this test? 

/ data set/ Note: Exercises 11.27-11.30 require a 
computer and statistical software. 

11.27 According to a Yankelovich poll, women spend 
an average of 19.1 hours shopping during the month of 



December, compared to 12.7 hours for men. Assuming 
that file XR11027 contains the survey data underlying 
these results, use the 0.01 level in examining whether the 
sample mean for women is significantly higher than that 
for men. Identify and interpret the p-value for the test. 
source: Anne R. Carey and Gary Visgaitis, "Tis the Season, Health," USA 
Today 1999 Snapshot Calendar, December 8. 

11.28 When companies are designing a new product, one 
of the steps typically taken is to see how potential buyers 
react to a picture or prototype of the proposed product. 
The product-development team for a notebook computer 
company has shown picture A to a large sample of poten- 
tial buyers and picture B to another, asking each person 
to indicate what they "would expect to pay" for such a 
product. The data resulting from the two pictures are 
provided in file XR11028. Using the 0.05 level of 
significance, determine whether the prototypes might not 
really differ in terms of the price that potential buyers 
would expect to pay. Identify and interpret the p-value 
for the test. Construct and interpret the 95% confidence 
interval for the difference between the population means. 
Is the hypothesized difference (0.00) within the interval? 
Given the presence or absence of the 0.00 value within 
the interval, is this consistent with the findings of the 
hypothesis test? 

11.29 It has been reported that the average visitor from 
Japan spent $1953 during a trip to the United States, 
while the average for a visitor from the United Kingdom 
was $1783. Assuming that file XR11029 contains the 
survey data underlying these results, use the 0.05 level 
in examining whether the sample mean for Japanese 
visitors is significantly higher than that for visitors 
from the United Kingdom. Identify and interpret the 
p-value for the test, source: The World Almanac and Book of Facts 
2003, p. 752. 

11.30 It has been reported that males who are 55 or 
older spend an average of 43 minutes on-line, versus 38 
minutes on-line daily for women in this age group. 
Assuming that file XR11030 contains the survey data un- 
derlying these results, use the 0.01 level of significance in 
examining whether the population means for daily on- 
line minutes might really be the same for the two genders 
in this age group. Identify and interpret the p-value for 
the test. Construct and interpret the 99% confidence 
interval for the difference between the population means. 
Is the hypothesized difference (0.00) within the interval? 
Given the presence or absence of the 0.00 value within 
the interval, is this consistent with the findings of the 
hypothesis test? source: Cindy Hall and Marcy E. Mullins, "Older 
Adults Are TV Viewers," USA Today, September 18, 2000, p. 1D. 
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1 1 .4 THE z-TEST FOR COMPARING THE MEANS OF 

TWO INDEPENDENT SAMPLES 



The z-test approximation is included in Figure 11.1 and presented here as an 
alternative to the unequal-variances i-test whenever both n 1 and n 2 are — 30. Be- 
sides requiring no assumptions about the shape of the population distributions, it 
offers the advantages of slightly greater simplicity and avoidance of the cumber- 
some df correction formula used in the unequal-variances £-test; thus, it can be 
useful to those who are not relying upon a computer and statistical software. This 
test has been popular for many years as a method for comparing the means of two 
large, independent samples when a 1 and a 2 are unknown, and of two indepen- 
dent samples of any size when <j 1 and cr 2 are known and the two populations are 
normally distributed. Like the unequal-variances Mest, the z-test approximation 
does not assume the population standard deviations are equal, and s 1 and s 2 are 
used to estimate their respective population standard deviations, cr 1 and cr 1 . 

Z-test approximation for comparing the means of two independent samples, a l 
and o" 2 unknown, and each n > 30: 

~i ) — ^2)0 w here x 1 and x 2 = means of samples 1 and 2 
Sl . sj (£4 — /jl 2 ) 0 = hypothesized difference 

n x n 2 between the population 

means 

n 1 and n 2 = sizes of samples 1 and 2 
and s 2 = standard deviations of 



samples 1 and 2 



Confidence interval for — /j, 2 : 



IS 2 ! s\ 

^ /2 \ ni + n 2 
with 

a = (1 — confidence coefficient) 





■1 "» 




Example 





z-Test 



A university's placement center has collected data comparing the starting salaries 
of graduating students with surnames beginning with the letters A through M 
with those whose surnames begin with N through Z. For a sample of 30 students 
in the A-M category, the average starting salary was $37,233.33, with a standard 
deviation of $3475.54. For a sample of 36 students with surnames beginning with 
N-Z, the average starting salary was $35,855.81, with a standard deviation of 
$2580.02. The underlying data are in file CX11GRAD. 

SOLUTION 

For this study, the null hypothesis is that there is no difference between the popu- 
lation means, or H 0 : /jl^ = fi 2 . Because the intent of the test is nondirectional, the 
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Starting salaries are the same for the two groups. 
Starting salaries are not the same. 



0 : ^tti-x 2 )- u 

X\ and x 2 are the mean starting salaries for the A-M and N-Z samples. 




-*— Reject H 0 
Area = 0.01 



-2.33 



Reject Hq — *- 
Area = 0.01 



: +2.33 



Test statistic: 
z = +1.80 



FIGURE 11.4 



This is an application of the 
z-test in comparing two 
sample means. From the 
results, we are not able to 
reject the possibility that 
graduates with surnames 
beginning with A-M receive 
the same starting salaries as 
graduates whose names 
begin with N-Z. 



null hypothesis can be rejected by an extreme difference in either direction, and 
the alternative hypothesis is Hji ¥= n 2 - F° r testing the null hypothesis, we'll use 
the 0.02 level of significance. As described in Table 11.1, the null and alternative 
hypotheses can also be stated as follows: 

• Null hypothesis 

H 0 : fife _- j = 0 The starting salaries are the same 
for both populations. 

• Alternative hypothesis 

H t : ft-^-x ) ^ 0 The starting salaries are not the same. 

For these data, the calculated value of the test statistic, z, can be computed as 

(x t ~ x ) - 0 (37,233.33 - 35,855.81) - 0 , on 
z — 1 = - — — = 1.80 

Ws| sf W 3475.54 2 2580.02 2 

V^ + «2 V 30 36 

For the 0.02 level of significance, the critical values will be z = —2.33 and 
z = +2.33. The decision rule will be to reject the null hypothesis of equal popu- 
lation means if the calculated z is either less than —2.33 or greater than +2.33, as 
shown in Figure 11.4. Because the calculated test statistic, z = 1.80, falls into the 
nonrejection region of the diagram, the null hypothesis cannot be rejected at the 
0.02 level of significance. From this analysis, we cannot conclude that people with 
surnames in the first part of the alphabet receive different starting salaries than 
persons whose names are in the latter portion. 



The approximate p-value for this test can be determined by finding the area to 
the right of the calculated test statistic, z = 1.80, then (because it is a nondirectional 
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test) multiplying this quantity by 2. Referring to the normal distribution table, we 
find the area from the midpoint to z = 1.80 is 0.4641. Subtracting 0.4641 from 
0.5000, the right-tail area is 0.0359. The approximate p-value for this two-tail test 
will be 2(0.0359), or 0.0718. 

Based on the sample data, we will also determine the 98% confidence interval 
for (fjL 1 — /jl 2 ). This corresponds to a = 0.02 and, to the best accuracy possible 
using the normal table with z = 2.33, the interval will be 



= (37,233.33 - 35,855.81) ± 2.33\/^! + 258^ 

V 30 36 

= 1377.52 ± 1785.99 or from -408.47 to +3163.51 

The hypothesized difference (zero) is contained within the 98% confidence 
interval, so we are 98% confident the population means could be the same. As we 
discussed in Chapter 10, a nondirectional test at the a level of significance and a 
100(1 — a)% confidence interval will lead to the same conclusion. Computer 
Solutions 11.3 describes Excel procedures for the z-test and confidence interval 
for the difference between population means. The computer-generated confidence 
interval differs very slightly from ours because our z-value (2.33) contained only 
the standard two decimal places from the normal distribution table. 

Seeing Statistics Applet 14, at the end of the chapter, allows you to visually 
examine the sampling distribution of the difference between sample means and 
how it responds to the changes you select. 




The z-Test for - fi 2 ) 

These procedures show how to use the z-test to compare the means of two independent samples. Each sam- 
ple size should be >30. Excel offers the z-test; Minitab does not. 



EXCEL 





D 


E 


F 


1 


z-Test: Two Sample for Means 






2 








3 




GroupAM 


GroupNZ 


4 


Mean 


37233.33 


35855.81 


5 


Known Variance 


12079378.29 


6656503.20 


6 


Observations 


30 


36 


7 


Hypothesized Mean Difference 


0 




8 


z 


1.797 




9 


P(Z<=z) one-tail 


0.036 




10 


z Critical one-tail 


2.054 




11 


P(Z<=z) two-tail 


0.072 




12 


z Critical two-tail 


2.326 
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Excel z-test for - /m 2 ), based on raw data 

1 . For the sample data (file CX11GRAD.XLS) for Figure 1 1 .4, with the label and 30 salary values for the A-M group in col- 
umn A, and the label and 36 salary values for the N-Z group in column B: Click Tools. Click Data Analysis. Click 
z-Test: Two Sample for Means. Click OK. 

2. Enter A1:A31 into the Variable 1 Range box and B1:B37 into the Variable 2 Range box. Enter 0 into the Hypothe- 
sized Mean Difference box. Click to select Labels. Specify the significance level by entering 0.02 into the Alpha box. 

3. Excel does this test assuming the population variances are known. To estimate these using the sample variances, first 
compute the variances of the two samples: Select any available cell and enter =VAR(A2:A31) and hit return. Select 
another available cell and enter =VAR(B2:B37) and hit return. Enter the resulting variances into the Variable 1 
Variance (known) and Variable 2 Variance (known) boxes, respectively. 

4. Select Output Range and enter D1 into the box. Click OK. This is a two-tail test, so we refer to the 0.072 p-value. 

5. To get a confidence interval for - fj, 2 ), it will be necessary to refer to the Excel worksheet template and procedure 
described below. It is based on summary statistics for the samples. 

Note: For an analysis based on raw data, Excel requires the two samples to be in two distinctly separate fields (e.g., two 
separate columns). If the data are in one column and the subscripts identifying group membership are in another 
column, see the Excel Unstacking Note in Computer Solutions 11.1. 

Excel z-test and confidence interval for (/*., - fi 2 ), based on summary data 





A 


B 


c 




E 


1 


2-Sample Z-Test Comparing Means (based on summary statistics) 




2 












3 


Hypothesized Difference (mu1 - mu2) 


0.000 








4 












5 


Summary of Sample Data: 




Calculated Values: 




6 


Mean for Sample 1 , xbarl 


37233.33 


Std. Error = 


766.517 




7 


Std. Dev. For Sample 1 , s1 


3475.54 


z = 


1.797 




8 


Size of Sample 1, n1 


30 








9 


Mean for Sample 2, xbar2 


35855.81 


p-value If the Test Is: 




10 


Std. Dev. For Sample 2, s2 


2580.02 


Left-Tail 


Two-Tail 


Right-Tail 


11 


Size of Sample 2, n2 


36 


0.964 


0.072 


0.036 


12 












13 


Confidence level desired 


0.98 








14 


Lower confidence limit 


-405.7 








15 


Upper confidence limit 


3160.7 









1 . Open Excel worksheet TMZ2TEST.XLS, supplied with the text. Enter the hypothesized difference between the popula- 
tion means (0) into cell B3. Enter the summary statistics for the example on which Figure 11.4 is based into cells 
B6:B1 1 . Enter the desired confidence level as a decimal fraction (e.g., 0.98) into cell B13. 

2. Refer to the p-value that corresponds to the type of test being conducted. The test in Figure 1 1 .4 is a two-tail test, so 
the p-value is in cell D1 1 , or 0.072. 






11.31 Under what conditions is it appropriate to use the 
z-test as an approximation to the unequal-variances f-test 
when comparing two sample means? 

11.32 For the following independent random samples, 
use the z-test and the 0.01 level of significance in testing 
H Q : /Xj = /jl 2 versus H^: ^ /jl 2 - 

x 1 = 33.5 s 1 = 6.4 «j = 31 
x, = 27.6 s-, = 2.7 = 30 



11.33 For the following independent random samples, 
use the z-test and the 0.05 level of significance in testing 
H 0 : fi 1 < ju, 2 versus H^. fi 1 > n 2 - 



85.2 
81.7 



9.6 
4.1 



40 

32 



11.34 Repeat Exercise 11.20 using the z-test approxima- 
tion to the unequal-variances f-test. 
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11.35 Repeat Exercise 11.21 using the z-test approxima- 
tion to the unequal-variances t-test. 

11.36 According to the U.S. Bureau of Labor Statistics, 
personal expenditures for entertainment fees and admis- 
sions averaged $349 per person in the Northeast and 
$420 in the West. Assuming that these data involved 
(1) sample sizes of 800 and 600 and (2) standard 
deviations of $215 and $325, respectively, use a z-test 
and the 0.01 level of significance in testing the difference 
between these means. Determine and interpret the p-value 
for the test. Construct and interpret the 99% confidence 
interval for the test, source: U.S. Bureau of Labor Statistics, Con- 
sumer Expenditure Survey, Interview Survey, annual. 

11.37 Surveys of dog owners in two adjacent towns 
found the average veterinary-related expense total for 
dog owners in town A during the preceding year was 
$240, with a standard deviation of $55, while the com- 
parable results in nearby town B were $225 and $32, 
respectively. Assuming a sample size of 30 dog owners 
surveyed in each town, use a z-test and the 0.05 level of 
significance in determining whether the population 
means might actually be the same. Determine and inter- 
pret the p-value for the test; then construct and interpret 
the 95% confidence interval for the difference between 
the population means. 

11.38 A study of 102 patients who had digestive-tract 
surgery found that those who chewed a stick of gum for 
15 minutes at mealtimes ended up being discharged after 
an average of 4.4 days in the hospital, compared to an 
average of 5.2 days for their counterparts who were not 
provided with gum. Assuming (1) a sample size of 51 for 
each group and (2) sample standard deviations of 1.3 
and 1.9 days, respectively, use a z-test and the 0.01 level 
of significance in examining whether the mean hospital 
stay for the gum-chewing patients could have been this 
much lower simply by chance. Determine and interpret 
the p-value for the test, source: Jennifer Bails, "Chewing Gum 
Might Help Digestive-Tract Patients Get Home Sooner," Pittsburgh 
Tribune-Review, October 1 5, 2005, p. A4. 

11.39 A study found that women of normal weight missed 
an average of 3.4 days of work due to illness during the pre- 
ceding year, compared to an average of 5.2 days for women 
who were considered overweight. Assuming (1) a sample 
size of 30 for each group and (2) sample standard deviations 
of 2.5 and 3.6 days, respectively, use a z-test and the 0.05 
level of significance in examining whether the mean number 
of absence days for the overweight group could have been 
this much higher simply by chance. Determine and interpret 
the p-value for the test, source: Nanci Hellmich, "Heavy Workers, 
Hefty Price," USA Today, September 9, 2005, p. 7D. 

1 1 .40 According to a psychologist at the Medical 
College of Virginia, soccer players who "head" the ball 
10 or more times a game are at risk for brain damage 



that could lower their intellectual abilities. In tests 
involving young male soccer players, those who typically 
headed the ball 10 or more times a game recorded a 
mean IQ score of 103, compared to a mean of 112 for 
those who usually headed the ball once or less a game. 
Assuming 30 players in each sample, with sample 
standard deviations of 17.4 and 14.5 IQ points for the 
"headers" and "nonheaders," respectively, use an appro- 
priate one-tail z-test and the 0.01 level of significance in 
reaching a conclusion as to whether frequent "heading" 
of the ball by soccer players might lower intellectual per- 
formance. Determine and interpret the p-value for the test. 
source: Marilyn Elias, "Heading Soccer Ball Can Lower IQ," USA Today, 
August 14, 1995, p. 1D. 

/ data Set /Note: Exercises 11.41-11.43 require a 
computer and statistical software. 

11.41 In examining vendors' service and technical sup- 
port for desktop personal computers, PC Magazine re- 
ported an average of 16.5 minutes on hold when calling 
Gateway compared to 14.1 minutes for calls to IBM. 
Assume that data file XR11041 contains the data for 
times on hold for calls to the two companies. Use a 
one-tail test and the 0.05 level of significance in 
concluding whether Gateway's mean technical support 
hold time could be greater than IBM's. Determine and 
interpret the p-value for the test, source: Rachel Derby Teitler 
and Mary E. Behr, "Readers Rate Service and Reliability," PC Magazine, 
July 1996, p. 222. 

11.42 In planning the processes to be incorporated into 
a new manufacturing facility, engineers have proposed 
two possible assembly procedures for one of the phases 
in the production sequence. Sixty of the eventual produc- 
tion line workers have taken part in preliminary tests, 
with data representing productivity (units produced in 

1 hour) for 30 workers using procedure A and 30 using 
procedure B. Given the data in file XR11042, use the 0.10 
level of significance in determining whether the two pro- 
cedures might be equally efficient. Identify and interpret 
the p-value for the test, then construct and interpret the 
90% confidence interval for the difference between the 
population means. 

11.43 A study has been conducted to examine the effec- 
tiveness of a new experimental program for preparing 
high school students for the Scholastic Aptitude Test. 
Eighty students have been randomly divided into two 
groups of 40. The eventual SAT scores for those 
exposed to the new program and the conventional 
program are listed in data file XR11043. Use a one-tail 
test and the 0.025 level of significance in concluding 
whether the experimental program might be better 
than the conventional method in preparing students 

for the SAT. Determine and interpret the p-value for 
the test. 
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COMPARING TWO MEANS WHEN THE 
SAMPLES ARE DEPENDENT 



11.5 



In previous comparisons of two sample means, the samples were independent 
from each other. That is, the selection process for one sample was not related to 
the selection process for the other. However, there may be times when we wish to 
test hypotheses involving samples that are not independent. For example, we may 
wish to examine the before-and-after productivity of individual employees after a 
change in their workstation layout, or compare the before-and-after reading 
speeds of individual participants in a speed-reading course. In such cases, we do 
not really have two different samples of persons, but rather before and after mea- 
surements for the same individuals. As a result, there will be just one variable: the 
difference recorded for each individual. 

Tests in which the samples are not independent are also referred to as paired 
observations, or matched pairs, and they are essentially the same as those discussed 
in Chapter 10 for the mean of a single sample. The variable under consideration in 
this case is d = (x 1 — x 2 ), where x 1 and x 2 are the before and after measurements, 
respectively. As in the tests for one sample mean, the null and alternative hypothe- 
ses will be one of the following, with the test statistic calculated as shown here: 



Null Hypothesis 



Alternative Hypothesis 



Type of Test 



H, 



Two-tail 
Left-tail 
Right-tail 



Test statistic for comparing the means for paired observations: 
d 



where d 

d 
n 



Confidence interval for /jlj. 



df 



for each individual or test unit, (x^ — x 2 ), 
the difference between the two measurements 
the average difference, = ILdJn 
number of pairs of observations 



the standard deviation of d, or 
n - 1 



Sff - nd 1 
n — 1 



'■all 



Vn 





example 









Dependent Samples 



Exploring ways to increase office productivity, a company vice president has 
ordered 12 ergonomic keyboards for distribution to a sample of secretarial 
employees. If the keyboards substantially increase productivity, she plans to 
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TABLE 11.2 



For dependent samples, 
only one variable is tested: 
the difference between 
measurements. For each of 
12 individuals, the typing 
speed for generating a 
standard document is 
shown before and after 
learning to use an 
ergonomic keyboard. 



x„ Words/Minute 



x,, Words/Minute 



Person with Old Keyboard with New Keyboard 



Difference 
ef = (x 1 - x 2 ) 



d 2 



1 


25.5 


43.6 


-18.1 


327.61 


2 


59.2 


69.9 


-10.7 


114.49 


3 


38.4 


39.8 


-1.4 


1.96 


4 


66.8 


73.4 


-6.6 


43.56 


c 
_> 


A A Q 

44. y 


jU.Z 


C O 
— D.D 


"i o no. 
zo.uy 


6 


47.4 


53.9 


-6.5 


42.25 


7 


41.6 


40.3 


1.3 


1.69 


8 


48.9 


58.0 


-9.1 


82.81 


9 


60.7 


66.9 


-6.2 


38.44 


10 


41.0 


66.5 


-25.5 


650.25 


11 


36.1 


27.4 


8.7 


75.69 


12 


34.4 


33.7 


0.7 


0.49 








-78.7 


1407.33 








= 2d 


= 2d 2 



replace all of the firm's current keyboards with the new models. Prior to delivery 
of the keyboards, each of the 12 sample members types a standard document on 
his or her old keyboard, and the number of words per minute is measured. After 
receiving the new keyboards and spending a few weeks becoming familiar with 
their operation, each employee then types the same document using the ergonomic 
model. Table 11.2 shows the number of words per minute each of the 12 persons 
typed in each test. The data are also in file CX11TYPE. 

SOLUTION 

Because the vice president doesn't want to replace the current stock of keyboards 
unless the ergonomic model is clearly superior, the burden of proof is on the new 
model, and a one-tail test is appropriate. The 0.025 level will be used to examine 
whether the new keyboard has significantly increased typing speeds. For each per- 
son in the sample, the difference in typing speed between the first and second 
measurements is d = (x 1 — x 2 ) words per minute. The null and alternative 
hypotheses will be as follows: 

• Null hypothesis 

H 0 : jx d > 0 Typing with the ergonomic keyboard is no faster than with 
the current keyboard. 

• Alternative hypothesis 

iZji < 0 The ergonomic keyboard is faster. 

The sample mean and standard deviation for d are calculated as in Chapter 3, 
and can be expressed as 

7 -18.1 - 10.7 - 1.4 - 6.6 - 5.3 - 6.5 + 1.3 - 9.1 - 6.2 - 25.5 + 8.7 + 0.7 

d= 12 

= -6.558 



1 407.33 - 12(-6.558) 2 = ? QQ1 
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H 0 : ftj^O The ergonomic keyboard is no faster than the current keyboard. 
H l : fi d < 0 The ergonomic keyboard increases typing speed. 
For each person, d = x l -x 2 ; x l = words/minute with current keyboard. 
x 2 = words/minute with ergonomic keyboard. 



FIGURE 11.5 



A summary of the 
hypothesis test for the 
paired observations in 
Table 1 1 .2. At the 0.025 
level in a one-tail test, we 
conclude that the 
ergonomic keyboard 
increases typing speeds. 




and the test statistic is calculated as 

d -6.558 



s d /Vn 9.001/VT2 



-2.524 



The number of degrees of freedom for the test is df = (n — 1), or (12 — 1) = 11. 
For the 0.025 level of significance in a left-tail test, the critical value for the test 
statistic will bet = —2.201. This is obtained by referring to the a = 0.025 column 
and df = 11 row of the table. The decision rule is, "Reject the null hypothesis if 
the calculated test statistic is less than t = —2.201, otherwise do not reject." 

As Figure 11.5 shows, the calculated test statistic is less than the critical value 
and falls into the rejection region for the test. As a result, the null hypothesis is 
rejected, and we conclude that the ergonomic keyboard does increase typing 
speeds. Following through with the intent of her test, the vice president should 
order them for all secretarial personnel. 

Based on the sample data, we will also determine the 95% confidence interval 
for This corresponds to a = 0.05. With df = 12 — 1 = 11 and t = 2.201, the 
interval will be 

= -6.558 ± 5.719, or from -12.277 to -0.839 



Computer Solutions 11.4 describes Excel and Minitab procedures for the 
i-test and confidence interval when comparing the means of dependent samples. 
In a one-tail test, Minitab will provide either an upper or a lower confidence limit 
for depending on the directionality of the test. 
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computer solutions 11.4 













Comparing the Means of Dependent Samples 

These procedures show how to use a t-test to compare sample means when the samples are not 
independent. 



EXCEL 





D 


E 


F 


1 


t-Test: Paired Two Sample for Means 




2 








3 




Old 


New 


4 


Mean 


45.41 


51.97 


5 


Variance 


144.12 


231.80 


6 


Observations 


12 


12 


7 


Pearson Correlation 


0.807 




8 


Hypothesized Mean Difference 


0 




9 


df 


11 




10 


tStat 


-2.524 




11 


P(T<=t) one-tail 


0.014 




12 


t Critical one-tail 


2.201 




13 


P(T<=t) two-tail 


0.028 




14 


t Critical two-tail 


2.593 





Excel t-test for comparing the means of dependent samples, based on raw data 

1 . For the sample data (file CX11TYPE.XLS) on which Figure 1 1 .5 is based, with the label and 12 "old keyboard" data val- 
ues in column A, and the label and 12 "new keyboard" data values in column B: Click Tools. Click Data Analysis. 
Click t-Test: Paired Two-Sample For Means. Click OK. 

2. Enter A1:A13 into the Variable 1 Range box and B1:B13 into the Variable 2 Range box. Enter 0 into the Hypoth- 
esized Mean Difference box. Click to select Labels. Specify the significance level by entering 0.025 into the Alpha 
box. Select Output Range and enter D1 into the box. Click OK. This is a one-tail test, so we refer to the 0.014 
p-value. 

3. To obtain a confidence interval for - /x 2 ), it will be necessary to refer to the procedure described below. It is based 
on summary statistics for the samples. 

Excel t-test for comparing the means of dependent samples, based on summary statistics 

1. Using the summary statistics associated with CX11TYPE.XLS (the test is summarized in Figure 1 1.5): Open the TEST 
STATISTICS.XLS workbook, supplied with the text. 

2. Using the arrows at the bottom left, select the f-Test_Mean worksheet. 

3. For d = x 1 - x 2 , enter the mean of d (-6.558), the standard deviation of d (9.001), and the number of pairs (12) into 
the appropriate cells. Enter the hypothesized difference (0) and the desired alpha level for the test (0.025). The calcu- 
lated f-statistic and a one-tail p-value will be shown at the right. 

4. To get a confidence interval for follow steps 1-3, but open the ESTlMATORS.XLS workbook and select the 
f-Estimate_Mean worksheet. Enter the desired confidence level as a decimal fraction (e.g., 0.95). 

Note: As an alternative, you can use Excel worksheet template TMTTEST.XLS, supplied with the text. The steps are 
described within the template. 
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MINITAB 

Minitab f-test for comparing the means of dependent samples, based on raw data 

Paired T-Test and CI: Old, New 

Paired T for Old - New 





N 


Mean 


StDev 


SE Mean 


Old 


12 


45.41 


12.00 


3.47 


New 


12 


51.97 


15.22 


4.40 


Difference 


12 


-6.56 


9.00 


2.60 



95% upper bound for mean difference: -1.89 

T-Test of mean difference =0 (vs < 0) : T-Value = -2.52 P-Value = 0.014 

1 . For example, using the data (file CX11TYPE.MTW) on which Figure 1 1 .5 is based, with the "old keyboard" data values 
in column C1 and the "new keyboard" data values in column C2: Click Stat. Select Basic Statistics. Click Paired t. 

2. Select Samples in columns and enter C1 into the First sample box and C2 into the Second sample box. 

3. Click Options. Enter the desired confidence level as a percentage (e.g., 95.0) into the Confidence Level box. Enter the 
hypothesized difference (0) into the Test mean box. Within the Alternative box, select less than. Click OK. Click OK. 

Minitab f-test for comparing the means of dependent samples, based on summary data 

Follow steps 1 through 3 above, but select Summarized data (differences) in step 2 and insert the appropriate summary 
statistics (number of pairs, the mean of d, and the standard deviation of d) into the Sample size. Mean, and Standard 
deviation boxes. 




1 1 .44 A pharmaceutical firm has checked the cholesterol 
levels for each of 30 male patients, then provided them 
with fish-oil capsules to take on a daily basis. The choles- 
terol levels are rechecked after a 1-month period. Does this 
study involve independent samples or dependent samples? 

11.45 Each of 20 consumers is provided with a package 
containing two different brands of instant coffee. A week 
later, they are asked to rate the taste of each coffee on a 
scale of 1 (poor taste) to 10 (excellent taste). Is this an 
example of independent samples or dependent samples? 

11.46 A university president randomly selects 10 tenured 
faculty from the College of Arts and Sciences and 10 
tenured faculty from the College of Business. Each faculty 
member is then asked to rate his or her job satisfaction 
on a scale of 1 (very dissatisfied) to 10 (very satisfied). 
Would this be an example of independent samples or 
dependent samples? 

1 1 .47 A trucking firm is considering the installation of a 
new, low-restriction engine air filter for its long-haul trucks, 
but doesn't want to make the switch unless the new filter 



can be shown to improve the fuel economy of these 
vehicles. A test is set up in which each of 10 trucks makes 
the same run twice — once with the old filtration system and 
once with the new version. Given the sample results shown 
below, use the 0.05 level of significance in determining 
whether the new filtration system could be superior. 

Truck Number Current Filter New Filter 



1 


7.6 mpg 


7.3 mpg 


2 


5.1 


7.2 


3 


10.4 


6.8 


4 


6.9 


10.6 


5 


5.6 


8.8 


6 


7.9 


8.7 


7 


5.4 


5.7 


8 


5.7 


8.7 


9 


5.5 


8.9 


10 


5.3 


7.1 



11.48 In an attempt to measure the emotional effect of a 
proposed billboard ad, an advertising agency checks the 
pulse rate of 10 persons before and after they are shown 
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a photograph of the billboard. The agency believes that 
an effective billboard will increase the pulse rate of those 
who view it. In its test, the agency found the mean change 
in pulse rate was +5.7 beats per minute, with a standard 
deviation of 1.6. Using the 0.01 level of significance, 
examine whether the billboard stimulus could meet the 
agency's criterion for effectiveness. 

1 1 .49 The students in an aerobics class have been weighed 
both before and after the 5-week class, with the following 



results: 






Person 


Weight 


Weight 


Number 


Before 


After 


1 


1981b 


194 1b 


2 


154 


151 


3 


124 


126 


4 


110 


104 


5 


127 


123 


6 


162 


155 


7 


141 


129 


8 


180 


165 



Using the 0.05 level of significance, evaluate the effective- 
ness of the program. Using the appropriate statistical 



table, what is the most accurate statement we can make 
about the p-value for this test? 

/ data Set / Note: Exercises 11.50 and 11.51 require 
a computer and statistical software. 

11.50 For a special pre-New Year's Eve show, a radio 
station personality has invited a small panel of prominent 
local citizens to help demonstrate to listeners the adverse 
effect of alcohol on reaction time. The reaction times (in 
seconds) before and after consuming four drinks are in 
data file XR11050. At the 0.005 level, has the program 
host made his point? Identify and interpret the p-value 
for the test. 

11.51 A plant manager has collected productivity data 
for a sample of workers, intending to see whether there is 
a difference in the number of units they produce on 
Monday versus Thursday. The results (units produced) 
are in data file XR11051. Using the 0.01 level of 
significance, evaluate the null hypothesis that there is no 
difference in worker productivity between the two days. 
Identify and interpret the p-value for the test, then 
construct and interpret the 99% confidence interval for 
the mean difference in productivity between the days. 



11.6 COMPARING TWO SAMPLE PROPORTIONS 



The comparison of sample proportions from two independent samples is a frequent 
subject for statistical analysis. The following are but a few of the possibilities: 

• Comparing the percentage of defective parts between shipments provided by 
two different suppliers. 

• Determining whether the proportion of headache sufferers getting relief from 
a new medication is significantly greater than for those using aspirin. 

• Comparing the enlistment percentage of high school seniors who have viewed 
version A of a recruiting film versus those seeing version B. 

In this section, tests assume that both sample sizes are large (each n > 30). In 
addition, n^p^, n x {\ — n 2 p 2 , and n 2 (l — p 2 ) should all be s 5. (These 
requirements are necessary in order that the normal distribution used here will be 
a close approximation to the binomial distribution.) As in the comparison of 
means from independent samples, tests involving proportions can be either nondi- 
rectional or directional. Possible null and alternative hypotheses are similar to 
those summarized in Table 11.1. 

Unlike the previous sections, our choice of test statistic will depend on the 
hypothesized difference between the population proportions, (tt 1 — tt 2 ) 0 - In the 
vast majority of practical applications, the hypothesized difference will be zero 
and the appropriate test statistic will be the first of the two alternatives shown 
next. Accordingly, that will be our emphasis in this section. The confidence inter- 
val for (ir 1 — 7r 2 ) is not affected. 
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Test statistic for comparing proportions of two independent samples: 
1. When the hypothesized difference is zero (the usual case): 

(Pi - P 2 ) 



_/l 1 
p(l -p)\— + — 



with p = ^ + ^ 



where p i and p 2 = the sample proportions 
n 1 and n 2 = the sample sizes 

p = pooled estimate of the 
population proportion 



2. When the hypothesized difference is (tt 1 — tt 2 ) 0 t 6 0: 

(Pi ~ P 2 ) ~ ("l ~ ^2)0 
Pl(l ~ Pi) ~ fejj - P 2 ) 



In either case, confidence interval for (tt« — tt 2 ): 



example 









Sample Proportions 



In a 10-year study sponsored by the National Heart, Lung and Blood Institute, 
3806 middle-age men with high cholesterol levels but no known heart problems 
were divided into two groups. Members of the first group received a new drug 
designed to lower cholesterol levels, while the second group received daily 
dosages of a placebo. Besides lowering cholesterol levels, the drug appeared to be 
effective in reducing the incidence of heart attacks. During the 10 years, 155 of 
those in the first group suffered a heart attack, compared to 187 in the placebo 
group. 1 Assume the underlying data are in file CX11HRT, coded as 1 = did not 
have a heart attack, and 2 = had a heart attack. 



SOLUTION 

If we assume the 3806 participants were randomly divided into two groups, 
there would have been 1903 men in each group. Under this assumption, the 
sample proportions for heart attacks within the two groups are p 1 = 155/1903, 
or p 1 = 0.0815, and p 2 = 187/1903, or p 2 = 0.0983. Since the intent of the 
study was to evaluate the effectiveness of the new drug, the hypothesis test will 
be directional. In terms of the population proportions, the null and alternative 



Source: "News from the World of Medicine," Reader's Digest, May 1984, p. 222. 
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hypotheses are H 0 : tt x > tt 2 and H^: tt 1 < ir 2 . The hypotheses can also be 
expressed as 



Null hypothesis 



Alternative hypothesis 



H, 



< 0 



Users of the new drug are at least as 
likely to experience a coronary. 



Users of the new drug are less likely 
to experience a coronary. 



In testing the null hypothesis, we will use the 0.05 level of significance. The 
pooled estimate of the (assumed equal) population proportions is calculated as 



n l p 1 + n 2 p 2 _ (1903)(0.0815) + (1903)(0.0983) 



7Zj + n 1 



1903 + 1903 
The calculated value of the test statistic, z, is 

pi - p 2 0.0815 - 0.0983 



0.0899 



P(l-P) 



1 1 

h — 



-1.81 



0.0899(1 - 0.0899) 



+ 



1903 1903 



For the 0.05 level in this left-tail test, the critical value of z will be z = — 1.645. 
The decision rule is, "Reject H 0 if the calculated test statistic is < —1.645, other- 
wise do not reject." 

As Figure 11.6 shows, the calculated test statistic, z = —1.81, is less than the 
critical value and falls into the rejection region. At the 0.05 level of significance, the 
null hypothesis is rejected, and we conclude that the new medication is effective. 



FIGURE 11.6 



At the 0.05 level of signifi- 
cance, the medication that 
was the subject of this 
study appears to have been 
effective in reducing the 
incidence of heart attacks in 
middle-age men. 



H 0 : fj. fpi > 0 The medication is no more effective than the placebo. 
Hi'. A t (n 1 -nj) < 0 The medication reduces the likelihood of a coronary. 
Pl and p 2 are the proportions of each sample who had a coronary during the 
10 years of the study. 



-<— Reject// 
Area = 0.05 - 


) 


Do not reject // 0 >- \ 

1 








f J -(j> l -pi) = 0 








1.645 


Test statistic: 




Z — 


1.81 
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Using the normal distribution table and looking up the area from the mid- 
point to z = —1.81, we find the area to be 0.4649. Subtracting this from 0.5000, 
the one-tail area (0.5000 — 0.4649) is 0.0351. This is the approximate p-value for 
the test. 

Based on the sample data, we will also determine the 90% confidence interval 
for (7Tj - 7r 2 ). With z = 1.645, this will be 



(0.0815 - 0.0983) ± 1.6451 '° 0815(1 " ° 0815) ■ a ° 983(1 " 0 • 0983, 



1903 1903 
-0.0168 ± 0.0152, or from -0.0320 to -0.0016 



Computer Solutions 11.5 describes Excel and Minitab procedures for the 
Z-test and confidence interval when comparing the proportions from two indepen- 
dent samples. In a one-tail test, Minitab will provide either an upper or a lower 
confidence limit for (ir 1 — 7T 2 ), depending on the directionality of the test. 




The z-Test for Comparing Two Sample Proportions 

These procedures show how to use the z-test to compare proportions from two independent samples. 

EXCEL— USING SUMMARY DATA 





A 


B 


C 


D 


E 


1 


z-Test of the Difference Between Two Proportions (Case 1 ) 




2 












3 




Sample 1 


Sample 2 


zStat 


-1.812 


4 


Sample proportion 


0.0815 


0.0983 


P(Z<=z) one-tail 


0.035 


5 


Sample size 


1903 


1903 


z Critical one-tail 


1.645 


6 


Alpha 


0.05 




P(Z<=z) two-tail 


0.070 


7 








z Critical two-tail 


1.960 



Excel comparison of p 1 and p 2 when the hypothesized value of (it 1 - tt 2 ) is 0, for summary data 

1 . For the summary data described in Figure 1 1 .6: Open the TEST STATISTICS.XLS workbook, supplied with the text. 

2. Using the arrows at the bottom left, select the z-Test_2 Proportions (Case 1). For each sample, enter the sam- 
ple proportion and the sample size, as shown above. You can also enter the alpha level for the test (0.05). Along 
with the z statistic, the printout lists both one-tail and two-tail p-values. Because this is a one-tail test, the p-value 
is 0.035. 

Note: As an alternative, you can use Excel worksheet template TM2PTEST.XLS, supplied with the text. The steps are 
described within the template. 
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Excel comparison of p 1 and p 2 when the hypothesized value of (ir 1 - 7t 2 ) is not 0, for summary data 

Follow the procedure above, but select z-Test_2 Proportions (Case 2) in step 2, then specify the value of (t7 1 - tt 2 ) 
associated with the null hypothesis. 

Excel confidence interval for - ir 2 ), based on summary data 

Follow the first procedure, but open the ESTlMATORS.XLS workbook and select the z-Estimate_2 Proportions 

worksheet, then specify the desired confidence level as a decimal fraction (e.g., 0.90). 

Note: As an alternative, you can use Excel worksheet template TM2PTEST.XLS, supplied with the text. The steps are de- 
scribed within the template. 

EXCEL— USING RAW DATA 

Excel comparison of p 1 and p 2 for raw data, regardless of the hypothesized value of (tt 1 - tt 2 ) 

1 . For the sample data (file CX11HRT.XLS) on which Figure 1 1 .6 is based, with the labels and data values for the drug and 
placebo groups in columns A and B, and coded as 1 = did not have a heart attack and 2 = had a heart attack: Click 
Tools. Click Data Analysis Plus. Click Z-Test: Two Proportions. Click OK. 

2. Enter A1:A1904 into the Variable 1 Range box. Enter B1:B1904 into the Variable 2 Range box. Enter 2 into the 
Code for Success box. Enter the hypothesized difference (in this case, 0) into the Hypothesized Difference box. 
Click Labels. Enter 0.05 into the Alpha box. Click OK. 

Excel confidence interval for (w, - tt 2 ), based on raw data 

Follow the preceding procedure, but select Z-Estimate: Two Proportions from the Data Analysis Plus menu. Specify the 
desired confidence interval by entering the appropriate value for alpha. For example, to get a 95% confidence interval, 
input 0.05 into the Alpha box in step 2. 

MINITAB— USING SUMMARY DATA 

Minitab comparison of p 1 and p 2 when the hypothesized value of (?7 1 - tt 2 ) is 0, for summary data 
Test and CI for Two Proportions 

Sample X N Sample p 

1 155 1903 0.081450 

2 187 1903 0.098266 

Difference = p (1) - p (2) 

Estimate for difference: -0.0168156 

90% upper bound for difference: -0.00493936 

Test for difference = 0 (vs < 0) : Z = -1.81 P-Value = 0.035 

1. Using the summary statistics associated with Figure 1 1.6: Click Stat. Select Basic Statistics. Click 2 Proportions. 
Select Summarized Data. For sample 1, enter the sample size (1903) into the Trials box. Multiply the sample pro- 
portion (0.0815) times the sample size (1903) to get the number of "successes" (0.0815)0 903) = 155.1, rounded to 
155, and enter this into the Events box for sample 1 . Repeat this for sample 2, entering 1903 into the Trials box and 
(0.0983)(1903) = 187.1, rounded to 187 into the Events box. 

2. Click Options. Enter the desired confidence level as a percentage (90.0) into the Confidence Level box. Enter the hy- 
pothesized difference between the population proportions (0) into the Test difference box. Within the Alternative 
box, select less than. Click to select Use pooled estimate of p for test. Click OK. Click OK. 

Minitab comparison of p 1 and p 2 when the hypothesized value of (ir 1 - ir 2 ) is not 0, for summary data 

Follow the previous procedure, but in step 2 do NOT select "Use pooled estimate of p for test." Specify the hypothesized 
nonzero difference between the population proportions. 
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MINITAB — USING RAW DATA 

Minitab comparison of p 1 and p 2 when the hypothesized value of (tt 1 - tt 2 ) is 0, for raw data 

1 . For the sample data (file CX1 1HRT.MTW) on which Figure 1 1 .6 is based, with the data for the drug group in column 
C1, the data for the placebo group in column C2, and data coded as 1 = did not have a heart attack and 2 = had a 
heart attack: Click Stat. Select Basic Statistics. Click 2 Proportions. Select Samples in different columns. Enter C1 
into the First box and C2 into the Second box. Note that Minitab will select the larger of the two codes (i.e., 2 = had 
heart attack) as the "success" or "event." 

2. Click Options. Enter the desired confidence level as a percentage (90.0) into the Confidence Level box. Enter the hy- 
pothesized difference between the population proportions (0) into the Test difference box. Within the Alternative 
box, select less than. Click to select Use pooled estimate of p for test. Click OK. Click OK. 

Minitab comparison of p 1 and p 2 when the hypothesized value of (t7 1 - n 2 ) is not 0, for raw data 

Follow the previous procedure, but in step 2 do NOT select "Use pooled estimate of p for test." Specify the hypothesized 
nonzero difference between the population proportions. 




11.52 Summary data for two independent samples are 
pj = 0.36, n t = 150, and p 2 = 0.29, n 2 = 100. Use the 
0.025 level of significance in testing H 0 : tt 1 < tt 2 versus 

Hj! 77 j > TT 2 . 

11.53 Summary data for two independent samples are 
p 1 = 0.31, n 1 = 400, and p 2 = 0.38, » 2 = 500. Use the 
0.05 level of significance in testing H Q : tt 1 = tt 2 versus 

H-^ : 77" j 7T? . 

11.54 A bank manager has been presented with a new 
brochure that was designed to be more effective in attract 
ing current customers to a personal financial counseling 
session that would include an analysis of additional bank- 
ing services that could be advantageous to both the bank 
and the customer. The manager's assistant, who created 
the new brochure, randomly selects 400 current 
customers, then randomly chooses 200 to receive the stan 
dard brochure that has been used in the past, with the 
other 200 receiving the promising new brochure that he 
has developed. Of those receiving the standard brochure, 
35% call for more information about the counseling ses- 
sion, while 42% of those receiving the new brochure call 
for more information. Using the 0.10 level of significance, 
is it possible that the superior performance of the new 
brochure was just due to chance and that the new 
brochure might really be no better than the old one? 

1 1 .55 Medical researchers monitoring two groups of 
physicians over a 6-year period found that, of 3429 doc- 
tors who took aspirin daily, 148 died from a heart attack 
or stroke during this period. For 1710 doctors who 



received a placebo instead of aspirin, 79 deaths were 
recorded. At the 0.01 level of significance, does this study 
indicate that taking aspirin is effective in reducing the 
likelihood of a heart attack? source: Dan Sperling, Kim Painter, 
and Tim Friend, "Study Disagrees with Aspirin's Benefits," USA Today, 
February 1, 1988, p. 4D. 

11.56 In a study by the United Dairy Industry Association, 
42% of 655 persons age 35-44 said they seldom or never 
ate cottage cheese. For 455 individuals age 45-54, the cor- 
responding percentage was 34%. At the 0.01 level, can we 
reject the possibility that the population percentages could 
be equal for these two groups? source: United Dairy Industry 
Association, Cottage Cheese: Attitudes & Usage, p. 21 . 

11.57 During the course of a year, 38.0% of the 213 
merchant ships lost were general cargo carriers, while 
43.8% of the 219 ships lost during the following year 
were in this category. Using a two-tail test at the 0.05 
level, examine whether this difference could have been 
the result of chance variation from one year to the next. 
source: Bureau of the Census, Statistical Abstract of the United States 
1996, p. 657. 

11.58 Of 200 subjects approached by interviewer A, 
45 refused to be interviewed. Of 120 approached by 
interviewer B, 42 refused an interview. At the 0.05 level 
of significance, can we reject the possibility that the inter- 
viewers are equally capable of obtaining interviews? 

11.59 Community National Bank is using an observational 
study to examine the utilization of its new 24-hour bank- 
ing machine. Of 300 males who used the machine last 
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week, 42% made two or more transactions before leaving. 
Of 250 female users during the same period, 50% made at 
least two transactions while at the machine. At the 0.10 
level, do males and females differ significantly in terms of 
making multiple transactions? Determine and interpret the 
p-value for this test, then construct and interpret the 90% 
confidence interval for tt 1 — tt 2 - 

1 1 .60 A telephone sales solicitor, trying to decide between 
two alternative sales pitches, randomly alternated between 
them during a day of calls. Using approach A, 20% of 100 
calls led to requests for the mailing of additional product 
information. For approach B in another 100 calls, only 
14% led to requests for the product information mailing. 
At the 0.05 level, can we conclude that the difference in 
results was due to chance? Determine and interpret the 
p-value for this test, then construct and interpret the 
95% confidence interval for tTj — tt 2 . 

11.61 In a preliminary study, the U.S. Veterans Affairs 
Department found that 30.9% of the 81 soldiers who were 
near an accidental nerve gas release just after the 1991 Per- 
sian Gulf War had muscle and bone ailments, compared to 
a rate of 23.5% for the 52,000 Gulf veterans who were 
not near that area. Use an appropriate one-tail test and the 
0.10 level of significance in examining the difference 
between these two rates. Determine and interpret the 
p-value for the test, source: John Diamond, "Some Veterans Suffer 
Bone, Muscle Ailments," The Indiana Gazette, January 22, 1997, p. 5. 

11.62 According to the ICR Research Group, 63% of 
Americans in the 18-34 age group say they are comfort- 
able filing income tax returns electronically, compared to 
just 49% of those who are 55-64. Using the 0.025 level 
of significance, and assuming there were 200 persons sur- 
veyed from each age group, examine whether Americans 
in the 18-34 age group might be more comfortable with 
electronic filing than their counterparts in the 55-64 
group. Determine and interpret the p-value for the test. 
source: Anne R. Carey and Genevieve Lynn, "E-Filing: No Problem," USA 
Today, March 22, 2000, p. 1B. 

/ data set / Note: Exercises 11.63-11.65 require a 
computer and statistical software. 

11.63 A Dun & Bradstreet survey of business owners 
found that 23% of female business owners had a business 



home page compared to 16% of male business owners. 
Assume that file XR11063 contains the underlying data for 
each group, coded as 1 = no business home page and 2 = 
business home page. Using the 0.01 level of significance, 
evaluate the null hypothesis that female and male 
business owners across the country might really be 
equally likely to have a home page for their business. 
Identify and interpret the p-value for the test, then 
construct and interpret the 99% confidence interval for 
the difference between the population proportions. 
source: Anne R. Carey and Suzy Parker, "Women Embrace the Web," USA 
Today 1999 Snapshot Calendar, November 27. 

11.64 Attempting to improve the quality of services pro- 
vided to customers, the owner of a chain of high-fashion 
department stores randomly selected a number of clerks 
for special training in customer relations. Of this group, 
only 10% were the subject of complaints to the store 
manager during the 3 months following the training. On 
the other hand, 15% of a sample of untrained clerks were 
mentioned in customer complaints to the manager during 
this same period. The data are in file XR11064, with data 
for each group coded as 1 = not mentioned in a 
complaint and 2 = mentioned in a complaint. Using the 
0.05 level of significance, does the training appear to be 
effective in reducing the incidence of customer dissatisfac- 
tion with sales personnel? Identify and interpret the 
p-value for the test. 

11.65 A study by the National Marine Manufacturers 
Association found that 12.2% of those who participated 
in sailing were females age 25-34. Of those who 
participated in horseback riding during the same period, 
14.7% were females in this age group. Assume that file 
XR11065 contains the underlying data for each activity 
group, coded as 1 = not a female in this age group and 

2 = a female in this age group. Using the 0.10 level of sig- 
nificance, test whether the population proportion could be 
equal for females age 25-34 participating in each of these 
activities. Identify and interpret the p-value for the test, 
then construct and interpret the 90% confidence interval 
for the difference between the population proportions. 
source: National Marine Manufacturers Association, The Boating Market: 
A Sports Participation Study p. 40. 



1 1 .7 COMPARING THE VARIANCES OF 

TWO INDEPENDENT SAMPLES 



There are occasions when it is useful to compare the variances of two independent 
samples. For example, we might be interested in whether one manufacturing pro- 
cess differs from another in terms of the amount of variation among the units pro- 
duced. We can examine two different portfolio strategies to determine whether 
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there is significantly more variation in the performances of the investments in one 
of the portfolios than in the other. We can also compare the variances of two 
independent samples to determine the permissibility of using the pooled-variances 
i-test of Section 11.2, which assumes that the standard deviations (and, thus, the 
variances as well) of the respective populations are equal. 

The test in this section involves the F distribution. Like the t distribution, it is 
a family of distributions and is continuous. Unlike the t distribution, however, its 
exact shape is determined by two different degrees of freedom instead of just a 
single value. From a theoretical standpoint, the F distribution is the sampling dis- 
tribution of s\ls\ that would result if two samples were repeatedly drawn from 
the same, normally distributed population. 

In terms of the hypothesis-testing procedure introduced in Chapter 10, the 
test can be described as follows: 

1. Formulate null and alternative hypotheses. The null and alternative hypothe- 
ses are H Q : a\ = u\ and H^: a\ ¥= u\. 

2. Select the significance level, a. There are three F distribution tables in 
Appendix A (Table A. 6, Parts A-C). They represent upper-tail areas of 0.05, 
0.025, and 0.01, respectively. Since these are one-tail areas, they represent a = 
0.10, a = 0.05, and a = 0.02 for our two-tail test. 

3. Calculate the test statistic. The calculated test statistic is 




whichever is larger 



4. Identify the critical value of the test statistic and state the decision rule. Although 
the test is nondirectional (i.e., H^. <j\ + crjj), there will be just one critical 
value of F. This is because we have selected the larger of the two ratios in step 3. 

The critical value of F will be 

F(a/2, Vj, v 2 ) where a. = specified level of significance: 

0.10, 0.05, or 0.02 
v 1 = (n — 1), where n is the size of the sample 

that had the larger variance 
v 2 = (n — 1), where n is the size of the sample 
that had the smaller variance 

The critical value is found by consulting the F table that corresponds to a/2 
(0.05, 0.025, or 0.01), with v 1 = the number of degrees of freedom associated 
with the numerator of the F ratio, and v 2 = the number of degrees of freedom 
for the denominator. If v 1 or v 2 happens to be one of the larger values not 
included in the table, interpolate between the listed entries. The decision rule 
is, "Reject H Q if calculated F > critical F, otherwise do not reject." 

5. Compare the calculated and critical values and reach a conclusion. If the cal- 
culated F exceeds the critical F, we are not able to assume that the population 
variances are equal. 

6. Make the related decision. This will depend on the purpose for the test. For 
example, if the variance for one investment portfolio strategy differs signifi- 
cantly from the variance for another, we may wish to pursue the one for which 
the variance is lower. The ability to assume equal variances would also allow 
us to compare two sample means with the pooled-variances £-test of Sec- 
tion 11.2. However, keep in mind that the unequal-variances £-test of Section 
11.3 can routinely be applied without having to go through the inconvenience 
of the test for variance equality. 
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Comparing Variances 

A sample of 9 technicians exposed to the standard version of a training film 
required an average of 31.4 minutes to service a compressor system, with a stan- 
dard deviation of 14.5 minutes. For 7 technicians viewing an alternative version of 
the film, the average time required was 22.3 minutes, with a standard deviation of 
10.2 minutes. If the sampled populations are approximately normally distributed, 
can (i\ and a\ be assumed to be equal? The underlying data are in file CX11TECH. 

SOLUTION 

Formulate Null and Alternative Hypotheses 

The null and alternative hypotheses are H Q : cr\ = a\ and H^. a\ ¥= <j\. 

Select the Significance Level, a 

For this test, the level of significance will be a = 0.02. 

Calculate the Test Statistic 

The calculated F statistic will be 



s\ 14.5 2 s| 10.2 2 

~2 = 7772 or ~2 = 7772' whichever is larger 
si; 10. 2 Z st 14. 5 Z 



Since the first ratio is larger, the calculated F is 

14.5 2 



10.2 2 



2.02 



Identify the Critical Value of the Test Statistic and State the Decision Rule 

The sample associated with the numerator of the F statistic is the one having the 
larger variance. It had a sample size of 9, so v l is9 — 1, or v t = 8. The sample as- 
sociated with the denominator of the F statistic is the one having the smaller vari- 
ance. It had a sample size of 7, so v 2 = 7 — 1j or v i = 6. 

The specified level of significance was a = 0.02, and the critical value of F will 
be F(a/2, v l5 v 2 ) = F(0.01, 8, 6). Referring to the F table with an upper-tail area 
of 0.01, with Vj = 8 and v 2 = 6, the critical value is found to be 8.10. The deci- 
sion rule is, "Reject H 0 if calculated F > 8.10, otherwise do not reject." 

Compare the Calculated and Critical Values and Reach a Conclusion 

As the test summary in Figure 11.7 shows, the calculated F (2.02) does not exceed 
the critical value (8.10), so the null hypothesis of equal population variances is not 
rejected. 

Make the Related Decision 

At the 0.02 level of significance, we conclude that the variances in service times 
associated with the two different technician-training films could be equal. If we 
were to compare the means of the two samples, the population standard devia- 
tions could be assumed to be equal and it would be permissible to apply the 
pooled-variances £-test of Section 11.2. Computer Solutions 11.6 describes 
Excel and Minitab procedures for testing whether two population variances 
could be equal. 



Chapter 11: Hypothesis Tests Involving Two Sample Means or Proportions 



397 



ffn 



<7[ = £T 2 



(orH 0 : 



Do not reject H 0 



— Reject H 0 — » 

Area = a/2 
= 0.01 



FIGURE 11.7 



The F distribution can be 
used to test the null 
hypothesis that the popula- 
tion variances are equal. In 
this test at the a = 0.02 
level, the calculated Fwas 
less than the critical F, so 
the population variances 
can be assumed to be 
equal. 



F=8.10 



Test statistic: 
F = 2.02 



computer solutions 1 1 .6 




Testing for the Equality of Population Variances 

These procedures show how to use the F-test to compare the variances of two independent samples. 

EXCEL 





D 


E 


F 


1 


F-Test Two-Sample for Variances 


2 








3 




Method 1 


Method2 


4 


Mean 


31.40 


22.30 


5 


Variance 


210.20 


104.03 


6 


Observations 


9 


7 


7 


df 


8 


6 


8 


F 


2.02 




9 


P(F<=f) one-tail 


0.2035 




10 


F Critical one-tail 


8.10 





Excel F-test comparing sample variances, based on raw data 

1 . For data file CX11TECH.XLS on which Figure 1 1 .7 is based, the label and 9 data values for method 1 are in column A, 
and the label and 7 data values for method 2 are in column B. Click Tools. Click Data Analysis. Click F-Test Two- 
Sample for Variances. Click OK. 

2. Enter A1:A10 into the Variable 1 Range box. Enter B1:B8 into the Variable 2 Range box. (The sample with the 
greater variance should always be specified as "Variable 1 .") 

3. Select Labels. Specify the significance level by entering 0.01 into the Alpha box. (We are doing a two-tail test at the 
0.02 level, but Excel only performs a one-tail test, so we must tell Excel to do its one-tail test at the 0.01 level to get 
comparable results.) 

4. Select Output Range and enter D1 into the box. Click OK. The p-value for our two-tail test is 0.407 — that is, double 
the 0.2035 value that Excel shows for its one-tail test. 
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Excel F-test comparing sample variances, based on summary statistics 

1 . Using the summary statistics associated with CX11TECH.XLS (the test is summarized in Figure 1 1 .7): Open the TEST 
STATISTICS.XLS workbook, supplied with the text. 

2. Using the arrows at the bottom left, select the F-Test_2 Variances worksheet. 

3. Enter the sample sizes and variances into the appropriate cells along with an alpha level for the test. The calculated 
F-statistic and a two-tail p-value will be shown at the right. 



MINITAB 

Minitab F-test comparing sample variances, based on raw data 
Test for Equal Variances: Methodl, Method2 

F-Test (normal distribution) 

Test statistic = 2.02, p-value = 0.407 

1 . Using data file CX11TECH.MTW on which Figure 1 1 .7 is based, with the data for method 1 in column C1 and the data 
for method 2 in column C2: Click Stat. Select Basic Statistics. Click 2 Variances. 

2. Select Samples in different columns. Enter C1 into the First box and C2 into the Second box. Click OK. A portion 
of the printout, including the two-tail p-value, is shown here. 

Minitab F-test for comparing sample variances, based on summary data 

Follow steps 1 and 2 above, but select Summarized data in step 2 and insert the appropriate summary statistics for 
each sample into the Sample size and Variance boxes. 



exercis 




til 



11.66 For two samples from what are assumed to be nor- 
mally distributed populations, the sample sizes and stan- 
dard deviations are n 1 = 10, s x = 23.5, « 2 = 9, and s 2 = 
10.4. At the 0.10 level of significance, test the null hypoth- 
esis that the population variances are equal. Would your 
conclusion be different if the test had been conducted at 
the 0.05 level? At the 0.02 level? 

11.67 For two samples from what are assumed to be nor- 
mally distributed populations, the sample sizes and stan- 
dard deviations are n 1 = 28, s x = 103.1, « 2 = 41, and 

s 2 = 133.5. At the 0.10 level of significance, test the null 
hypothesis that the population variances are equal. Would 
your conclusion be different if the test had been conducted 
at the 0.05 level? At the 0.02 level? 

11.68 Given the information in Exercise 11.10, and using 
the 0.02 level of significance in comparing the sample 
standard deviations, were we justified in assuming that 
the population standard deviations are equal? Would 
your conclusion change if the standard deviation of play- 
ing times for music boxes equipped with the new battery 



design had been 20% larger than the value calculated in 
Exercise 11.10? 

11.69 Given the information in Exercise 11.11, and using 
the 0.05 level of significance in comparing the sample 
standard deviations, were we justified in assuming that 
the population standard deviations were equal? Would 
your conclusion change if the standard deviation for the 
FoodFarm jars had been 3.0 grams? 

11.70 According to the National Association of Home- 
builders, the average life expectancies of a dishwasher 
and a garbage disposal are about the same: 10 years. 
Assume that their finding was based on a sample of 

«j = 60 dishwashers and « 2 = 40 garbage disposals, and 
that the corresponding sample standard deviations were 
s l = 3.0 years and s 2 = 3.7 years. Using the 0.02 level of 
significance, examine whether the population standard 
deviations for the lifetimes of these two types of 
appliances could be the same, source: National Association of 
Homebuilders, 7998 Housing Facts, Figures, and Trends, p. 9. 
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11.71 In conducting her annual review of suppliers, a 
purchasing agent has collected data on a sample of 
orders from two of her company's leading vendors. On 
average, the 24 shipments from company 1 have arrived 
3.4 days after the order was placed, with a standard 
deviation of 0.4 days. The 30 shipments from company 2 
arrived an average of 3.6 days after the order was 
placed, with a standard deviation of 0.7 days. The aver- 
age time for shipments to be received is about the same, 
regardless of supplier, but the purchasing agent is 
concerned about company 2's higher variability in 
shipping time. Using the 0.025 level of significance in a 
one-tail test, should the purchasing agent conclude that 
company 2's higher standard deviation in shipping times 
is due to something other than chance? 

/ data Set / Note: Exercises 11.72-11.74 require a 
computer and statistical software. 

11.72 A researcher has observed independent samples of 
females and males, recording how long each person took 



to complete his or her shopping at a local mall. The 
respective times, in minutes, are listed in file XR1 1072. 
Using the 0.025 level of significance in a one-tail test, 
would females appear to exhibit more variability than 
males in the length of time shopping in this mall? 

11.73 For independent samples of customers of Internet 
providers A and B, the ages are as listed in file XR11073. 
Using the 0.05 level of significance in a nondirectional 
test, examine whether the variation in ages could be the 
same for customers of the two providers. 

11.74 An investment analyst has data for the past 8 years 
on each of two mutual funds, with the annual rates of re- 
turn for each listed in file XR11074. On average, each 
fund has had an excellent annual rate of return over the 
time period for his data, but the analyst is concerned that 
a mutual fund with greater variation in rate of return 
tends to involve greater risk for his investment clients. 
Using the 0.05 level in a nondirectional test, examine 
whether the two mutual funds have differed significantly 
in their performance variability. 



SUMMARY 11.8 



• Comparing sample means or proportions 

One of the most useful applications of business statistics involves comparing two 
samples to examine whether a difference between them is (1) significant or 
(2) more likely due to chance variation from one sample to the next. As with test- 
ing hypotheses for just a single sample, comparing the means or proportions from 
two samples requires comparing the calculated value of a test statistic with its 
critical value(s), then deciding whether the null hypothesis should be rejected. 

• Independent versus dependent samples 

Independent samples are those for which the selection process for one is not 
related to the selection process for the other. An example of independent samples 
occurs when subjects are randomly assigned to the experimental and control 
groups of an experiment. Samples are dependent when the selection process for 
one is related to the selection process for the other. A typical example of depen- 
dent samples occurs with before-and-after measurements for the same individuals 
or test units. In this case, there is really only one variable: the difference between 
the two measurements recorded for each individual or test unit. 

• Tests for comparing means of independent samples 

When comparing the means of independent samples, the t-test is applicable when- 
ever the population standard deviations are unknown. The pooled-variances 
i-test is used whenever the population standard deviations are assumed to be 
equal, regardless of the sample size. The population standard deviations are typi- 
cally unknown. 

The chapter describes the unequal-variances £-test to be used if the population 
standard deviations are unknown and cannot be assumed to be equal. The 
unequal-variances £-test involves a special correction formula for the number of 
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degrees of freedom (df). The z-test can be used as a close approximation to the 
unequal-variances i-test when the population standard deviations are not as- 
sumed to be equal, but samples are large (each n > 30). 

• Comparing proportions from independent samples 

Comparing proportions from two independent samples involves a Z-test. When 
samples are large and other conditions described in the chapter are met, the nor- 
mal distribution is a close approximation to the binomial. 

• Comparing variances for independent samples 

A special hypothesis test can be applied to test the null hypothesis that the 
population variances are equal for two independent samples. Based on the F dis- 
tribution, it has many possible applications, including determining whether the 
pooled-variances £-test is appropriate for a given set of data. 




Pooled-variances f-test for comparing the means of two independent samples, 
tr y and (r 2 unknown and assumed to be equal 



x 2> ~ (Ml _ M 2 )o 
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where Xj and x 2 = means of samples 1 and 2 

— /x 2 )o = hypothesized difference between 
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Unequal-variances f-test for comparing the means of two independent 
samples, a 1 and <t 2 unknown and not assumed to be equal 



(x 1 



}) ~ (Mi - M 2 )o 



1 2 

- + — 



where x 1 and x 2 



means of samples 1 and 2 
(fji 1 — /x 2 ) 0 = hypothesized difference between 
the population means 
«j and «2 = sizes of samples 1 and 2 
Sj and s 2 = standard deviations of 



with df ■ 
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z-test approximation for comparing the means of two independent samples, 
tr 1 and (t 2 unknown, and each n > 30 

z = gi - H) ~ jg - m 2 )q where _ and _ = means Qf samplgs j and 2 

s 2 (/Xj — /j, 2 ) 0 = hypothesized difference between 

+ ~ the population means 

«j and «2 = sizes of samples 1 and 2 
Sj and s 2 = standard deviations of samples 1 
and 2 



Confidence interval for 



z \l — + — 
01/2 V n x n 2 

Comparing proportions from two independent samples 

• z-test, with test statistic 

1. When the hypothesized difference is zero (the usual case): 

(Pi ~ Pi) 

where p x = observed proportion, sample 1 
p(\ — p\i 1 _|_ 1 1 Pi = observed proportion, sample 2 

- sample size, sample 1 
w 2 = sample size, sample 2 
p = pooled estimate of the population 
proportion 

n 1 p 1 + rc 2 p 2 

with p = 

Wj + « 2 

2. When the hypothesized difference is (ir 1 — ir 2 ) 0 ^ 0: 

(Pi ~ P 2 ) ~ (^i~ ^ 2 )o 
~ Pi) Pi* 1 ~ P2> 

Mj » 2 

Confidence interval for (ttj — tt 2 ): 



(Pi-p,)±zJ H1 - p ^ H1 -^ ] 



Comparing the means when samples are dependent 

• i-test, with test statistic: 

where d = for each individual or test unit, (xj — x 2 ), 



the difference between the two measurements 
d = the average difference, = Ldjn 
n = number of pairs of observations 



s d = the standard deviation of d, or \/ ^' n ^ 

' n — 1 

df=n-\ 



Confidence interval for /jl^: 



A + t d 

d ± ta/izr 

V n 
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Comparing variances from two independent samples 

• f-test, with test statistic: 

Critical value of F: 



^1 °2 
F = -r or -y, whichever is larger 



F(a/2, v v v 2 ) where a = specified level of significance for a nondirectional 

test: 0.10,0.05, or 0.02 
j>j = (n — 1), where n is the size of the sample that had 

the larger variance 
v 2 = (n — 1), where n is the size of the sample that had 

the smaller variance 



Reminder: Be sure to refer to Figure 11.1 in selecting the 
testing procedure for an exercise. When either test can be 
applied to the same data, the unequal-variances t-test is 
preferable to the z-test — especially when doing the test 
with computer assistance. 

11.75 Suspecting that television repair shops tend to 
charge women more than they do men, Emily disconnected 
the speaker wire on her portable television and took it to a 
sample of 12 shops. She was given repair estimates that av- 
eraged $85, with a standard deviation of $28. Her friend 
John, taking the same set to another sample of 9 shops, 
was provided with an average estimate of $65, with a stan- 
dard deviation of $21. Assuming normal populations with 
equal standard deviations, use the 0.05 level in evaluating 
Emily's suspicion. Using the appropriate statistical table, 
what is the approximate p- value for this test? 

11.76 The manufacturer of a small utility trailer is inter- 
ested in comparing the amount of fuel used in towing the 
trailer with that required to overcome the weight and 
wind resistance of a rooftop carrier. Over a standard test 
route at highway speeds, 8 trips with the trailer resulted 
in an average fuel consumption of 0.28 gallons, with a 
standard deviation of 0.03. Under similar conditions, 

10 trips with the rooftop carrier led to an average 
consumption of 0.35 gallons, with a standard deviation 
of 0.05 gallons. Assuming normal populations with equal 
standard deviations, use the 0.01 level in examining 
whether pulling the trailer uses significantly less fuel than 
driving with the rooftop carrier. 

11.77 A compressor manufacturer is testing two different 
designs for an air tank. Testing involves pumping air into a 
tank until it bursts, then noting the air pressure just prior to 
tank failure. Four tanks of design A are found to fail at an 



average of 1400 pounds per square inch (psi), with a stan- 
dard deviation of 250 psi. Six tanks of design B fail at an 
average of 1620 psi, with a standard deviation of 230 psi. 
Assuming normal populations with equal standard devia- 
tions, use the 0.10 level of significance in comparing the 
two designs. Construct and interpret the 90% confidence 
interval for the difference between the population means. 

1 1 .78 An industrial engineer has designed a new worksta- 
tion layout that she claims will increase production 
efficiency. For a sample of 8 workers using the new work- 
station layout, the average number of units produced per 
hour is 36.4, with a standard deviation of 4.9. For a 
sample of 6 workers using the standard layout, the aver- 
age number of units produced per hour is 30.2, with a 
standard deviation of 6.2. Assuming normal populations 
with equal standard deviations, use the 0.025 level of 
significance in testing the engineer's claim. 

11.79 Egbert Techster is very serious about winning this 
year's All- American Soap Box Derby competition. Since 
there is no wind-tunnel time available at the aerospace 
firm where his father works, Egbert is using a local hill to 
test the effectiveness of an aerodynamic design change. In 
10 runs with the standard car, the average time was 22.5 
seconds, with a standard deviation of 1.3 seconds. With 
the design change, 12 runs led to an average time of 21.9 
seconds, with a standard deviation of 0.9 seconds. Assum- 
ing normal populations with equal standard deviations, 
use the 0.10 level in helping Egbert determine whether the 
modifications are really effective in improving his racer's 
speed. Using the appropriate statistical table, what is the 
approximate p-value for this test? 

11.80 The developer of a new welding rod claims that 
spot welds using his product will have greater strength 
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than conventional welds. For 45 welds using the new rod, 
the average tensile strength is 23,500 pounds per square 
inch, with a standard deviation of 600 pounds. For 40 
conventional welds on the same materials, the average 
tensile strength is 23,140 pounds per square inch, with a 
standard deviation of 750 pounds. Use the 0.01 level in 
testing the claim of superiority for the new rod. Using the 
appropriate statistical table, what is the approximate 
p-value for this test? 

11.81 Independent random samples of vehicles traveling 
past a given point on an interstate highway have been 
observed on Monday versus Wednesday. For 16 cars 
observed on Monday, the average speed was 59.4 mph, 
with a standard deviation of 3.7 mph. For 20 cars 
observed on Wednesday, the average speed was 56.3 mph, 
with a standard deviation of 4.4 mph. At the 0.05 level, 
and assuming normal populations, can we conclude that 
the average speed for all vehicles was higher on Monday 
than on Wednesday? What is the most accurate statement 
that can be made about the p-value for this test? 

11.82 A sample of 25 production employees has been 
tested twice on a standard test of manual dexterity. The 
average change in the time required to finish the test was 
a decrease of 1.5 minutes, with a standard deviation of 
0.3 minutes. At the 0.05 level, can we conclude that the 
average production employee will complete the test more 
quickly the second time he or she takes it? 

11.83 For a sample of 48 finance majors, the average time 
spent reading each issue of the campus newspaper is 19.7 
minutes, with a standard deviation of 7.3 minutes. The 
corresponding figures for a sample of 40 MIS majors are 
16.3 and 4.1 minutes. Using the 0.01 level of significance, 
test the null hypothesis that the population means are 
equal. Using the appropriate statistical table, what is the 
approximate p-value for this test? Construct and interpret 
the 99% confidence interval for the difference between the 
population means. 

11.84 Two machines are supposed to be producing steel 
bars of approximately the same length. A sample of 35 
bars from one machine has an average length of 37.013 
inches, with a standard deviation of 0.095 inches. For 
38 bars produced by the other machine, the correspond- 
ing figures are 36.974 inches and 0.032 inches. Using the 
0.05 level of significance, test the null hypothesis that the 
population means are equal. Using the appropriate statis- 
tical table, what is the approximate p-value for this test? 
Construct and interpret the 95% confidence interval for 
the difference between the population means. 

11.85 Observing a sample of 35 morning customers at a 
convenience store, a researcher finds their average stay in 
the store is 3.0 minutes, with a standard deviation of 0.9 
minutes. For a sample of 43 afternoon customers, the 
mean and standard deviation were 3.5 and 1.8 minutes, 
respectively. Using the 0.05 level of significance, determine 



whether the population means could be equal. Construct 
and interpret the 95% confidence interval for the 
difference between the population means. 

11.86 In an experiment comparing a new long-distance 
golf ball with the conventional design, each of 20 golfers 
hits one drive with each ball. The average golfer in the 
sample hit the new ball 9.3 yards farther, with a standard 
deviation of 10.5 yards. At the 0.005 level, evaluate the 
effectiveness of the new ball in increasing distance. Using 
the appropriate statistical table, what is the approximate 
p-value for this test? 

11.87 The idling speed of 14 gasoline-powered generators 
is measured with and without an oil additive that is 
designed to lower friction. With the additive installed, the 
mean change in speed was +23 revolutions per minute 
(rpm), with a standard deviation of 3.5 rpm. At the 0.05 
level of significance, is the additive effective in increasing 
engine rpm? 

11.88 A motel manager, concerned with customer theft 
of towels, decided that the theft rate might be reduced by 
changing from white, imprinted towels to a drab green 
version. Of the 120 guests provided with the white tow- 
els, 35% took at least one towel with them when they 
checked out. Of the 160 guests given the drab green tow- 
els, only 25% checked out with one or more towels in 
their possession. At the 0.01 level of significance, can we 
conclude that the manager's idea is effective in reducing 
the rate of towel theft? 

11.89 An Internal Revenue Service manager is comparing 
the results of recent taxpayer interviews conducted by 
two auditors. Of 200 taxpayers audited by Ms. Smith, 
30% had to pay additional taxes. Of 100 audited by Mr. 
Burke, only 19% paid additional taxes. At the 0.01 level 
of significance, can the manager conclude that Ms. Smith 
is a more effective auditor than Mr. Burke? 

11.90 A pharmaceutical manufacturer has come up with 
a new drug intended to provide greater headache relief 
than the old formula. Of 250 patients treated with the 
previous medication, 130 reported "fast relief from 
headache pain." Of 200 individuals treated with the new 
formula, 128 said they got "fast relief." At the 0.05 level, 
can we conclude that the new formula is better than the 
old? Using the appropriate statistical table, what is the 
approximate p-value for this test? 

11.91 In tests of a speed-reading course, it is found that 
10 subjects increased their reading speed by an average of 
200 words per minute, with a standard deviation of 75 
words. At the 0.10 level, does this sample information 
suggest that the course is really effective in improving stu- 
dents' reading speed? Using the appropriate statistical 
table, what is the approximate p-value for this test? 

11.92 A maintenance engineer has been approached by a 
supplier who promises less downtime for machines using 
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its new molybdenum-based super lubricant. Of 120 ma- 
chines maintained with the new lubricant, only 20% were 
out of service for more than an hour during the month of 
the test, while 30% of the 100 machines using the previ- 
ous lubricant were down for more than an hour. Using 
the 0.05 level in an appropriate test, evaluate the 
supplier's contention. Using the appropriate statistical 
table, what is the approximate p-value for this test? 

11.93 According to the U.S. National Center for Educa- 
tion Statistics, 80% of college and university teachers use 
computers on the job, compared to 60% of teachers 
below the college level. Using a one-tail test at the 0.01 
level, and assuming that each of these independent sam- 
ples consisted of 100 teachers, is the percentage of college 
and university teachers using computers on the job 
significantly greater than the percentage for teachers 
below the college level? Determine and interpret the 
p-value for the test, source: Bureau of the Census, Statistical 
Abstract of the United States 1999, p. 442. 

11.94 In a Maritz AmeriPoll survey, 16.6% of 500 males 
said they spend over $1000 per year on clothing. In a sur- 
vey of 500 females, 11.8% said they spend over $1000 
per year on clothing. In a two-tail test at the 0.01 level of 
significance, can we conclude that males might not differ 
from females in their likelihood of spending over $1000 
per year on clothing? Determine and interpret the p-value 
for the test, then construct and interpret the 99% 
confidence interval for the difference between the popula- 
tion proportions, source: Marcy E. Mullins, "What Men Spend on 
Clothing Each Year," USA Today, September 29, 1989, p. 1D; "What 
Women Spend on Clothing Each Year," USA Today, September 28, 

1989, p. ID. 

11.95 Given the information in Exercise 11.75, and using 
the 0.05 level of significance in comparing the sample 
standard deviations, were we justified in assuming that 
the population standard deviations were equal? Would 
your conclusion change if the standard deviation of the 
estimates received by Emily had been $35? 

11.96 Given the information in Exercise 11.76, and using 
the 0.01 level of significance in comparing the sample 
standard deviations, were we justified in assuming that the 
population standard deviations were equal? Would your 
conclusion change if the standard deviation of the data 
values with the rooftop carrier had been 0.07 gallons? 

/ data Set / Note: Exercises 11.97-11.101 require a 
computer and statistical software. 

11.97 Three years after receiving their degrees, graduates 
of a university's MBA program have reported their 
annual salary rates, with a portion of the data listed in 
file XR11097. Graduates in one of the groups represented in 
the data are employed by consulting firms, while another 
group consists of graduates who are with national-level 
corporations. Considering these groups as data from 



independent samples, use the 0.05 level in determining 
whether the sample means differ significantly from each 
other. Identify and interpret the p-value for the test, then 
generate and interpret the 95% confidence interval for 
the difference between the population means. 

11.98 A supermarket manager is examining whether a 
new plastic bagging configuration in the produce area 
might make it more efficient for customers who are bag- 
ging their own fruits and vegetables. She observes a sam- 
ple of customers using the new system and a sample of 
customers using the standard system, and she notes how 
many seconds it took each of them to unroll and open the 
plastic bag in preparation for bagging the veggies. The 
data are in file XR11098. Considering these as data from in- 
dependent samples, use the 0.025 level of significance in 
examining whether the population mean for the new bag- 
ging system might be less than that for the conventional 
system. Identify and interpret the p-value for the test. 

11.99 A company that makes an athletic shoe designed for 
basketball has stated in its advertisements that the shoe in- 
creases the jumping ability of players who wear it. The 
general manager of a professional team has conducted a 
test in which each player's vertical leap is measured with 
the current shoe versus the new model, with the data as 
listed in file XR11099. At the 0.05 level of significance, eval- 
uate the claim that has been made by the shoe company. 
Identify and interpret the p-value for the test. 

11.100 The data in file XR11100 are the weights (in 
grams) for random samples of grain packages filled by 
two different filling machines. The machines have a fine 
adjustment for the mean amount of fill, but the standard 
deviations are inherent in the design of the grain delivery 
mechanism. Based on these data, and using the 0.01 level 
of significance, is there reason to conclude that the popu- 
lation standard deviations might not be equal for the 
quantities being delivered from the two machines? Iden- 
tify and interpret the p-value for the test. 

11.101 An anthropologist studying personal 
advertisements in a Utica, New York, newspaper has ob- 
served whether the advertiser included a mention of an 
interest in the outdoors in his or her ads. According to 
the anthropologist, citing the outdoors "may be taken to 
imply not only good life habits, but also sound 
character." Overall, 58% of men and 62% of women 
mentioned the outdoors in their ad. Assuming that the 
data are in file XR11101, coded as 1 = did not mention 
the outdoors and 2 = mentioned the outdoors, use the 
0.10 level of significance in examining whether the popu- 
lation percentages for men and women who mention the 
outdoors might be the same. Identify and interpret the 
p-value for the test, then construct and interpret the 90% 
confidence interval for the difference between the popula- 
tion proportions, source: Karen S. Peterson, "Personal Ads Get 
Back To Nature," USA Today, November 23, 1999, p. 1D. 
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Thorndike Sports Equipment 



The Thorndikes have submitted a bid to be the sole 
supplier of swimming goggles for the U.S. Olympic team. 
OptiView, Inc. has been supplying the goggles for many 
years, and the Olympic committee has said it will switch to 
Thorndike only if the Thorndike goggles are found to be 
significantly better in a standard leakage test. 

For purposes of fairness, the committee has purchased 
16 examples from each manufacturer in the retail market- 
place. This is to avoid the possibility that either manufacturer 
might supply goggles that have been specially modified for 
the test. Testing involves installing the goggles on a surface 
that simulates the face of a swimmer, then submitting them to 
increasing water pressure (expressed in meters of water 
depth) until the goggles leak. The greater the number of me- 
ters before leakage, the better the quality of the goggles. 

Both companies have received copies of the test results 
and have an opportunity to offer their respective com- 
ments before the final decision is made. Ted Thorndike has 
just received his company's copy of the results, rounded to 
the nearest meter of water depth. The data are also listed 
in file C11THORN. 



82 
106 



117 
114 



73 
92 



95 
108 



Thorndike Goggles (meters) 



106 



95 
95 



110 
101 



81 

92 



101 

94 



OptiView Goggles (meters) 



83 
94 



106 
77 



70 
109 



103 
90 



86 
107 



108 
108 



100 
73 



1 . Based on analysis of these data, formulate a commen- 
tary that Ted Thorndike might wish to make to the 
committee. 

2. Based on analysis of these data, formulate a commen- 
tary that OptiView might wish to make to the com- 
mittee. 

3. What would be your recommendation to the com- 
mittee? 



Springdale Shopping Survey 



J ^ 



Item C of the Springdale Shopping Survey, introduced at 
the end of Chapter 2, describes variables 7-9 of the survey. 
These variables represent the general attitude respondents 
have toward each of the three shopping areas, and range 
from 5 (like very much) to 1 (dislike very much). Samples 
involving consumer groups often differ in their results, 
and managers find it useful to determine whether the dif- 
ferences could be due to sampling variation or whether 
there is "something going on" regarding the attitudes, per- 
ceptions, and behaviors of one consumer group versus 
another. 

Variable 7 = Attitude toward Springdale Mall 
Variable 8 = Attitude toward Downtown 
Variable 9 = Attitude toward West Mall 

Part I: Attitude Comparisons Based on Marital 
Status of Respondents 

1. For variable 7 (attitude toward Springdale Mall), carry 
out an appropriate hypothesis test to determine whether 



married persons (code = 1 on variable 28, marital 
status) have a different mean attitude than unmarried 
persons (code = 2 on variable 28). Interpret the re- 
sulting computer printout, including the p-value for 
the test. 

Repeat step 1 for variable 8 (attitude toward 
Downtown). 

Repeat step 1 for variable 9 (attitude toward 
West Mall). 

Comment on the extent to which attitudes toward 
each of the shopping areas differ between married 
and unmarried respondents. 



Part II: Attitude Comparisons Based on 
Gender of Respondent 

Repeat Part I, using variable number 26 (gender of 
respondent) instead of variable 28 (marital status of 
respondent) as the basis on which the groups are 
identified. 
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Circuit Systems, Inc. (A) 

Circuit Systems, Inc., located in northern California, is a 
company that produces integrated circuit boards for the 
microcomputer industry. In addition to salaried manage- 
ment and office staff personnel, Circuit Systems currently 
employs approximately 250 hourly production workers 
involved in the actual assembly of the circuit boards. These 
hourly employees earn an average of $11.00 per hour. 

Thomas Nelson, the Director of Human Resources at 
Circuit Systems, has been concerned with hourly employee 
absenteeism within the company. Presently, each hourly 
employee earns 1 8 days of paid sick leave per year. Thomas 
has found that many of these employees use most or all of 
their sick leave well before the year is over. After an infor- 
mal survey of employee records, Thomas is convinced that 
while most hourly employees make legitimate use of their 
sick leave, there are many who view paid sick leave as 
"extra" vacation time and "call in sick" when they want to 
take off from work. This has been a source of conflict be- 
tween the hourly production workers and management. 
The problem is due in part to a restrictive vacation policy 
at Circuit Systems in which hourly employees receive only 
one week of paid vacation per year in addition to a few 
paid holidays. With only one week of paid vacation and a 
few paid holidays, the hourly production employees work 
a 50-week year, not counting paid sick leave. 

In an effort to save money and increase productivity, 
Thomas has developed a two-point plan that was recently 
approved by the president of Circuit Systems. To combat 
the abuse of paid sick leave, hourly workers will now be 
allowed to convert unused paid sick leave to cash on a 
"three-for-one" basis, i.e., each unused day of sick leave 
can be converted into an additional one-third of a day's 
pay. An hourly employee could earn up to an additional 
six days of pay each year if he or she does not take any 
paid sick leave during the year. Even though a worker 
could gain more time off by dishonestly "phoning in sick," 
Thomas hopes that the majority of hourly employees will 
view this approved conversion of sick leave into extra pay 
as a more acceptable alternative. In the second part of his 
plan, Thomas is instituting a voluntary exercise program 
for hourly employees to improve their overall health. At an 
annual company expense of $200 for each hourly em- 
ployee who participates, Circuit Systems will subsidize 
membership in a local health club. In return, the partici- 
pating employee is required to exercise at least three times 
per week outside of regular working hours to maintain his 
or her free membership. Circuit Systems believes that, in 
the long term, an investment in employees' physical well- 




being may increase their productivity as well as reduce the 
company's future health insurance premiums. In discus- 
sions with hourly employees, Thomas has found that 
many of them approve of the exercise program and are 
willing to participate. 

Many of the supervisors that Thomas has spoken with 
believe that the paid sick leave conversion and the exercise 
program may help in curbing the absenteeism problem, but 
others do not give it much hope for succeeding and think the 
cost would outweigh any benefits. The president of Circuit 
Systems agreed to give the proposal a one-year trial period. 
At the end of the trial period, Thomas must evaluate the 
new anti-absenteeism plan, present the results, and make a 
recommendation to either continue or discontinue the plan. 

Assignment 

Over the next year, during which time the sick leave con- 
version and exercise program are in place, Thomas Nelson 
has maintained data on employee absences, use of the sick 
leave conversion privilege, participation in the exercise 
program, and other pertinent information. He has also 
gone back to collect data from the year prior to starting 
the new program in order to better evaluate the new pro- 
gram. His complete data are in the file CIRCUIT on the text 
CD. A description of this data set is given in the Data 
Description section. 

Using this data set and other information given in the 
case, help Thomas Nelson evaluate the new program to 
determine whether it is effective in reducing the average 
cost of absenteeism by hourly employees, thereby increas- 
ing worker productivity. In particular, you need to com- 
pare this year's data to last year's data to determine 
whether there has been a reduction in the average cost of 
absenteeism per hourly production worker by going to the 
new program. The case questions will assist you in your 
analysis of the data. Use important details from your 
analysis to support your recommendation. 
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Data Description company who were with the company for that entire 

The CIRCUIT file on the text CD contains data for the past period of time. A partial listing of the data is shown 
two years on the 233 hourly production employees in the here. 





Hourly 


Sick Leave 


Sick Leave 


Exercise 


Employee 


Pay 


Last Year 


This Year 


Program 


6631 


$10.97 


3.50 


2.00 


0 


7179 


11.35 


24.00 


12.50 


0 


2304 


10.75 


18.00 


12.75 


0 


9819 


10.96 


21.25 


14.00 


0 


4479 


10.59 


16.50 


11.75 


0 


1484 


11.41 


16.50 


9.75 


1 



These data are coded as follows: 

Employee: Employee ID number. 

Hourly Pay: Hourly pay of the employee in both years. Unfortunately, due to economic conditions, 

there were no pay raises last year. 

Sick Leave Last Year: Actual number of days of sick leave taken by the employee last year before the new 
program started. 

Sick Leave This Year: Actual number of days of sick leave taken by the employee this year under the new 
program. 

Exercise Program: 1, if participating in the exercise program. 

0, if not participating. 

1. Using the method presented in the chapter for com- 
paring the means of paired samples, compare the two 
years in terms of days missed before and after the new 
program was implemented. On this basis alone, does it 
appear that the program has been effective in reducing 
the number of days missed? 

2. Keeping in mind that the goal of the program is to re- 
duce the cost of absenteeism, you will need to create 
two new variables for each employee: (1) the cost of 
paid absences last year, and (2) the cost associated 
with absences this year. A few hints on creating these 
variables for each person: For (1), assume an 8-hour 
workday and consider both the person's daily pay and 
his or her number of absence days. For (2), assume an 
8-hour workday and keep in mind that the total cost 
associated with absenteeism for each person must 



include the cost of paid absences, the extra pay for un- 
used sick leave (if any), and health club membership (if 
applicable). You might call these neiv variables 
Cost_Before and Cost_After. Use these new variables 
in repeating the procedure you followed in Question 
1, then discuss the results and make a recommenda- 
tion to Mr. Nelson regarding the effectiveness and pos- 
sible continuation of the new program. 

3. Using the new variables you created in Question 2, use 
this year's costs and an appropriate statistical test in 
evaluating the effectiveness of the exercise program. 
Discuss the results and make a recommendation to 
Mr. Nelson regarding the effectiveness and possible 
continuation of the company-paid health club mem- 
berships. 



Distribution of Difference Between Sample Means 



This applet has three parts: 

1 . The upper portion shows two normally distributed pop- 
ulations. For purposes of simplicity in the applet, their 
standard deviations are the same. By moving the slider 
at the top of the applet, we can shift one of the popu- 
lation curves back and forth, thus changing the differ- 
ence between the population means. By moving the 
upper of the two sliders at the right, we can change the 
standard deviations of the populations. 

2. The center portion of the applet shows the sampling 
distribution of the means for simple random samples 
taken from each of the populations at the top. To re- 
duce complexity within the applet, the sample sizes are 
the same. By moving the lower of the two sliders at the 
right, we can change the sizes of the samples. 

3. The bottom portion of the applet shows the sampling 
distribution of (x 1 - x 2 ), the difference between the 
sample means. The standard error of this sampling 
distribution is shown at the lower right as "sigma," and 



Dirt of Means =-3 0 



Obs 



•7.5 -5.0 




sigma « 3.0 




7.5 10.0 12.5 15.0 17.5 20 0 



n1 » n2 = 16 



i 



sigma = 0.75 

•7.5 -5.0 -2.5 0.0 2.5 5.0 7.5 10.0 12.5 15.0 17.5 20.0 



Difference between 
Sample Means 




-12.5-10 



sigma = 1.00 
S 10.0 12.5 15.0 



the applet calculates it using the formula provided in 
text Section 1 1 .4, but with <r 1 and a 2 replacing s., and 
s 2 , respectively. Despite the small sample sizes, this is 
appropriate because the populations are normal and 
the population standard deviations are known. In real- 
world applications, we almost never know both popu- 
lation standard deviations, but don't let this detract 
from your use of this very interesting applet. 



Applet Exercises 

14.1 Set the top slider so that the difference between the 
population means is -3.0. Set the slider at the upper right 
so that the standard deviation of each population is 2.5. Set 
the slider at the lower right so that n, = n 2 = 20. 

a. Is there very much overlap between the two population 
curves at the top of the applet? 

b. Is there very much overlap between the two sampling 
distribution curves in the center part of the applet? 

c. Viewing the bottom portion of the applet, and assuming 
that a sample is going to be taken from each of the two 
populations, does it seem very likely that (x, - x 2 ) will be 
greater than zero? 

14.2 Repeat Applet Exercise 14.1, but with the top slider set 
so the difference between the sample means is +0.5. 

14.3 Repeat Applet Exercise 14.1, but with the top slider set 
so the difference between the sample means is +3.0. 

14.4 Use the top slider to gradually change the difference be- 
tween the population means from -0.5 to 4.5. Describe how 
this affects the graphs in the three portions of the applet. 

14.5 Use the upper right slider to gradually increase the pop- 
ulation standard deviations from 1.1 to 3.0. Describe how 
this affects the graphs in the three portions of the applet. 

14.6 Use the lower right slider to gradually increase the 
sample sizes from 2 to 20. Describe how this affects the 
graphs in the three portions of the applet. 



Synergy, ANOVA, and the Thorndikes 

Synergy refers to a situation where the effect of the whole differs from the sum of 
the effects of its parts. Plaids may look nice, stripes may look nice; but, luckily, most 
of us have someone in our lives who prevents us from wearing our nice plaid pants 
with our nice striped shirt. On the other hand, some things go together, each enhanc- 
ing the effect of the other — ocean surf and a sunny sky come to mind here. 

In statistics, we sometimes want to examine 
whether two factors might be interacting in either a e || togeth e 

very positive or a very negative way, thus having ^ things don' 

a synergistic effect on a measurement of interest. 
The technique involved is called two-way analysis 
of variance, and it's the topic of Section 1 2.5 of this 
chapter. 

In our Thorndike Sports Equipment minicase for 
this chapter, Luke and Ted have sponsored a super- 
duper racquetball player and are trying to determine 
which combination of string and racquet best suits his 
power game. By the time you get to the Thorndikes and 
their latest dilemma, you'll have read the chapter and 
should be able to help them out. 
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Describe the general approach by which analysis of variance is applied and the 
type of applications for which it is used. 

Understand the relationship between analysis of variance and the design of 
experiments. 

Differentiate between the one-way, randomized block, and two-way analysis 
of variance techniques and their respective purposes. 

Arrange data into a format that facilitates their analysis by the appropriate 
analysis of variance procedure. 

Use the one-way, randomized block, and two-way analysis of variance 
methods in testing appropriate hypotheses relative to experimental data. 

Appreciate that computer assistance is especially important in analysis of vari- 
ance tests and be able to interpret computer outputs for these tests. 



12.1 INTRODUCTION 



Among the tests in Chapter 11 was the comparison of the means for two indepen- 
dent samples. This chapter introduces analysis of variance (ANOVA), a set of 
techniques that allow us to compare two or more sample means at the same time. 

The availability of ANOVA as a technique for data analysis is an important 
consideration in the design of experiments. We will discuss this further in the next 
section, along with a presentation of the basic concepts underlying ANOVA. 

In Sections 12.3-12.5, we will examine three of the most widely used tech- 
niques involving ANOVA. Although similar in some respects, they differ in terms 
of their specific purposes and procedures. In each section, we will generate and 
interpret computer-assisted results that correspond to the small-scale, hand- 
calculated examples presented for each of these ANOVA procedures. Computer 
assistance is especially useful in ANOVA because the calculations can be quite ex- 
tensive even for small amounts of data. 



1 2.2 ANALYSIS OF VARIANCE: BASIC CONCEPTS 



Analysis of Variance and Experimentation 

In the late 1700s, the British Navy began issuing daily rations of fruit to sailors. 
This was in response to the finding that vitamin C helped to prevent scurvy, then 
a common disease. More recently, you may have used a toothpaste containing 
stannous fluoride when you brushed your teeth this morning. This is a common 
toothpaste ingredient, as dental studies have found stannous fluoride to be effec- 
tive in reducing tooth decay. The rationing of fruit to the sailors and the approval 
of stannous fluoride by the American Dental Association were decisions based on 
the results of experiments. Since ANOVA is closely related to experimentation, 
we'll introduce a few terms that are important to the remainder of the chapter: 

EXPERIMENT A study or investigation designed for the purpose of examining the 
effect that one variable has on the value of another variable. 



learning 
objectives # 

After reading this 
chapter, you should 
be able to: 
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DEPENDENT VARIABLE The variable for which a value is measured or observed. 
In ANOVA, the dependent variable will be a quantitative variable — for example, soft 
drink consumption, examination score, or the time required to type a document. 

INDEPENDENT VARIABLE A variable that is observed or controlled for the pur- 
pose of determining its effect on the value of the dependent variable. In ANOVA, 
the independent variable can be qualitative (e.g., marital status) or quantitative 
(e.g., age group). The following terms are of particular relevance to the indepen- 
dent variable: 

1. An independent variable is referred to as a factor, and one or more factors 
may be involved in a given study. 

2. The experiment may involve different factor levels (categories). 

3. Each specific level of a factor (or, in multiple-factor experiments, the intersec- 
tion of a level of one factor with a level of another factor) is referred to as a 
treatment. 



When there is only one factor in an experiment, factor levels and treatments are 
synonymous. Therefore, when dealing with an experiment that involves just one 
factor, we will use the terms factor levels and treatments interchangeably. 

Our emphasis in the chapter will be on designed experiments, in which we ac- 
tually assign treatments to persons or test units on a random basis. However, the 
techniques may also be applied when we wish to compare treatments that cannot 
be randomly assigned. For example, we might like to compare incomes for high 
school, college, and graduate school graduates, but we obviously cannot ran- 
domly assign these treatments to individuals. In such cases, our analysis would 
have to be based on data that already exist. 

As has been suggested, the starting point for ANOVA is often an experiment 
in which we try to determine whether various levels of a given factor might be 
having different effects on something we're observing or measuring. For exam- 
ple, a lock manufacturer might test four different lock designs to determine 
whether they differ significantly in the amount of force each will withstand just 
prior to breakage: 

Factor: Lock Design 
Level 





1 


2 


3 


4 


Measurements: Breaking Strength, 


1050 


957 


1008 


1235 


Pounds, 3 Locks of Each Design 


1023 


1114 


849 


1093 




981 


1056 


972 


1110 



Mean 



1018.0 1042.3 943.0 1146.0 pounds 



In this experiment, the dependent variable is "breaking strength," the inde- 
pendent variable (or factor) is "lock design," and the factor levels (or treatments) 
are the four different designs being tested. Again, since there is just one factor, 
each factor level can also be referred to as a treatment. This is an example of a 
balanced experiment, one in which an equal number of persons or test units re- 
ceives each treatment. There are three test units (locks) in each treatment group. 

In its most basic application, ANOVA can determine whether the sample 
means (x 1 x 2 x 3 , and xA differ significantly from one another or whether the 
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differences observed could have been due to chance. For example, in the experi- 
ment just presented, can we conclude that the populations of these four lock de- 
signs are equally strong? 



In the preceding experiment, the variation in the breaking strengths can be viewed 
in terms of (1) variation between the groups, reflecting the effect of the factor lev- 
els (the four types of lock design), and (2) variation within the groups, which rep- 
resents random error from the sampling process. Comparing these two kinds of 
variation is the basis of ANOVA. 

To show how these two kinds of variation are applicable to ANOVA, we'll 
rely on two computer-assisted examples. Each will consist of three independent 
samples represented in dotplot diagrams, and each can be viewed as a separate ex- 
periment that involved three treatments. Based on each set of plots, you'll be 
asked to judge whether the samples could have come from populations that had 
the same value for the population mean. 

Part A As a first step in this "you be the judge" scenario, consider the three inde- 
pendent samples represented in part A of Table 12.1. From these plots (without peek- 
ing at the table caption), would it appear that all three samples could have come from 
populations having the same value for the population mean (i.e., fi^ = fi 2 = M3)? 

Part B Next, consider the three independent samples represented in part B of Table 
12.1. Does it seem possible that all three of these samples might have come from 
populations having the same value for the population mean (i.e., fi^ = fi 2 = /a 3 )? 



Variation Between and Within the Groups 




A. H 0 : /x 1 = fjL 2 = fi 3 versus Hy The population means are not equal. 



The basis for ANOVA lies 
in the amount of variation 
between the samples com- 
pared to the amount of 
variation within them. For 
the three independent 
samples in part A, we 
would tend to reject 
H Q : /J,, = H2 = /*3- For the 
three independent samples 
in part B, it appears that the 
population means could be 
equal. 



20 30 40 50 60 



_+ CR0UP2 



-+ CR0UP3 

70 



_+ CR0UP1 



B. H 0 : n^ = fjL 2 = /i 3 versus H{. The population means are not equal. 



+ + + + + +-SAMPLE1 



+ + + + + +-SAMPLE2 



+ + + + + +-SAMPLE3 

104.0 112.0 120.0 128.0 136.0 144.0 
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In part A, it does not appear that the population means could be the same. In- 
tuitively, this is because the variation between the samples is rather large compared 
to the amount of variation within the samples themselves. As an analogy, the data 
points in part A somewhat resemble the likely outcome if a single-barrel shotgun 
were to be fired in three different directions. 

Looking at part B, the same intuition would tell us that these samples don't ap- 
pear to be very different from each other. Stated another way, the variation between 
the samples is small relative to the variation within the samples. Although the sample 
means are not shown in the dotplots, they would be very close together compared to 
the amount of variation within the samples themselves. Using the shotgun analogy, 
part B could well be the result of having fired a single-barrel shotgun three times in the 
same direction. 

If the data in part A of Table 12.1 were the result of applying three different 
treatments in a one-factor experiment, we would tend to conclude that the treat- 
ments are not equally effective. Taking a similar view toward a second experiment 
described by the data in part B, we would tend to conclude that the treatments in 
this experiment might be equally effective. 

Although ANOVA quantifies variation between versus within samples, the 
basic process is an extension of the intuitive conclusions drawn from parts A and B 
of Table 12.1. The most basic approach to ANOVA is the situation in which there 
is just one factor that operates on two or more different levels, as with the four lock 
designs discussed previously. Since there is just one factor (lock design), this is re- 
ferred to as one-way analysis of variance. It is the subject of the next section. 




12.1 What is meant by a designed experiment? 

12.2 "A factor level is the same as a treatment." Is this 
statement always true? If not, what conditions are neces- 
sary for it to be true? 

12.3 What is necessary for an experiment to be "balanced"? 

12.4 Explain the basic role that between-sample variation 
and within-sample variation play in carrying out an 
analysis of variance. 

12.5 Differentiate between the independent and 
dependent variables in an experiment. 

12.6 A university president collects data showing the 
number of absences over the past academic year for a 
random sample of 6 professors in the College of 
Engineering. She does the same for a random sample of 
9 professors in the College of Business and for a random 
sample of 8 professors in the College of Fine Arts. Does 
this represent a designed experiment? Explain. 

12.7 Twenty accounting students are randomly assigned 
to two different sections of an intermediate accounting 
class. Each section ends up consisting of 10 students. In 



one of the sections, computer-assisted instruction and re- 
view software is utilized; in the other section, it is not. All 
students are given the same final examination at the end 
of the semester. Does this represent a designed experiment? 
Explain. 

12.8 From each of four suppliers, a quality-control techni- 
cian collects a random sample of 10 rivets, then measures 
the number of pounds each will withstand before it fails. 
Does this represent a designed experiment? Explain. 

12.9 For the experiment described in Exercise 12.6, iden- 
tify the dependent and independent variables. Indicate 
whether each variable is quantitative or qualitative. 

12.10 For the experiment described in Exercise 12.7, 
identify the dependent and independent variables. Indicate 
whether each variable is quantitative or qualitative. 

12.11 For the experiment described in Exercise 12.8, 
identify the dependent and independent variables. Indicate 
whether each variable is quantitative or qualitative. 

12.12 The dotplots at the top of page 414 represent 
independent samples. Would you tend to reject the null 
hypothesis that /jl a = /j, b ? Explain. 



414 



Part 4: Hypothesis Testing 



MINITAB 
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ONE-WAY ANALYSIS OF VARIANCE 



Purpose 

The one-way analysis of variance examines two or more independent samples to 
determine whether their population means could be equal. Since the treatments 
are randomly assigned to all of the persons or other test units in the experiment, 
this is also referred to as the one-factor, completely randomized design. When 
there are just two samples, it becomes the equivalent of the two-sample, pooled- 
variances £-test of Chapter 11. We will examine this equivalence later in this 
section. 

The null and alternative hypotheses are 



jx 1 = fi 2 = ■ ■ ■ = /jL t for treatments 1 through t 
H t : The population means are not equal. 



Model and Assumptions 

The preceding null and alternative hypotheses represent one of the possibilities in 
viewing the one-way ANOVA. It can also be described in terms of the following 
model in which each individual observation is considered to be the sum of three 
separate components: 



An individual observation or 
measurement, this is the rth 
observation for treatment ;'. 



The effect of treatment 



M + Tj 



+ e- 



The overall population mean 
for all of the treatments 



Random error associated 
with the sampling process 



According to this model, each measurement (x-) associated with treatment / is 
the sum of the actual population mean for all of the treatments (/a), the effect of 
treatment / (t-), and a sampling error (e- ; ). The null and alternative hypotheses 
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shown previously can now be presented in an equivalent form that is relevant to 
the model described here: 

H Q : t- = 0 for treatments ;' = 1 through t (Each treatment has no effect.) 

H^, t. + 0 for at least one of the / = 1 through t treatments 

(One or more treatments has an effect.) 

These null and alternative hypotheses are equivalent to those expressed in 
terms of the population means. If the effect of each treatment (t-, j = 1 through t) 
is zero, then the population means for the treatments /j- 2 , ■■■■> wm be equal. 
In terms of the (%•■ = fi + r- + e-) model, if these equivalent null hypotheses can- 
not be rejected, we must conclude that each population mean could be equal to 
the overall mean, p.. 



Regarding the terms effective, equally effective, and effect: It's possible that all of 
the treatments might be very effective in influencing the dependent variable (e.g., 
all of the lock designs tested could be virtually unbreakable). To this extent, p in 
the (x- = /jb + Tj + e--) model would be an extremely high number of pounds be- 
fore breakage. However, the purpose of the one-way ANOVA is to determine 
whether the treatments could be equally effective. If we are able to conclude that 
the treatments could be equally effective, this is equivalent to the conclusion that 
each treatment effect (t.) could be zero. 

The assumptions for one-way ANOVA are as follows: (1) The samples have been 
independently selected, (2) the population variances are equal {<t\ = a\ = ■■■ = aj), 
and (3) the population distributions are normal. Of these assumptions, that of equal 
population variances is especially important, since the procedure relies on two sepa- 
rate estimates of this common variance. One of these estimates is based on between- 
sample variation, the other is based on within-sample variation. In the event that we 
are either unable or unwilling to make the assumptions just described, the 
Kruskal-Wallis test for comparing the central tendency of two or more independent 
samples may be used as an alternative. This test is presented in Chapter 14. 



I 



NOT 



Procedure 

The procedure for carrying out the one-way ANOVA is summarized in Table 12.2 
(page 416), and its steps are generally similar to those for hypothesis tests performed 
in Chapters 10 and 11. Whether done by hand calculations or with the assistance of 
a computer statistical package (a much preferred approach), the procedure can be 
described according to the process shown in parts A through D of Table 12.2. 

Part A: The Null and Alternative Hypotheses 

The null and alternative hypotheses are expressed in terms of the equality of the 
population means for all of the treatment groups. 

Part B: The Format of the Data to Be Analyzed 

The data can be listed in tabular form, as shown, with a separate column for each 
of the t treatments. The number of observations in the columns (n v n 2 , « 3 , . . . , n t ) 
need not be equal. For greater convenience in the formulas to follow, N is shown 
in Table 12.2 as the total number of observations in all of the samples combined. 
Likewise, the grand mean (x) is the mean of all of the observations that have been 
recorded. 
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TABLE 12.2 



H 1 : The population means are not all the same. 



B. M 



Treatments, / = 1 to t 








1 2 

X 11 X 12 


3 

X 13 


f 

x 1f 




= the /th observation 
in the y'th sample 


X 21 X 22 
X 31 X 32 


X 23 

X 33 ■ ■ ■ 


X 2f 
X 3f 


hi 


— the total number of 
observations 

N = ±„j 

7 = 1 










x n,1 x n 2 2 


x n 3 3 


x n t t 


X 


= the grand mean; this 
is the mean of all 
the observations 

t "i 

»=1»=1 ; 
x= /V 


X .1 X .2 


x 3 







Source of 
Variation 


Sum of Squares 


Degrees of 
Freedom 


Mean Square 


F-Ratio 


Treatments, 
TR 


SSTR = ^(Xj - W 


f- 1 


MSTR = SSTR 
t - 1 


MSTR 
F ~ MSB 


Sampling 
Error, E 


7=1/= 1 


A/- f 


MSE= SSE 
N - t 




Total, T 


;=!/= 1 


N — 1 







Summary of the one-way 
ANOVA. The sample sizes 
(n v n 2 , n 3 , n f ) 



If F = 



MSTR 
MSE 



is > F[a, (f - 1 ), (N - t)], reject H 0 at the a level. 



In the format shown for listing the x t - data values, note that the mean for each 
treatment (/' = 1 through t treatments) has a dot within the subscript. For exam- 
ple, the mean for treatment 1 is expressed as x 1 . This indicates that this mean is 
for all values of x ;1 ; in other words, it represents the entire column of x n values. 
The dot can be considered a "wildcard" subscript that includes all values to 
which it applies; its use will be clarified in the numerical example that follows this 
discussion. 
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Part C: The Calculations for One-Way ANOVA 

Part C of Table 12.2 describes the specific computations necessary to carry out 
one-way ANOVA. Although these computations are not as imposing as they 
might seem, it remains advisable to use a computer statistical package if one is 
available. Each of these quantities is associated with a specific source of variation 
within the sample data, and they correspond to the = ^ + t- + e- model dis- 
cussed previously. 

The Sum of Squares Terms: Quantifying the Two Sources of Variation 

• Treatments, TR SSTR is the sum of squares value reflecting variation be- 
tween individual treatment means and the overall mean for all treatments (x). 
Weighted according to the sample sizes for the respective treatment groups, 
SSTR expresses the amount of variation that is attributable to the treatments. 

• Sampling error, E SSE is the sum of the squared differences between ob- 
served values and the means for their respective treatment groups; SSE ex- 
presses the amount of variation due to sampling error. 

• Total variation, T SST is the total amount of variation, or SST = SSTR + SSE. 

Making the Amounts from the Two Sources of Variation Comparable MSTR 
is the mean square for the between-group variation. It is obtained by dividing 
SSTR by an appropriate number of degrees of freedom (t — 1) so MSTR will be 
comparable to MSE in the calculation of the test statistic. 

MSE is the mean square for within-group variation. It is obtained by dividing 
SSE by an appropriate number of degrees of freedom (N — t), again so that MSTR 
and MSE will be comparable in the calculation of the test statistic. 

Part D: The Test Statistic, the Critical Value, 
and the Decision Rule 

The Test Statistic The F-ratio, MSTR/MSE, is the test statistic upon which we 
rely in reaching a conclusion. MSTR has estimated the common variance (a 2 ) 
based on variation between the treatment means. MSE has estimated the common 
variance based on variation within the treatment groups themselves, and the test 
statistic, F, is the ratio of these separate estimates of common variance: 

The estimate of a 2 based 
on the amount of variation 
between the treatment means 



MSTR ** 1 

MSE 1 

The estimate of a 2 based 
on the amount of variation 
within the treatment groups 



The Critical Value and the Decision Rule The test is right-tail, and, for a given 
level of significance (a), we will reject H Q : fiu-y = /jl 2 — • • • — P t if the calculated 
value of F is greater than F[a, (t — 1), (N — t)]. In referring to the F distribution 
table, we use the v 1 = (t — 1) column and the v 2 = (N — t) row in identifying the 
critical value. By using Seeing Statistics Applet 15, at the end of the chapter, you 



418 



Part 4: Hypothesis Testing 



can demonstrate how the F distribution and its probabilities respond to changes 
in v 1 and v 2 . 

The F distribution is used in this test because, as discussed in Chapter 11, it is 
the sampling distribution for the ratio of the two sample variances whenever two 
random samples are repeatedly drawn from the same, normally distributed popu- 
lation. To the extent that the calculated F is large, we will tend toward the conclu- 
sion that the population means might not be the same. Naturally, the specific 
conclusion we reach will depend on (1) the level of significance (a) we have se- 
lected and (2) our comparison of the calculated F with the critical F for this level 
of significance. 







example 




One-way ANOVA Procedure 




An accounting firm has developed three methods to guide its seasonal employees 
in preparing individual income tax returns. In comparing the effectiveness of 
these methods, a test is set up in which each of 10 seasonal employees is randomly 
assigned to use one of the three methods in preparing a hypothetical income 
tax return. The preparation times (in minutes) are shown in Table 12.3 and are in 
data file CX12ACCT. At the a = 0.025 level, can we conclude that the three meth- 
ods could be equally effective? 




SOLUTION 




As Table 12.3 shows, the three treatment groups are not equal in size, with 
n 1 = 4, « 2 = 3, and « 3 = 3, but this presents no problem in the analysis. The 
total number of observations in all treatment groups is as described in part B of 
Table 12.2, or 




t 

N = ]£«. = 4 + 3 + 3 and N = 10 

7=1 





TABLE 12.3 



Underlying data and the 
results of preliminary 
calculations for a one-way 
analysis of variance to 
determine whether three 
tax-preparation methods 
could be equally effective in 
guiding tax preparers 
through the completion of 
a hypothetical return con- 
trived for the purpose of 
the experiment. The three 
treatments were randomly 
assigned to 10 tax prepar- 
ers, and data are the num- 
ber of minutes each person 
required for completion of 
the return. 



Treatment Group /, with / = 1, 2, 3 



Method 1 



15 
20 
19 
14 



17.0 



Method 2 



10 
15 
11 



12.0 



Method 3 



18 
19 
23 



The Xjj values; the second 
observation in treatment 
group 3 isx 23 = 19 minutes. 



20.0 



The mean for treatment group 2 is 

10+15+11 . 
x 7 = = 12.0 minutes. 
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The grand mean (x) is the mean of the N = 10 observations. It is obtained by 
dividing the sum of all the observations by N, or 

t n i 

= lib 

and 

= 15 + 20 + 19 + 14 + 10 + 15 + 11 + 18 + 19 + 23 „ . 

x = =16.4 minutes 

10 

Having carried out the preceding calculations, we can now proceed with those 
described in part C of Table 12.2. 

Treatment Sum of Squares, SSTR 

This quantity expresses the amount of variation attributable to the treatments, 
and each treatment mean is compared to the overall, or grand mean, during the 
calculation of SSTR: 



SSTR = 5) »;■(«; " *") 2 
7=1 

= 4(17.0 - 16.4) 2 + 3(12.0 - 16.4) 2 + 3(20.0 - 16.4) 2 



and 



SSTR = 98.4 



Error Sum of Squares, SSE 

This quantity represents the amount of variation within the treatment groups, or 
sampling error. Notice that in this calculation, each observation is being com- 
pared with the mean of the treatment group of which it is a member, and that each 
treatment group "contributes" to the value of SSE. 



SSE 



t "i 

/=l!=l 


" */ 


" (15.0 - 


17.0) 2 " 


+(20.0 - 


17.0) 2 


+(19.0 - 


17.0) 2 


.+(14.0 - 


17.0) 2 . 



(10.0 - 12.0) 2 
+ (15.0 - 12.0) 2 
+ (11.0 - 12.0) 2 . 



(18.0 - 20.0) 2 
+ (19.0 - 20.0) 2 
+(23.0 - 20.0) 2 . 



and 



SSE = 54.0 



Total Sum of Squares, SST 

To apply one-way ANOVA to the testing of our null hypothesis, it isn't really nec- 
essary to use the formula in Table 12.2 to compute SST. In practice, we could just 
calculate SST = SSTR + SSE. The calculation of SST is shown here only to demon- 
strate that the total variation is the sum of variation due to the treatments (SSTR) 
plus that due to sampling error (SSE). In the expressions that follow, note that the 
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terms are similar to those of the SSE computation, except that each observation 
is now being compared to the grand mean, x = 16.4 minutes: 



SST=25>,y-*) 



/=1«=1 



and 



(15 


0 - 


16.4 


2 " 


+ (20 


0 - 


16.4 


2 


+ (19 


0 - 


16.4 


2 


+ (14 


0 - 


16.4 


2 _ 



+ 



(10.0 - 16.4) 2 
+ (15.0 - 16.4) 2 
+ (11.0 - 16.4) 2 



+ 



(18.0 - 16.4) 2 
+ (19.0 - 16.4) 2 
+ (23.0 - 16.4) 2 



SST = 152.4 or SST = SSTR + SSE 



Treatment Mean Square (MSTR) and Error Mean Square (MSE) 

SSTR and SSE must now be divided by their respective degrees of freedom so that 
(1) they will be comparable and (2) each will be a separate estimate of the com- 
mon variance that the treatment group populations are assumed to share. Recall 
that we have £ = 3 treatments and a total of N = 10 observations. 

The estimate of a 2 that is based on the between-treatment variation is 

A/fCTp SSTR 98A_ 
MSTR = = and 



t 



3 



MSTR = 49.20 



The estimate of a 2 that is based on the within-treatment, or sampling-error 
variation is 



MSE 



SSE 



54.0 



N - t 10-3 



and 



MSE = 7.71 



The Test Statistic, F 

The test statistic is the ratio of the two estimates for a 2 , or 



MSTR 49.20 



MSE 



7.71 



and 



At this point, we have generated all of the information described in part C of 
Table 12.2, and the results can be summarized as shown in the standard one-way 
ANOVA table in Table 12.4. Depending on your computer statistical package, the 
items in Table 12.4 might be the only information provided, and you might have to 
draw conclusions on the basis of statistical tables as we are now about to do. 



The Critical Value of F and the Decision 

The calculated F might seem large enough to suggest that the population means 
might not be the same, but the key question is, Is it large enough that we are able 
to reject the null hypothesis at the a = 0.025 level of significance} To find out, we 
must first look up the critical value of F from the a = 0.025 F distribution table 
in Appendix A: 

Critical Value of F = F(a, v v v 2 ) 
The df associated with the numerator of F is 



v 1 = (t~l) or (3-1) =2 
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Variation 
Source 


Sum of Squares 


Degrees of 
Freedom 


Mean Square 


F 


Treatments 1 


98.40 


2 


49.20 


6.38 


Error 2 


54.00 


7 


7.71 




Total 


152.40 


9 






'This source may also be described as "between-group" variation. 
2 This source may also be described as "within-group" variation. 



The df associated with the denominator of F is 
v 2 = (N - t) or (10 



3) = 7 



Thus, for a = 0.025, t = 3 treatments, and a total of N = 10 observations, the 
critical F is F(0.025, 2, 7) = 6.54. 

As Figure 12.1 shows, the calculated value (F = 6.38) does not exceed the criti- 
cal value (6.54), and we are not able to reject the null hypothesis, H Q : fi 1 = /j, 2 = /a 3 . 
At the 0.025 level of significance, the training methods could be equally effective. 

Using the 0.05, 0.025, and 0.01 F distribution tables in Appendix A, we can 
narrow down the exact p-value for the test: 



a 


Calculated F 




Critical 
F Value, 
F(a, 2, 7) 


Decision 


0.01 


6.38 


is not > 


9.55 


Cannot reject H Q 


0.025 


6.38 


is not > 


6.54 


Cannot reject H 0 


0.05 


6.38 


exceeds 


4.74 


Reject H 0 



Although the null hypothesis would be rejected at the 0.05 level, we are not 
able to reject it at the 0.025 level. Based on our statistical tables, the most accu- 
rate statement we can make about the p-value for the test is that it is somewhere 
between 0.025 and 0.05. 



TABLE 12.4 



Given the data in Table 
12.3 and the computations 
in the text, the result is this 
standard format describing 
the results of the one-way 
analysis of variance test. 
The format is similar to that 
shown in part C of Table 
12.2. Depending on your 
computer statistical pack- 
age, this may be all the in- 
formation that you will be 
provided. 



Do not reject H 0 



Ml = ^2 = Pi 



H l : The population means aren't equal. 



Reject H 0 

- Area = 0.025 



FIGURE 12.1 



Results of the one-way 
analysis of variance for the 
data in Table 12.3 and sum- 
mary figures in Table 1 2.4. 
The test was conducted at 
the 0.025 level, and the 
calculated Fdoes not 
exceed the critical value 
for this level of significance, 
so we are unable to reject 
the null hypothesis that 
the population means are 
equal. 



F = 6.54 



Test statistic: 
F=6.38 
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Computer Outputs and Interpretation 

The printouts in Computer Solutions 12.1 show the Excel and Minitab results for 
the ANOVA problem we've just carried out by pocket calculator. Although their 
formats differ slightly, the content of each output is very similar to the summary 
results shown in Table 12.4. 

In the Excel portion of Computer Solutions 12.1, the same quantities calcu- 
lated in our example appear in the Excel output, along with the p>-value of 
0.0265. Among the values listed is the total sum of squares (152.400), with sepa- 
rate entries for its between-group (98.400) and within-group (54.000) compo- 
nents. These are equivalent to the SST, SSTR, and SSE terms that we have been 
using in describing the respective sources of variation. 

The Minitab portion of Computer Solutions 12.1 shows the same quantities 
as Excel, including the p-value for the test. The respective p-values differ only due 



computer solutions 12.1 



One-Way Analysis of Variance 
EXCEL 







A 


B 


C 


D 


E 


F 


G 


H 


I 


J 


1 


Method 1 


Method2 


Method3 


Anova: Single Factor 














2 


15 


10 


18 
















3 


20 


15 


19 


SUMMARY 














4 


19 


11 


23 


Groups 


Count 


Sum 


Average 


Variance 






5 


14 






Method 1 


4 


68 


17 


8.667 






6 








Method2 


3 


36 


12 


7 






7 








Method3 


3 


60 


20 


7 






8 






















9 






















10 








ANOVA 














11 








Source of Variation 


SS 


df 


MS 


F 


P-value 


Fcrit 


12 








Between Groups 


98.400 


2 


49.200 


6.378 


0.0265 


6.542 


13 








Within Groups 


54.000 


7 


7.714 








14 






















15 








Total 


152.400 


9 











Excel one-way ANOVA 

1 . For the sample data in Table 1 2.3 (file CX12ACCT.XLS), with the data in columns A through C and their labels in the first 
row: Click Tools. Click Data Analysis. Click Anova: Single Factor. Click OK. 

2. Enter A1:C5 into the Input Range box. Select Grouped By Columns. Select Labels in First Row. Specify the signif- 
icance level for the test by entering 0.025 into the Alpha box. Select Output Range and enter D1 into the box. Click 
OK. The results are as shown above. 

Excel Unstacking Note: Excel requires that the samples be in separate and adjacent columns. If the data are "stacked" 
in one column, with subscripts identifying group membership in another column, it will be necessary to "unstack" the 
data. For example, if the data were in column A and the subscripts (methods) were in column B: (1) Click and drag to se- 
lect the numbers and labels in columns A and B. (2) Click Data. Click Sort. (3) Enter method into the first sort by box. 
Select Ascending. Select My list has header row. Click OK. The data for sample 1 will now be listed directly above the 
data for sample 2, which in turn will be directly above the data for sample 3 — from here, we need only select each group- 
ing of data and either move or copy it to its own column. 
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MINITAB 

Minitab one-way ANOVA 



One-way ANOVA: Methodl, Method2, Method3 



Source 


DF 




SS 




MS 


F 


P 


Factor 


2 


98 


.40 


49 


20 


6.38 


0.026 


Error 


7 


54 


.00 


7, 


71 






Total 


9 


152 


.40 











S = 2.777 R-Sq = 64.57% R-Sq(adj) = 54.44% 

Individual 95% CIs For Mean Based on 
Pooled StDev 

Level N Mean StDev + + + + 

Methodl 4 17.000 2.944 ( * ) 

Method2 3 12.000 2.646 ( * ) 

Method3 3 20.000 2.646 ( * ) 



12.0 16.0 20.0 24.0 



Pooled StDev = 2.777 



1. For the sample data in Table 12.3 (file CX12ACCT.MTW), with the data in columns C1 through C3: Click Stat. Select 
ANOVA. Click One-Way (Unstacked). 

2. Enter C1-C3 into the Responses (in separate columns) box. Click OK. 

Note: If all the data had been in a single column [i.e., "stacked"], it would have been necessary to select the One-way 
option in step 1, then to specify the column containing the data and the column containing the subscripts in step 2. 



to rounding. Note that "Factor" is used in representing the variation due to the 
treatments. As pointed out previously, when there is just one factor, treatments 
and factor levels are synonymous. Beyond the standard listing of one-way 
ANOVA summary information, Minitab provides futher information in the 
form of: 

1. The pooled estimate of the population standard deviation, "Pooled StDev = 
2.777." Keeping in mind that the model assumes the treatment-group popula- 
tions have the same variance, this part of the output indicates that 2.777 2 is 
the estimated value for the variance for each of the populations from which 
the treatment groups were drawn. 

Note that 2.777 2 = 7.71, the value listed for the error mean square in the 
"Error" row and "MS" column of the Minitab output. (MSE is equivalent to 
Sp, the pooled estimate of o 2 .) As discussed previously, one-way ANOVA uses 
the error mean square as one of the two estimates of the common variance 
that the treatment-group populations are assumed to share. 

2. The 95% confidence interval for the population mean corresponding to each 
treatment. The Minitab output shows each of these confidence intervals in a 
visual format, but they are based on the pooled estimate of tr, which is 
"Pooled StDev = 2.777." Each of these confidence intervals is calculated 
using the t distribution and the same df that is associated with the error mean 
square, MSE: 
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One-way ANOVA, confidence interval for the mean, treatment ;': 
The confidence interval for /jl- is 

where s p = ViVISE, the pooled estimate of a 
n- = number of persons or test units 

in treatment group / 
X: = sample mean for treatment group j 
t = from the t distribution, the t value for the 
desired confidence level, using the same df 
as associated with MSE 

For example, the mean for the 4 persons in treatment group 1 ("Method 1") was 
17.0 minutes, and the number of degrees of freedom associated with MSE was df= 7. 
Using the pooled estimate for the population standard deviation (2.777 minutes), 
the 95% confidence interval for p 1 is 

where df =1 degrees of freedom associated with 

MSE, and t = 2.365 for a 95% interval 



17.000 ± t 



2.777 



V4 , 

= 17.000 ± 2.365(1.389), or from 13.72 to 20.28 minutes 

We have 95% confidence that the population mean for treatment 1 is some- 
where from 13.72 to 20.28 minutes; the confidence intervals for /x 2 and /j, 3 shown 
in the Minitab portion of Computer Solutions 12.1 were determined through the 
same procedure. The simultaneous display of confidence intervals such as those in 
the Minitab printout helps us to see at a glance the extent to which one or more 
treatments might differ from the rest. 

One-Way ANOVA and the Pooled-Variances t-Test 

As mentioned previously, when there are just two treatments, one-way ANOVA and 
the pooled-variances Mest are equivalent. Each is testing whether two population 
means could be the same, and each relies on a pooled estimate for the variance that 
the populations are assumed to share. In addition, both tests assume the population 
is (at least approximately) normally distributed and the samples are independent. 

With two treatment groups, one-way ANOVA is testing H Q : = fi 2 versus 
fij: ¥= /jl 2 - The test is nondirectional, since H Q can be rejected by an extreme dif- 
ference in either direction. In a nondirectional, pooled- variances i-test, the null and al- 
ternative hypotheses are exactly the same as in one-way ANOVA with two treatment 
groups. In each case, the null hypothesis can be rejected by an extreme result in either 
direction. The tests are equivalent and their results will provide the same p-value. 



y» exercises 






v FW1 wm&~ 
V V*- ML :..WL--> 




J 















Note: The computer is highly recommended when using 
ANOVA techniques. For exercises involving raw data, 
data files are shown in parentheses. 

12.13 What is the purpose of the one-way ANOVA? 

12.14 What assumptions are required in using the 
one-way ANOVA? 



12.15 Why is the F distribution applicable to ANOVA? 

12.16 For the one-way ANOVA model, = fi + r- + 
e--, explain the meaning of each term in the equation, 
then use the appropriate terms from the equation in 
presenting and explaining the null hypotheses to be 
examined. 
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12.17 In terms of the assumptions required for using 
ANOVA, what is the special relevance of the value calcu- 
lated for the error mean square, MSF? 

12.18 On snow-covered roads, winter tires enable a car 
to stop in a shorter distance than if summer tires were 
installed. In terms of the additive model for one-way 
ANOVA, and for an experiment in which the mean stop- 
ping distances on a snow-covered road are measured for 
each of five brands of winter tires, differentiate between 
the use of the terms "effectiveness" and "effect." 

12.19 In using ANOVA to compare the means of several 
samples, what value would be expected for the calculated 
F-statistic if the population means really are the same? 
Explain. 

12.20 In a one-way ANOVA, there are three independent 
samples, with Kj = 8, n 2 = 10, and » 3 = 7. The calculated 
F-statistic is F = 3.95. At the 0.05 level of significance, 
what conclusion would be reached? Based on the F distri- 
bution tables, what is the most accurate statement that 
can be made about the p-value for this test? 

12.21 In a one-way ANOVA, there are two independent 
samples, with n l = 20 and n 2 = 15- The calculated F- 
statistic is F = 3.60. At the 0.05 level of significance, 
what conclusion would be reached? Based on the F distri- 
bution tables, what is the most accurate statement that 
can be made about the p-value for this test? 

12.22 A one-way ANOVA has been conducted for an 
experiment in which there are three treatments and each 
treatment group consists of 10 persons. The results include 
the sum of squares terms shown here. Based on this infor- 
mation, construct the ANOVA table of summary findings 
and use the 0.025 level of significance in testing whether 
all of the treatment effects could be zero. 

SSTR = 252.1 SSE = 761.1 

12.23 For the experiment described in Exercise 12.6, what 
null and alternative hypotheses would be appropriate for the 
one-way ANOVA to examine the data? Given the following 
data, what conclusion would be reached at the 0.05 level of 
significance? From the F distribution tables, what is the 
approximate p-value for this test? (Use data file XR12023.) 

Absences, Sample Members from the College of 



Engineering Business Fine Arts 



8 5 9 

10 7 10 

6 6 10 

8 7 9 

4 7 7 

8 6 5 

8 13 

8 7 
1 



12.24 For the experiment described in Exercise 12.7, 
what null and alternative hypotheses would be appropri- 
ate for the one-way ANOVA to examine the data? If the 
data are as shown here, what conclusion would be 
reached at the 0.025 level of significance? From the F dis- 
tribution tables, what is the approximate p-value for this 
test? (Use data file XR12024.) 

Final Exam Scores of Students for 
Whom Computers and Software Were 

Used 73 82 72 75 88 73 74 81 86 83 
Not Used 78 67 71 72 69 81 64 79 73 73 

12.25 For the experiment described in Exercise 12.8, what 
null and alternative hypotheses would be appropriate for 
the one-way ANOVA to examine the data? If the data are 
as shown here, what conclusion would be reached at the 



0.01 level of sij 


mificance? (Use 


data file XR12025.) 


Breaking Strengths (Pounds) of 10 Rivets from 


Supplier A 


517 484 


463 


452 502 




447 481 


500 


485 566 


Supplier B 


479 499 


488 


430 482 




457 424 


488 


526 455 


Supplier C 


435 443 


480 


465 435 




430 465 


514 


463 510 


Supplier D 


526 537 


443 


505 468 




533 481 


477 


490 470 


12.26 For the following data from independent samples, 
could the null hypothesis that the population means are 
equal be rejected at the 0.05 level? (Use data file XR12026.) 


Sample 1 


Sample 2 


Sample 3 


Sample 4 


15.2 


10.2 


14.9 


11.5 


12.2 


12.1 


13.0 


13.1 


15.0 


8.5 


10.8 


11.5 


14.6 


8.1 


10.7 


6.3 


9.7 


10.9 


13.3 


12.0 




13.6 


10.6 


8.7 




8.5 




11.9 



12.27 A study has been carried out to compare the United 
Way contributions made by clerical workers from three 
local corporations. A sample of clerical workers has been 
randomly selected from each firm, and the dollar amounts 
of their contributions are as follows. (Use data file XR12027.) 



Firm 1 Firm 2 Firm 3 



199 108 162 

236 104 86 

167 153 160 

263 218 135 

254 210 207 

96 201 
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a. What are the null and alternative hypotheses for 
this test? 

b. Use ANOVA and the 0.05 level of significance in 
testing the null hypothesis identified in part (a). 

12.28 Students in a large section of a biology class 
have been randomly assigned to one of two graduate 
students for the laboratory portion of the class. A 
random sample of final examination scores has been 
selected from students supervised by each graduate 
student, with the following results. (Use data file 
XR12028.) 

Grad student A 78 78 71 89 80 93 73 76 

Grad student B 74 81 65 73 80 63 71 64 50 80 

a. What are the null and alternative hypotheses for 
this test? 

b. Use ANOVA and the 0.01 level of significance in 
testing the null hypothesis identified in part (a). 

12.29 Each of fifteen 60-watt lightbulbs has been 
randomly placed into an outlet for which the voltage is 

3 volts below the line voltage, 2 volts below the line volt- 
age, or equal to the line voltage. The following data are 
the lifetimes of the bulbs, expressed in days of continuous 
use. (Use data file XR12029.) 

Three volts below line 58 63 46 57 51 
Two volts below line 46 59 51 46 42 
Equal to line voltage 52 48 38 48 42 

a. What are the null and alternative hypotheses for this 
test? 

b. Use ANOVA and the 0.01 level of significance in 
testing the null hypothesis identified in part (a). 

c. For each sample, construct the 95% confidence 
interval for the population mean. 

12.30 Safety researchers, interested in determining 
whether the occupancy of a vehicle might be related to 
the speed at which the vehicle is driven, have observed 
the following speed (mph) measurements for two random 
samples of vehicles. (Use data file XR12030.) 

Driver alone 64 50 71 55 67 61 80 56 59 74 
At least one 

passenger 44 52 54 48 69 67 54 57 58 51 62 67 

a. What are the null and alternative hypotheses for 
this test? 

b. Use ANOVA and the 0.025 level of significance in 
testing the null hypothesis identified in part (a). 

c. For each sample, construct the 95% confidence 
interval for the population mean. 

12.31 For the following summary table for a one-way 
ANOVA, fill in the missing items (indicated by asterisks), 
identify the null and alternative hypotheses, then use the 
0.05 level of significance in reaching a conclusion regard- 
ing the null hypothesis. 



Variation 
Source 



Sum of 
Squares 



Degrees of 
Freedom 



Mean 
Square 



Treatments 6752.0 2 3376.0 *** 

Error 30178.0 

Total 36930.0 29 

12.32 For the following summary table for a one-way 
ANOVA, fill in the missing items (indicated by asterisks), 
identify the null and alternative hypotheses, then use the 
0.025 level of significance in reaching a conclusion 
regarding the null hypothesis. 

Variation Sum of Degrees of Mean 
Source Squares Freedom Square F 

Treatments 665.0 4 *** *** 

Error *** 60 *** 

Total 3736.3 

12.33 A researcher has used a standard test in measuring 
the job-satisfaction scores for employees randomly 
selected from three departments of a large firm. Interpret 
the results summarized in the following Minitab output. 

Analysis of Variance 

Source DF SS MS F p 

Factor 2 69.9 35.0 2.45 0.092 

Error 92 1311.2 14.3 

Total 94 1381.1 

Individual 95% CIs for Mean 
Based on Pooled StDev 

Level N Mean StDev h + 1 

DEPT1 30 17.640 3.458 ( * ) 

DEPT2 25 15.400 3.841 ( * ) 

DEPT3 40 16.875 3.956 C * ) 

Pooled StDev - 3.775 15.0 16.5 18.0 

12.34 A large investment firm claims that no discrepancy 
exists between the average incomes of its male and female 
investment counselors. Random samples consisting of 15 
male counselors and 17 female counselors have been 
selected, and the results examined through the use of 
ANOVA. In terms of this situation, interpret the compo- 
nents of the following Minitab output. 

Analysis of Variance 

Source DF SS MS F p 

Factor 1 142.8 142.8 2.70 0.111 

Error 30 1586.0 52.9 

Total 31 1728.8 

Individual 95% CIs for Mean 
Based on Pooled StDev 

Level N Mean StDev H 1 1 

MALES 15 47.654 6.963 ( * ) 

FEMALES 17 43.420 7.530 ( « ) 

Pooled StDev - 7.271 42.0 45.5 49.0 

12.35 Given the summary information in Exercise 12.33, 
verify the calculation of the 95% confidence interval for 
each of the treatment means. 

12.36 Given the summary information in Exercise 12.34, 
verify the calculation of the 95% confidence interval for 
each of the treatment means. 

12.37 In general, how do the assumptions, procedures, and 
results of one-way ANOVA compare with those for the 
pooled- variances f-test of Chapter 11? 
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12.38 Use the pooled-variances t-test of Chapter 11 in 
comparing the sample means for the data of Exercise 
12.28. Is the conclusion consistent with the one reached 
in that exercise? Explain. 



12.39 Use the pooled-variances t-test of Chapter 11 in 
comparing the sample means for the data of Exercise 
12.30. Is the conclusion consistent with the one reached 
in that exercise? Explain. 



THE RANDOMIZED BLOCK DESIGN 



12.4 



Purpose 

In the one-way, or completely randomized, ANOVA of the previous section, treat- 
ments are randomly assigned to all of the persons or other test units in the exper- 
iment. As a result, the composition of the treatment groups may be such that 
certain kinds of people or test units are overrepresented in some treatment groups 
and underrepresented in others, simply by chance. If the characteristics of the par- 
ticipants or test units have a strong influence on the measurements we obtain, we 
may be largely measuring the differing group compositions rather than the effects 
of the treatments. 

For example, let's assume we have randomly selected 12 citizens from a small 
community and these persons are to participate in an experiment intended to 
compare the night-vision effectiveness of four different headlamp designs. If we 
have treatment groups of equal size and randomly assign the treatments, as 
shown in part A of Table 12.5, it's likely that the representation of older drivers 



Measured: The distance (yards) at which a traffic sign can be read at night. 

Treatments: Four headlamp designs, 1, 2, 3, and 4. 

Age categories: Y, driver is under 30 years; M, 30-60 years; 0, >60 years. 

A. Completely Randomized Approach 

1 . Twelve drivers are randomly selected from the community (e.g., 5 Y, 3 M, and 4 O). 

2. One of the four treatments is randomly assigned to each person. With random 
assignment, the treatment groups could end up like this: 



Members of Treatment Group 



1 


2 


3 


4 


Y 


M 


V 


0 


Y 


0 


M 


Y 


0 


0 


Y 


M 



For night-vision measurement, 
treatment 2 would be at a distinct 
disadvantage and treatment 3 
would have a distinct advantage. 
B. Randomized Block Approach 

1 . Four members from each age group are randomly selected from the community 
(4 Y, 4 M, and 4 O). 

2. For each age category, treatments are randomly assigned to the members. Each 
treatment group will include one person from each age category: 



Members of Treatment Group 
12 3 4 



4 Y, treatments randomly assigned - 
4 M, treatments randomly assigned 
4 0, treatments randomly assigned - 



V 


Y 


Y 


Y 


M 


M 


M 


M 


0 


0 


0 


0 



TABLE 12.5 



In the one-way, or 
completely randomized, 
design of part A, our ability 
to compare the night-vision 
effectiveness of headlamp 
treatments may be reduced 
as the result of chance 
differences in the age- 
category compositions of 
the groups. The randomized 
block design in part B ensures 
that groups will be compa- 
rable in terms of the age 
categories of their members. 
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would not be exactly the same in all four groups. This would reduce our ability to 
compare the headlamp designs, since night vision tends to decrease with age. In 
this situation, the value of the variable that we really want to measure (i.e., the 
distance at which a headlamp enables a suburban traffic sign to be read) is being 
strongly influenced by another variable (age category) that has not been consid- 
ered in the experiment. 

In the randomized block design presented in this section, persons or test units 
are first arranged into similar groups, or blocks, before the treatments are as- 
signed. This allows us to reduce the amount of error variation. For example, in 
the night-vision experiment just described, use of the randomized block design 
would ensure that the treatment groups are comparable in terms of the age cate- 
gories of their members. Exerting this control over the age-category variable (now 
referred to as a blocking variable) allows us to better compare the effectiveness of 
the headlamp designs, or treatments. The resulting experiment would have a for- 
mat like that shown in part B of Table 12.5. 

Although we are controlling, or blocking, one variable, our primary concern 
lies in testing whether the population means could be the same for all of the treat- 
ment groups. Accordingly, the null and alternative hypotheses are 

H 0 : /jl^ = /jl 2 = ■ ■ ■ = fi t for treatments 1 through t 
H^; The population means are not equal. 



Model and Assumptions 

As with the one-way ANOVA of the previous section, the null and alternative hy- 
potheses can also be expressed in terms of an equation in which each individual 
observation is considered to be the sum of several components. For the random- 
ized block design, these components include both treatment and block effects: 



An individual observation 
measurement, this is 



or 



the observation in the z'th 
block for treatment ;. 



The effect of treatment 



The effect of block i 



IX + T, + ft + 



The overall population mean 
for all of the treatments 



Random error associated 
with the sampling process 



In this model, a measurement (x^) associated with block i and treatment /' is 
the sum of the actual population mean for all of the treatments (/a), the effect of 
treatment /' (t-), the effect of block i (ft), and a sampling error (e-y). We are only 
controlling for the effect of the blocking variable, not attempting to examine its 
influence. Thus, the equivalent null and alternative hypotheses are expressed only 
in terms of the treatments: 



H 0 : 



Tj = 0 for treatments / = 1 through t (Each treatment has no effect.) 

Hji Tj ¥= 0 for at least one of the / = 1 through t treatments. 

(One or more treatments has an effect. 
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In the randomized block design, there is just one observation or measure- 
ment for each block-treatment combination. For example, as part B of Table 
12.5 shows, headlamp treatment 1 would be administered to one person in the 
under-30 category, one person age 30-60, and one person over 60. Thus, each 
combination of block level and treatment represents a sample of one person or 
test unit. 

The randomized block design assumes (1) for each block-treatment combina- 
tion (i.e., each combination of the values of i and /), the sample of size 1 has been 
randomly selected from a population in which the values are normally dis- 
tributed; (2) the variances are equal for the values in these populations; and (3) 
there is no interaction between the blocks and the treatments. In the randomized 
block design, interaction is present when the effect of a treatment depends on the 
block to which it has been administered. For example, in the headlamp experi- 
ment in part B of Table 12.5, the performance of a given headlamp design relative 
to the others should be approximately the same, regardless of which age group is 
considered. The presence or absence of such interactions will be the subject of a 
later discussion. As with one-way ANOVA, an alternative method is available if 
one or more of these assumptions cannot be made. In the case of the randomized 
block design, this is the Friedman test of Chapter 14. 

Procedure 

The procedure for carrying out the randomized block design, which is summarized in 
Table 12.6 (page 430), is generally similar to the one-way ANOVA described in Table 
12.2. The following descriptions correspond to parts A through D of Table 12.6. 

Part A: The Null and Alternative Hypotheses 

The null and alternative hypotheses are expressed in terms of the equality of the 
population means for all of the treatment groups. 

Part B: The Format of the Data to Be Analyzed 

The data can be listed in tabular form, as shown, with a separate column for each 
of the t treatments and a separate row for each of the n blocks. Each cell (combi- 
nation of a block i and a treatment ;') contains just one observation. As with one- 
way, or completely randomized, ANOVA, the grand mean (x) is the mean of all of 
the observations that have been recorded. 

In addition to listing a format for the x- values, part B of Table 12.6 shows the 
block means and the treatment means. For example, x 1 is the mean for block 1, 
with the dot (.) in the subscript indicating the mean is across all of the treatments. 
Likewise, x 3 is the mean for treatment 3, with the dot portion of the subscript in- 
dicating the mean is across all of the blocks. 

Part C: The Calculations for the Randomized 
Block Design 

Part C of Table 12.6 describes the specific computations for the randomized block 
design; each quantity is associated with a specific source of variation within the 
sample data. They correspond to the x t - = fi + t- + /3 ; + model discussed pre- 
viously, and their calculation and interpretation are similar to their counterparts 
in the one-way, or completely randomized, design. 
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TABLE 12.6 



Summary of the 
randomized block design 



1 ton 



Ml = M 2 



H 1 : The population means are not all the same. 



Treatments, / = 1 to t 



A n2 A n3 
X-, X 3 



Treatment means 





1 


2 


3 


f 






X 11 


X 12 


X 13 


■ X 1t 


X 1. 




X 21 


X 22 


X 23 ■ 


■ X 2t 


X 2. 


Blocks, 


X 31 


X 32 


X 33 ■ 


■ X 3t 


X 3. 



Block means 



Xjj = the observation 
for the /'th 
block and the 
y'th treatment 
x - grand mean, 
mean of all the 
observations 

t n 

22*. 



* A/ 



'J 



Source of 
Variation 



Sum of Squares 



Degrees of 
Freedom 



Mean Square 



F-Ratio 



Treatments, 
77? 



SSTR = n^Ofj - x) 2 
J=1 



t- 1 



MSTR = 



SSTR 
t - 1 



F = 



MSTR 
MSE 



Blocks, B 



SSB = tj?(x A - x) 2 



n - 1 



/WS6 = 



S55 
n - 1 



F = 



MSB 
~MSE 



Sampling 
Error, E 

Total, T 



SSE = SST - SSTR 
- SSB 

SST=±±(x ir =xy 



(f- D(n - 1) 



tn - 1 



MSE = 



SSE 



(t- 1Xn - 1) 



lfF = W s>F[a ' (t 



1 ), (f - 1 )(n - 1 )] reject H Q at the a level. 



The Sum of Squares Terms: Quantifying the Sources of Variation 

• Treatments, TR SSTR is the sum of squares that reflects the amount of vari- 
ation between the treatment means and the grand mean, x. SSB is the sum of 
squares that reflects the amount of variation between the block means and the 
grand mean, x. 

• Sampling error, E SSE is the sum of squares that reflects the total amount of 
variation that is due to sampling error. It is most easily calculated by first de- 
termining SSTthen subtracting SSTR and SSB, 
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• Total variation, T SST is the sum of squares that reflects the total amount of 
variation in the data, with each data value being compared to the grand mean, 
then the differences are squared and summed. 

Making the Amounts from the Sources of Variation Comparable MSTR, 
MSB, and MSE are the mean squares for treatments, blocks, and error, respectively. 
As part C of Table 12.6 shows, each is obtained by dividing the corresponding 
sum of squares by an appropriate value for df. 

Part D: The Test Statistic, the Critical Value, 
and the Decision Rule 

The Test Statistic The test statistic is MSTR/MSE. As with the one-way or com- 
pletely randomized design, MSTR has estimated the common variance (a 1 ) based 
on variation between the treatment means, while MSE has estimated the common 
variance based on variance within the treatment groups. The test statistic is the 
ratio of these separate estimates of the common variance. 

The Critical Value and the Decision Rule The test is right-tail, and, for a given 
level of significance (a), we will reject H Q : /u^ = /jl 2 = ■ ■ ■ = f* t if the calculated 
value of F is greater than F[a, (t — 1), (t — l)(n — 1)]. In referring to the F distri- 
bution table, we use the v t = (t — 1) column and the v 2 = (t — l)(n — 1) row in 
identifying the critical value. 



Example 




; 





Randomized Block ANOVA Procedure 



To illustrate the application of the randomized block design, we will use an exam- 
ple that corresponds to our introductory discussion for this procedure. Computer 
outputs and their interpretation will then be presented. 

The new-product development team for an automotive headlamp firm has 
four different headlamp designs under consideration. A test is set up to com- 
pare their effectiveness in night-driving conditions, and the measurement of in- 
terest is the distance at which a suburban traffic sign can be read by the driver. 
Recognizing that younger drivers tend to have better night vision than older 
drivers, the team has planned the experiment so that age group will be a block- 
ing variable. Four persons from each age group (or block) are selected, then the 
treatments are randomly assigned to members from each block. When each 
person is subjected to one of the headlamp designs, the distance at which the 
traffic sign can be read is measured, with the results in Table 12.7 (page 432). 
At the 0.05 level, could the headlamp designs be equally effective? The data are 
also in file CX12LITE. 

SOLUTION 

For this test, the null hypothesis is H Q : /a 1 = fi 2 = = ^4 for the four headlamp 
designs, or treatments. According to the null hypothesis, the treatment popula- 
tion means are the same — i.e., the four headlamp designs are equally effective. 
The alternative hypothesis holds that the population means are not equal and 
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TABLE 12.7 



The data and the results of 
preliminary calculations for 
the randomized block de- 
sign to determine whether 
four headlamp designs 
could be equally effective. 
The blocks are according to 
age group, with / = 1 for 
the four under-30 drivers, 
/ = 2 for the four 30-60 
drivers, and / = 3 for the 
four over-60 drivers. Data 
are the distance (in yards) at 
which a suburban traffic 
sign can be read. 



Treatments, headlamps / = 1 to 4 





1 


2 


3 


4 


Blocks, 


90 


87 


93 


85 


age groups 


86 


79 


87 


83 


/ = 1 to 3 


76 


74 


77 


73 



84.00 80.00 85.67 80.33 



Treatment means 



88.75 
83.75 
75.00 





Block means 




\ 





The mean for block / = 3 is 
76 + 74 + 77 + 73 



75.00 yards 



The mean for treatment group j = 2 is 

87 + 79 + 74 „„ „„ 
x 2 = = 80.00 yards 



The grand mean is the mean of all the data values, or 

- 90 + 86 + 76 + ■ ■ ■ + 83 + 73 „ 

x = = 82.50 yards 

12 y 



that the headlamp designs are not equally effective. Preliminary calculations, 
such as the block means, treatment means, and the grand mean are shown in 
Table 12.7. We will now proceed with the remaining calculations, described in 
part C of Table 12.6. 

Treatment Sum of Squares, SSTR 

The treatment sum of squares is calculated in a manner analogous to that of the 
one-way ANOVA method. In the randomized block design, the treatment groups 
are of equal size, and in this experiment each group has n = 3 persons. 

SSTR = »2 (*.; - 
;=i 



(84.00 - 82.50) 2 + (80.00 - 82.50) 2 
.+(85.67 - 82.50) 2 + (80.33 - 82.50) 2 



and 



SSTR = 69.77 



Block Sum of Squares, SSB 

The sum of squares comparing block means with the overall mean is calcu- 
lated as 

SSB = *2 (*< " x) 2 

i=l 

= 4[(88.75 - 82.50) 2 + (83.75 - 82.50) 2 + (75.00 - 82.50) 2 ] 



anc 



SSB = 387.50 
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Total Sum of Squares, SST 

SST considers the individual data values. In this sum of squares, each x- value is 
compared to the grand mean, x: 



t n 

SST =22 

;=li=l 


(x ;/ - x ) 2 










' (90 


- 82.50) 2 " 




" (87 


- 82.50) 2 




+(86 


- 82.50) 2 


+ 


+ (79 


- 82.50) 2 




_+(76 


- 82.50) 2 _ 




_ + (74 


- 82.50) 2 




" (93 


- 82.50) 2 " 




" (85 


- 82.50) 2 


+ 


+(87 


- 82.50) 2 


+ 


+ (83 


- 82.50) 2 




_+(77 


- 82.50) 2 . 




_+(73 


- 82.50) 2 



and 



SST = 473.00 



Error Sum of Squares, SSE 

This quantity is most easily computed by first calculating the preceding quantities, 
then using the relationship SST = SSTR + SSB + SSE, or 



SSE = SST - SSTR - SSB = 473.00 - 69.77 - 387.50 and 



SSE = 15.73 



Treatment Mean Square (MSTR) and Error Mean Square (MSE) 

SSTR and SSE are now divided by their respective degrees of freedom so that 
(1) they will be comparable and (2) each will be a separate estimate of the com- 
mon variance that the treatment group populations are assumed to share. Recall 
that we have t = 4 treatments and n = 3 blocks. 

The estimate of <r 2 that is based on the between-treatment variation is 



MSTR 



SSTR 69.77 



t 



and 



MSTR = 23.26 



The estimate of a 2 that is based on the within-treatment, or sampling-error 
variation is 



MSE 



SSE 



15.73 



(t-l)(»-l) (4-l)(3-l) 



and 



MSE = 2.62 



The Test Statistic, F 

The test statistic is the ratio of the two estimates for a 2 , or 



MSTR 23.26 



MSE 



2.62 



and 



We have now generated the information described in part C of Table 12.6, 
and the results are summarized in the randomized block ANOVA table shown 
in Table 12.8 (page 434). Note that Table 12.8 includes the listing of the mean 
square [MSB = SSB/(n - 1) = 193.75] and the P-ratio (F = MSB/MSE = 73.95) 
for the blocking variable, neither of which was calculated previously. These 
were omitted from our previous calculations because our intent in the random- 
ized block design is only to control for the effect of the blocking variable, not 
to investigate its effect. In Section 12.5, we will examine the simultaneous 
effects of two independent variables in a method called two-way analysis of 
variance. 
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TABLE 12.8 



From the data in Table 12.7 
and the computations in 
the text, we can construct 
this standard table describ- 
ing the results of the 
randomized block ANOVA 
test. The format is similar to 
that shown in part C of 
Table 12.6. Because our 
calculations were rounded 
to two decimal places, the 
results shown here will 
differ slightly from those 
in the corresponding com- 
puter printouts. 



Variation 
Source 


Sum of Squares 


Degrees of 
Freedom 


Mean Square 


F 


TraatrYiDntc 
1 1 ca LI I Ifcrl 1 Lz> 


fiQ 77 


3 
D 




R RR 
o.oo 


Blocks 


387.50 


2 


193.75 


73.95 


Error 


15.73 


6 


2.62 




Total 


473.00 


11 







The Critical Value of F and the Decision 

The test is being carried out at the a = 0.05 level, and the critical value of F can 
be found from Appendix A as 

Critical value of F = F(a, v v v 2 ) 

The df associated with the numerator of F is 

v 1 = (t-l) or (4 - 1) = 3 

The df associated with the denominator of F is 

v 2 =(t-l)(n-l) 

= ( 4 -1)(3-1) = 6 

For a = 0.05, v 1 = 3 and v 2 = 6, the critical F is F (0.05, 3, 6) = 4.76. The cal- 
culated value (F = 8.88) exceeds the critical value, and, at the 0.05 level, we are able 
to reject the null hypothesis that the population means are equal. At this level of sig- 
nificance, our conclusion is that the headlamp treatments are not equally effective. 

Using the F distribution tables in Appendix A, we can narrow down the p- value for 
the test. For the 0.05, 0.025, and 0.01 levels, the respective critical values are 4.76, 
6.60, and 9.78, and the calculated F (F = 8.88) falls between the critical values for the 
0.025 and 0.01 levels. Based on our statistical tables, the most accurate statement we 
can make about the p-value for the test is that it is somewhere between 0.025 and 0.01. 



Figure 12.2 is a multiple plot of the measurements according to the treatments 
and blocks. Each line connects the distances measured for persons in one of the 
age-group blocks. Regardless of the headlamp design, the distance at which the 
traffic sign could be read was longest for the youngest person exposed to that 
treatment, and shortest for the oldest person exposed to the treatment. This fur- 
ther demonstrates the appropriateness of using the randomized block design with 
age group as a blocking variable. 

Testing the Effectiveness of the Blocking Variable 

As mentioned earlier, our primary intent is to control for the effect of the block- 
ing variable, not to measure its effect. If we do wish to examine the effectiveness 
of the blocking variable, we can use this procedure: 

Hypotheses for testing the effectiveness of the blocking variable: 

H Q : The levels of the blocking variable are equal in their effect. 
Hj: At least one level has an effect different from the others. 

Calculated value of F = MSB/MSE, computed as shown in Table 12.6 
Critical value of F = (a, v^, v 2 ) 



Chapter 12: Analysis of Variance Tests 



435 




where v i = (n — 1), ^associated with the numerator, and 

v 2 = (t — l)(n — 1), df associated with the denominator 
and n = number of blocks, t = number of treatments 

Decision rule: If calculated F > F(a, v v v 2 ), reject H 0 at the a level. 

Applying this procedure to the blocking variable (age), the calculated F-ratio 
is F = MSB/MSE = 73.95, with degrees of freedom v x = (n - 1) = 2 for the 
numerator and v 2 = (t — l)(n — 1) = 6 for the denominator. These are listed in 
Table 12.8 and were computed as shown in Table 12.6. Using the F distribution 
tables in Appendix A, we see that this calculated F easily exceeds even the critical 
value for the 0.01 level of significance, F(0.01, 2, 6) = 10.92. The null hypothesis 
for the blocking variable is "the blocking levels are equal in their effect," and it 
would be rejected at the 0.01 level of significance. The results suggest that the 
blocking variable has been quite effective. 

Computer Outputs and Interpretation 

Computer Solutions 12.2 includes the Excel output for the randomized block 
ANOVA test we've just carried out. The treatment and block means are listed, as 
is the grand mean, and the output (subject to rounding) closely matches our re- 
sults in Table 12.8. Excel lists the p-value as 0.013 for our test of whether the 
population means could be equal for the four treatments. Note that Excel has also 
listed an f-ratio (73.421) and a p-value (6.05 X 10" 5 , or 0.0000605) for a test of 





computer solutions 1 2.2 
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Randomized Block Analysis of Variance 
EXCEL 





A 


B 


C 


D 


E 


F 


G 


1 




Lamp 1 


Lamp 2 


Lamp 3 


Lamp 4 






2 


Age block 1 


90 


87 


93 


85 






3 


Age block 2 


86 


79 


87 


83 






4 


Age block 3 


76 


74 


77 


73 






5 
















6 


Anova: Two-Factor Without Replication 










7 
















8 


SUMMARY 


Count 


Sum 


Average 


Variance 






9 


Age block 1 


4 


355 


88.750 


12.250 






10 


Age block 2 


4 


335 


83.750 


12.917 






1 1 


Age block 3 


4 


300 


75.000 


3.333 






12 
















13 


Lamp 1 


3 


252 


84.000 


52.000 






14 


Lamp 2 


3 


240 


80.000 


43.000 






15 


Lamp 3 


3 


257 


85.667 


65.333 






16 


Lamp 4 


3 


241 


80.333 


41.333 






17 
















18 
















19 


ANOVA 














20 


Source of Variation 


SS 


df 


MS 


F 


P-value 


Fcrit 


21 


Rows 


387.500 


2 


193.750 


73.421 


6.0496E-05 


5.143 


22 


Columns 


69.667 


3 


23.222 


8.800 


0.013 


4.757 


23 


Error 


15.833 


6 


2.639 








24 
















25 


Total 


473.000 


11 











Excel randomized block ANOVA 



1. For the sample data in Table 12.7 (file CX12LITE.XLS), with the data and labels in columns A through E, as shown 
above: Click Tools. Click Data Analysis. Click Anova: Two-Factor Without Replication. Click OK. 

2. Enter A1:E4 into the Input Range box. Select Labels. Specify the significance level for the test by entering 0.05 into 
the Alpha box. Select Output Range and enter A6 into the box. Click OK. The results are as shown above. 

MINITAB 

Minitab randomized block ANOVA 

Two-way ANOVA: Distance versus Block, Treatmnt 

Source DF SS MS F P 

Block 2 387.500 193.750 73.42 0.000 

Treatmnt 3 69.667 23.222 8.80 0.013 
Error 6 15.833 2.639 

Total 11 473.000 

S = 1.624 R-Sq = 96.65% R-Sq(adj) = 93.86% 

1. For the sample data in Table 12.7 (file CX12LITE.MTW), with the distances in C1, the block identifications in C2, and 
the treatment identifications in C3: Click Stat. Select ANOVA. Click Two-way. 

2. Enter C1 into the Response box. Enter C2 into the Row factor box. Enter C3 into the Column factor box. Click OK. 
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the block effects. In terms of our model, this would be a test of (H Q : /3- = 0, i = 1 
through n) versus (H^. /3 [ + 0 for at least one of the n blocks, or age categories). 

Although the Excel printout indicates that the blocking variable is exerting an 
extremely strong influence, remember that our purpose was only to control for 
the blocking variable in examining the effect of the four headlamps, or treat- 
ments. That the blocking variable is so highly influential comes as no surprise, 
since its impact on night vision is the reason why the randomized block design 
was used in the first place. 

In examining the Excel and Minitab printouts, our purpose in this random- 
ized block ANOVA was not to examine the impact of the blocking variable (age 
category) as an independent variable. Also, our model does not include any con- 
sideration of possible interaction between blocks and treatments, as will the two- 
way ANOVA in the next section. 

Minitab can include confidence intervals for the treatment means, as was the 
case for the one-way ANOVA printout in the Minitab portion of Computer Solu- 
tions 12.1. With the randomized block design, however, the confidence interval for 
a treatment mean would not be very meaningful. Unlike the one-way, completely 
randomized design, we have tampered with the randomness of the treatment groups 
by placing drivers into blocks consisting of persons whose ages are similar. 

Randomized Block ANOVA and the 
Dependent-Samples t-Test 

In the previous section, we pointed out that one-way ANOVA with two treatment 
groups is equivalent to the two-tail, pooled-variances West in Chapter 11. When 
there are just two treatments, the randomized block ANOVA is also equivalent to 
one of the tests in Chapter 11, in this case the two-tail, dependent-samples £-test. 
In applying the randomized block ANOVA to data representing dependent sam- 
ples, each pair of x 1 and x 2 values would represent a separate block. Because the 
tests are equivalent when they compare two treatments, they will provide the 
same p-value when applied to a given set of matched pairs. 




Note: The computer is highly recommended when using 
ANOVA techniques. For exercises involving raw data, 
data files are shown in parentheses. 

12.40 What is the purpose of the randomized block 
design? 

12.41 What assumptions are required in using the 
randomized block design? 

12.42 How does the randomized block design differ from 
the one-way, completely randomized design? 

12.43 For the randomized block ANOVA model, 

Xx = + r- + fi i + e- t :, explain the meaning of each term 
in the equation, then use the appropriate terms from the 
equation in presenting and explaining the possible sets of 
null and alternative hypotheses that can be tested. 



12.44 In the randomized block design, what benefit is 
gained from blocking? 

12.45 In one-way ANOVA, we were able to determine 
the confidence interval for the population value of each 
treatment mean. Why is this not appropriate for the treat- 
ments being examined in the randomized block design? 

12.46 For a randomized block experiment in which there 
are three treatments and two blocks, the calculated value 
of F = MSTR/MSE is 16.1. Using the 0.05 level of signif- 
icance, what conclusion would be reached? Based on the 
F distribution tables, what is the most accurate statement 
that could be made about the p-value for the test? 

12.47 For a randomized block experiment in which 
there are five treatments and four blocks, the calculated 
value of F = MSTR/MSE is 3.75. Using the 0.05 level 
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of significance, what conclusion would be reached? 
Based on the F distribution tables, what is the most ac- 
curate statement that could be made about the p-value 
for the test? 

12.48 For a randomized block experiment having three 
treatments and three blocks, the results include the follow- 
ing sum of squares terms. Based on this information, con- 
struct the appropriate table of ANOVA summary findings 
and use the 0.05 level of significance in testing whether all 
of the treatment effects could be zero. 

SSFR = 394.0 SSB = 325.0 
SSE = 180.0 

12.49 A randomized block design has three different in- 
come groups as blocks, and members of each block have 
been randomly assigned to the treatment groups shown 
here. For these data, use the 0.01 level in determining 
whether the treatment effects could both be zero. Using 
the 0.01 level, evaluate the effectiveness of the blocking 
variable. (Use data file XR12049.) 

Treatment 



Block 



A 
B 
C 



46 
37 
44 



31 
26 
35 



12.50 A randomized block design has five different age 
groups as blocks, and members of each block have been 
randomly assigned to the treatment groups shown here. 
For these data, use the 0.05 level in determining whether 
the treatment effects could all be zero. Using the 0.01 
level, evaluate the effectiveness of the blocking variable. 
(Use data file XR12050.) 

Treatment 





1 


2 


3 


4 


A 


51.2 


50.3 


47.2 


42.0 


B 


41.0 


37.6 


37.0 


35.7 


C 


57.5 


56.9 


54.7 


49.2 


D 


51.2 


49.3 


46.9 


50.9 


£ 


36.9 


34.6 


37.2 


33.2 



Block 



12.51 The continuous operating lifetime of a battery 
depends very much on the electrical demands of the de- 
vice in which the battery is installed. In a test of three 
battery brands, four examples of each brand are 
randomly selected. Three examples of four different 
toys are also randomly selected, randomly equipped 
with one of the battery brands, then each is operated 
until it stops. The times, in hours, are shown here. At 
the 0.025 level of significance, test whether the battery 
treatment effects could all be zero. Using the 0.01 level, 
evaluate the effectiveness of the blocking variable. (Use 
data file XR12051.) 



Hours of Continuous 
Performance When 
Installed in 



Battery Brand 





"Jumper 


"Drummer 




Kabult 


Duck" 


A 


6.1 


7.7 


B 


5.6 


5.7 


C 


3.4 


4.3 




"Talking 


"Toddling 




Teeth" 


Turtle" 


A 


7.4 


5.5 


B 


6.9 


4.9 


C 


6.4 


5.0 



Battery Brand 



12.52 Three racquetball players, one from each skill 
level, have been randomly selected from the membership 
list of a health club. Using the same ball, each person hits 
five serves, one with each of five racquets, and using the 
racquets in a random order. Each serve is clocked with a 
radar gun, and the results are shown here. With player 
skill level as a blocking variable, use the 0.025 level of 
significance in determining whether the treatment effects 
of the five racquets could all be zero. Using the 0.01 level, 
evaluate the effectiveness of the blocking variable. (Use 
data file XR12052.) 

Player Skill Level 



Beginner Intermediate Advanced 



A 


73 mph 


64 mph 


83 mph 


B 


63 


72 


89 


C 


51 


54 


72 


D 


56 


81 


86 


E 


69 


90 


97 



Racquet 
Model 



12.53 Two different machine control-button 
configurations are being tested. Recognizing that some 
operators are faster than others, management uses seven 
operators as blocks, then has each operator carry out a 
standard operation using each control-button layout. 
Each operator proceeds from one treatment to the next in 
a random order, and the times for completion of the oper- 
ation are shown here. At the 0.05 level, determine 
whether the treatment effects of the two layouts could be 
zero. (Use data file XR12053.) 

Control-Button 
Configuration 



Operator 





1 




2 


A 


5 


6 


seconds 


B 


9 


6 




C 


1 1 


8 




D 


13 


10 




E 


10 


7 




F 


9 


10 




G 


12 


9 
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12.54 Given the following summary table for a random- 
ized block design, fill in the missing items (indicated by 
asterisks), identify the null and alternative hypotheses in 
terms of the treatment effects, then use the 0.025 level of 
significance in reaching a conclusion regarding the null 
hypothesis. 

Variation Sum of Degrees of Mean 
Source Squares Freedom Square F 

Treatments 30.89 2 *** *** 

Blocks 80.22 2 

Error 7.11 

Total 118.22 8 

12.55 Given the following summary table for a random- 
ized block design, fill in the missing items (indicated by 
asterisks), identify the null and alternative hypotheses in 
terms of the treatment effects, then use the 0.05 level of 
significance in reaching a conclusion regarding the null 
hypothesis. 

Variation Sum of Degrees of Mean 
Source Squares Freedom Square F 

Treatments 35.33 

Blocks 134.40 2 

Error 16.27 8 

Total 186.00 14 



12.56 The following Minitab output summarizes the 
results of a randomized block ANOVA in which the 
blocks consisted of 4 different income categories, the 
treatments were 3 different kinds of appeals for contri- 
butions to a national charity, and the variable measured 
(dollars) was the amount contributed to the charity. 
State the null and alternative hypotheses in terms of 
the treatment effects, then use the 0.025 level of 
significance in reaching a conclusion regarding the 
null hypothesis. 

Analysis of Variance for Dollars 



Source 


DF 


SS 


MS 




F 




P 


Block 


3 


75613 


25204 


9. 


08 


0 


012 


Treatment 


2 


36422 


18211 


6 


56 


0 


031 


Error 


6 


16663 


2777 










Total 


11 


128698 













12.57 Compare the randomized block ANOVA with 
two treatments to the dependent-samples t-test of 
Chapter 11. 

12.58 Apply the dependent-samples J-test to the 

data in Exercise 12.49. Is the conclusion consistent with 
the one reached in that exercise? Explain. 

12.59 Apply the dependent-samples f-test to the data in 
Exercise 12.53. Is the conclusion consistent with the one 
reached in that exercise? Explain. 



TWO-WAY ANALYSIS OF VARIANCE 12.5 



Purpose 

In the one-way ANOVA of Section 12.3, our emphasis was on determining the ef- 
fect of a single factor (tax-preparation method) on the dependent variable, or 
measurement. In the randomized block ANOVA of Section 12.4, we included a 
blocking variable (age category), but this was only for the purpose of exerting im- 
proved control over the examination of the single factor of interest (headlamp de- 
sign). This section considers two-way analysis of variance, a method that simulta- 
neously examines (1) the effect of two factors on the dependent variable, along 
with (2) the effects of interactions between the different levels of these two fac- 
tors. Two-way ANOVA is the most basic form of the factorial experiment, one in 
which there are two or more factors and the treatments represent all possible 
combinations of their levels. 

Unlike the randomized block ANOVA, two-way ANOVA examines interac- 
tions between different levels of the factors, or independent variables. In a two-fac- 
tor experiment, interaction exists when the effect of a level for one factor depends 
on which level of the other factor is present. Medical professionals are always con- 
cerned about the interactive effects of drugs, whether prescribed or unprescribed. 
For example, the slight drowsiness that would be caused by either a small amount 
of alcohol or a small amount of cough medication can become major drowsi- 
ness when both alcohol and cough medication have been consumed. So-called 
binary chemical-warfare weapons also exemplify the phenomenon of interaction. 
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Although each of the separate components is harmless by itself, they become a 
deadly combination when the weapon is activated and they are mixed together. 

In the randomized block example of the preceding section, there was no 
need to consider interactions between the factor levels and the blocks. As Fig- 
ure 12.2 showed, the rank order of viewing distance according to age group 
remained the same for all four headlamp designs. Regardless of the headlamp 
design considered, the youngest driver had the longest vision, the oldest driver 
the shortest. However, one or more interaction effects would have been present 
if there had been any crossovers — for example, if the oldest driver exposed to 
headlamp design 4 had been able to see farther than the youngest driver who 
was exposed to that headlamp. The concept of interaction will be further ex- 
amined in the context of the two-factor ANOVA example of this section. 

Model and Assumptions 

In the two-way ANOVA, random assignments are made such that two or more 
persons or other test units are subjected to each possible combination of the fac- 
tor levels. The number of persons or test units within each of these combinations 
(or cells) is referred to as r = the number of replications, with r s 2. In this sec- 
tion, we will consider only the balanced design, where there is an equal number 
of replications (r) within each combination of factor levels. In other words, within 
each combination of levels, there will be k = 1 through r observations. As with 
the other ANOVA designs we've examined, each individual observation is consid- 
ered to be the sum of several components: 



An individual observation 
or measurement, this is 
the £th observation 
within the zth level for 
factor A and the /th 
level for factor B. 



Random error associated 
with the sampling process 



c >ik 



1 



The overall population mean 



The effect of the zth 
level of factor A 



L 



T 

ijk 



The effect of the interaction 
between the zth level of factor A 
and the /th level of factor B 



The effect of the /th 
level of factor B 



In this model, the kth observation for level i of factor A and level ;' of factor B 
is x ( y£, and it is viewed as the sum of the overall population mean (/jl), the effect of 
the zth level of factor A (a t ), the effect of the /th level of factor B (/3 ; ), the interac- 
tion effect between level i of factor A and level / of factor B [(a/3)-], plus random 
error due to sampling (e^). There are three sets of null and alternative hypothe- 
ses to be tested. 

1. Testing for main effects, factor A: 



1 through a. 



Hq. a ; = 0 for each level of factor A, with i 
(No level of factor A has an effect. ) 

H^, a [ + 0 for at least one value of z, with z = 1 through a. 
(At least one level of factor A has an effect.) 
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2. Testing for main effects, factor B; 

Hq. fi- = 0 for each level of factor B, with / = 1 through b. 
(No level of factor B has an effect.) 

Hji j3- ¥= 0 for at least one value of /, with / = 1 through b. 
(At least one level of factor B has an effect. ) 

3. Testing for interaction effects between levels of factors A and B: 

Hq. (a/3)-y = 0 for each combination of i and /'. 

(There are no interaction effects.) 

Hji (aj8)-- # 0 for at least one combination of i and /. 

(At least one combination of i and /' has an effect.) 

The assumptions for the two-way ANOVA design are similar to those for 
the randomized block design, with blocks being replaced by factor levels. There 
are a X b factor-level combinations, or cells, and it is assumed that the r observa- 
tions in each cell have been drawn from normally distributed populations with 
equal variances. The assumption of no interactions no longer applies. If one or 
more of these assumptions cannot be made, the Friedman test of Chapter 14 can 
be applied. 

Procedure 

Table 12.9 (page 442) summarizes the procedure for carrying out the two-way 
ANOVA design. Although the process is generally similar to the randomized 
block method of Table 12.6, there are now three sets of null and alternative 
hypotheses to be tested, interaction effects will be examined, and each observa- 
tion is identified with three subscripts instead of two. The following descriptions 
correspond to parts A-D of Table 12.9. 

Part A: The Null and Alternative Hypotheses 

The null and alternative hypotheses are expressed in terms of the main effects 
(factors A and B) and interaction effects (combinations of levels of these factors). 

Part B: The Format of the Data to Be Analyzed 

The data can be listed in tabular form, as shown, with each cell identified as a 
combination of the ith level of factor A with the ;th level of factor B. Each cell 
contains r observations, or replications. For each level of each factor, a mean is 
calculated. For example, 3c 2 is the mean for all observations that received the sec- 
ond level of factor A. Likewise, x 1 is the mean for all observations that received 
the first level of factor B. As in previous analyses, the grand mean (3c) is the mean 
of all the observations that have been recorded. 

Part C: The Calculations for the Two-Way ANOVA Design 

Part C of Table 12.9 describes the specific computations, with each quantity being 
associated with a specific source of variation within the sample data. They corre- 
spond to the Xj- k = fx, + a ; . + /?■ + (aj8) ■■ + e--^ model discussed previously. 

The Sum of Squares Terms: Quantifying the Sources of Variation 

• Variation due to the factors SSA is the sum of squares reflecting variation 
caused by the levels of factor A. SSB is the sum of squares reflecting variation 
caused by the levels of factor B. 
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TABLE 12.9 



Summary of the two-way 
ANOVA design 



A. Null and Alternative Hypotheses to Be Tested 



1 . Main effects, factor A: 



0 



for each value of /' with / : 
for at least one value of /' 

2. Main effects, factor B: 

H Q : Pj = 0 for each value of j, with / ■ 
H 1 : j3 ■ ¥= 0 for at least one value of j 

3. Interaction effects: 



through a 



through b 



H 0 . (a/3) ; ; = 0 



H, 



for each combination of / and j 

for at least one combination of / and j 



B. Da fa Format 



Factor A, 
i = 7 to a 



/' = 7 



/ = 2 



i = a 



Factor B, j = 1 to b 



V11 



A 212 



A 21r 



A 221 



A 222 



A 22r 



/=1 


i = 2 


/=/> 


x 111 




x l21 




X 1t)1 


X 112 




X 122 




X 1t)2 






X 12r 




x 1far 



, 2b1 



"262 



"2br 



x a11 




x a21 




x afa1 


x a12 




x a22 




x afa2 


x a1r 




x a2r 




x abr 



Means for levels 
of factor A 



x .b. 




Means for levels 
of factor B 


-< — 



V 
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C. Calculations 



Source of 
Variation 



Sum of Squares 



Degrees of 
Freedom 



Mean Square 



F-Ratio 



Factor A 



SSA = rb^(x L -x) 2 



a - 1 



MSA 



SSA 
a - 1 



MSA 
MSE 



Factor B 



SSB = ra^(xj - x) 2 

7=1 



MSB = 



SSB 
b - 1 



F = 



MSB 
~MSE 



Interaction 
between A 
and B 



SSAB = SST - SSA - SSB - SSE 



(a - - 1) 



MSAB = 



SSAB 



(a - - 1) 



F = 



MSAB 
MSE 



Sampling 
Error, E 



SSE=±i±(x lJk -x ij ) 2 

;=iy=iit=i 



ab(r- 1) 



MSE 



SSE 
ab(r- 1) 



Total, T 



/=i/=iit=i 



abr - 1 



D. Critical values and decisions 



1 . Main effects, factor A: 



MSA 



Reject H n : all a, = 0, if F = is > F[a, (a - 1 ), ab(r - 1 ) 

u ' MSE 



2. Main effects, factor B: 



MSB 



Reject H 0 : all Pj = 0, if F = —7: is > F[a, (b - 1 ), ab(r - 1 ) 
3. Interaction effects: 

Reject H 0 : all (o^)j = 0, if F = ! 



is > F[a, (a - - 1), afa(r- 1)]. 
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• Variation due to interactions between factor levels SSAB is the sum of 
squares reflecting variation caused by interactions between the levels of factors 
A and B. This is most easily calculated by first computing the other sum of 
squares terms, then SSAB = SST — SSA — SSB — SSE. 

• Sampling error, E SSE is the sum of squares reflecting variation due to sam- 
pling error. In this calculation, each data value is compared to the mean of its 
own cell. 

• Total variation, T SST is the sum of squares reflecting the overall variation 
in the data, with each observation being compared to the grand mean, then 
the differences squared and summed. 

Making the Amounts from the Sources of Variation Comparable MSA, 
MSB, MSAB, and MSE are the mean squares for factors A and B, interactions be- 
tween A and B, and error, respectively. As part C of the table shows, each is ob- 
tained by dividing the corresponding sum of squares by the number of degrees of 
freedom (df) associated with this sum of squares. 

Part D: Test Statistics, Critical Values, and Decision Rules 

For each null hypothesis to be tested, a separate test statistic is calculated. The nu- 
merator and denominator are separate estimates of the variance that the cell pop- 
ulations are assumed to share. For each null hypothesis, the critical value of F will 
depend on the level of significance that has been selected, and on the number of 
degrees of freedom associated with the numerator and denominator of the F 
statistic. In testing each H Q , the values of v 1 and v 2 are shown in the table. 

If a calculated F exceeds F[a, v v v 2 ], the corresponding null hypothesis will be 
rejected. 





example 







Two-way AN OVA Procedure 



An aircraft firm is considering three different alloys for use in the wing construction 
of a new airplane. Each alloy can be produced in four different thicknesses (1 = 
thinnest, 4 = thickest). Two test samples are constructed for each combination of 
alloy type and thickness, then each of the 24 test samples is subjected to a labora- 
tory device that severely flexes it until failure occurs. For each test sample, the num- 
ber of flexes before failure is recorded, with the results shown in Table 12.10. At the 
0.05 level of significance, examine (1) whether the alloy thickness has an effect on 
durability, (2) whether the alloy type has an effect on durability, and (3) whether 
durability is influenced by interactions between alloy thickness and alloy type. The 
data are in file CX12WING. 

SOLUTION 

The null and alternative hypotheses for this analysis are as shown in part A of 
Table 12.9. In terms of these data, the three null hypotheses hold that (1) no level 
of factor A (thickness) has an effect, (2) no level of factor B (alloy type) has an effect, 
and (3) there is no interaction between levels of factor A and levels of factor B. 

In Table 12.10, there are 4 levels of factor A and 3 levels of factor B, leading to 
4 X 3 = 12 combinations, or cells. Within each cell, there are r = 2 observations, 
or replications. For example, the cell for (i = 2, j = 3) contains the following 
observations: x 231 = 807 flexes and x 232 = 819 flexes. Also shown in Table 12.10 
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Factor B, alloy type, / = 1 to 3 



Factor A, 
thickness 
i= 1 to 4 



i = 7 



/' = 2 



/ = 3 



i = 4 



;'= 1 


y' = 2 


J = 3 


804 
816 




836 
828 




804 
808 








819 
813 




844 
836 




807 
819 








820 
821 




814 
811 




819 
829 








806 
805 




811 
806 




827 
835 



Means for levels 
of factor A 



816.00 



823.00 



819.00 



815.00 



•813.00 



823.25 

A 



818.50 



Means for levels 
of factor B 



The mean for the j = 2 level of factor B is 

836 + 828 + 844 + ■ ■ ■ + 81 1 + 806 

.2. g 

= 823.25 flexes 



The mean for the / = 4 level of factor A is 

806 + 805 + 81 1 + 806 + 827 + 835 

*4_ = e 

= 815.00 flexes 



The grand mean is the mean of all the data values, or 
= 804 + 816 + 836 + 828 + ■■■ + 827 + 835 



24 



= 81 8.25 flexes 



TABLE 12.10 



Data and example calcula- 
tions for a two-way ANOVA 
design to examine main 
and interactive effects of 
factor A (alloy thickness) 
and factor B (alloy type). 
Each cell is a combination 
of factor levels /' and j, and 
contains r = 2 observations 
or replications. 



are examples of the calculation of the means for the levels of both factors. The dot 
in the subscripts has the same meaning as in previous analyses — e.g., x 2 indi- 
cates that this mean is for all observations that received the j = 2 level of factor 
B. The calculations that follow correspond to part C of Table 12.9. 



Factor A Sum of Squares, SSA 

There are r = 2 replications within each cell and b 
based on differences between the grand mean (x 
means for the a = 4 levels of factor A: 



3 levels for factor B. SSA is 
818.25) and the respective 



SSA 



rb^ {x u 

i=l 



2(3) 



(816.00 
+(819.00 



818.25) 2 + (823.00 - 818.25) 2 
818.25) 2 + (815.00 - 818.25) 2 



6(38.75) and 



SSA = 232.50 
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Factor B Sum of Squares, SSB 

There are r = 2 replications within each cell and a = 4 levels for factor A. SSB is 
based on differences between the grand mean (x = 818.25) and the respective 
means for the b = 3 levels of factor B. 



SSB 



ra 2 (*/. ~ x ) 



;=i 

2(4)[(813.00 - 818.25) 2 + (823.25 - 818.25) 2 + (818.50 - 818.25) 2 ] 
8(52.625) and SSB = 421.00 



Error Sum of Squares, SSE 

In this calculation, each observation is compared to the mean of its own cell. For 



example, the mean of the (i = 2, / = 3) cell is x 



ijk 



</.' 



+ 



lj=\k=\ 

(804 - 810.00) 2 
810.00) 2 

816.00) 2 
816.00) 2 

820.50) 2 
820.50) 2 

805. 50) 2 
805. 50) 2 



+ (816 

(819 
+ (813 

(820 
+ (821 

(806 
+ (805 



and 



+ 



(836 
+ (828 

(844 

+ (836 

(814 
+ (811 

(811 
+ (806 



23. 



832.00) 2 
832.00) 2 

840.00) 2 
840.00) 2 

812.50) 2 
812.50) 2 

808.50) 2 
808.50) 2 



(807 + 819)/2, or 813.00. 



SSE = 334.00 



+ 



+ 



+ 



(804 


- 806.00) 2 


+ (808 


- 806.00) 2 


(807 


- 813. 00) 2 


+ (819 


- 813. 00) 2 


(819 


- 824.00) 2 


+ (829 


- 824.00) 2 


(827 


- 831. 00) 2 


+ (835 


- 831. 00) 2 



Total Sum of Squares, SST 

This calculation compares each observation with the grand mean (3c 
with the differences squared and summed: 



818.25), 



!=1/'=U=1 

(804 
+ (816 



x ,jk ~ x > 



+ 



+ 



+ 



and 



(819 
+ (813 

(820 
+ (821 

(806 
+ (805 



818.25) 2 
818.25) 2 

818.25) 2 
818.25) 2 

818.25) 2 
818.25) 2 

818.25) 2 
818.25) 2 



+ 



+ 



+ 



+ (828 

(844 

+ (836 

(814 
+ (811 

(811 
+ (806 



818.25) 2 
818.25) 2 

818.25) 2 
818.25) 2 

818.25) 2 
818.25) 2 

818.25) 2 
818.25) 2 



+ 



+ 



+ 



+ 



(804 
+ (808 



818 
818 



(807 - 818 
+(819 - 818 

(819 - 818 
+(829 - 818 

(827 - 818 
+(835 - 818 



25 ) 2 
25) 2 

25) 2 
25) 2 

25) 2 
25 ) 2 

25) 2 
25) 2 



SST = 3142.50 
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Interaction Sum of Squares, SSAB 

Having calculated the other sum of squares terms, we can obtain SSAB by sub- 
tracting the other terms from SST, or 



SSAB = SST - SSA - SSB - 
= 3142.50 - 232.50 



SSE 

- 421.00 - 334.00 



and 



SSAB = 2155.00 



The Mean Square Terms 

As part C of Table 12.9 shows, each sum of squares term is divided by the num- 
ber of degrees of freedom with which it is associated. There are a = 4 levels for 
factor A, b = 3 levels for factor B, and r = 2 replications per cell, and the mean 
square terms are as follows: 



Factor A: 



MSA 



SSA 232.50 



Factor B: 



MSB 



4 



SSB 421.00 



3 



and 



and 



MSA = 77.50 



MSB = 210.50 



Interaction, AB: 



MSAB = 
Error, E: 



SSAB 



l)(b 



2155.00 

(4 - 1)(3 - 1) 



and 



MSAB = 359.17 



MSE 



SSE 



ab(r 



334.00 
4(3)(2 - 1 



and 



MSE = 27.83 



The Test Statistics, the Critical Values, and the Decisions 

There are three sets of null and alternative hypotheses to be evaluated. In each 
case, the calculated E is compared to the critical F, listed in the F distribution table 
and described in part D of Table 12.9; then a decision is reached. The denomina- 
tor of the F-ratio for each test is MSE, which has ab(r - 1), or 4(3 )(2 - 1) = 12 
degrees of freedom. 

1. Testing for main effects, factor A The calculated F is 

MSA 77.50 2J8 



MSE 27.83 
The critical F for the 0.05 level is 

F[0.05, (a - 1), ab(r 



DJ 



or 



F(0.05, 3, 12) = 3.49 



Because the calculated value of F (2.78) does not exceed the critical value (3.49), 
H 0 : all a ■ = 0 cannot be rejected. Our conclusion is that none of the thicknesses 
has any effect on the number of flexes a test unit will withstand before it fails. 
2. Testing for main effects, factor B The calculated F is 



MSB 210.50 



MSE 27.83 



7.56 



448 



Part 4: Hypothesis Testing 



The critical F for the 0.05 level is 



or 



3. 



F[0.05,(fe - 1), ab(r - 1)] 



F(0.05,2, 12) = 3. 



In this case, the calculated F (7.56) is greater than the critical value (3.89), and H Q : 
all B- = 0 is rejected. At least one of the alloys has an effect on the number of flexes 
a test unit will withstand before it fails. The calculated F also exceeds the critical 
values for both the 0.025 level (5.10) and 0.01 level (6.93), so the null hypothesis 
would be rejected at these levels as well. From the F table listings, the most accu- 
rate statement we can make about the p-value for this test is that the p-value is less 
than 0.01. 

Testing for interaction effects between levels of factors A and B The calculated 
F is 



MSAB 359.17 



12.91 



MSE 27.83 
The critical F for the 0.05 level is 

F[0.05, (a - l)(b - \),ab(r - 1)] 



or 



£(0.05, 6, 12) = 3.00 



In the test for interaction effects, the calculated F (12.91) exceeds the critical 
value (3.00) and H 0 : all (aB)- = 0 is rejected. The factors are not operating inde- 
pendently, and there is some relationship between the levels of thickness (factor 
A) and alloy type (factor B) in determining how many flexes a test unit will with- 
stand before failure. The calculated F also exceeds the critical values for both the 
0.025 level (3.73) and the 0.01 level (4.82), so the null hypothesis would be re- 
jected at these levels as well. As with the test for the main effects of factor B, the 
most accurate statement we can make based on our tables is that the p-value for 
this test is less than 0.01. 

The summary findings for the preceding analysis are shown in Table 12.11, a 
standard format for displaying the results of a two-way ANOVA. As suggested 
from the listings in the table, it is evident that a great deal of the variability has 
come from interactions between the levels of factor A (thickness) and factor B 
(alloy type). 



TABLE 12.11 


Variation 




Degrees of 
Freedom 






Summary results for the 
two-way ANOVA 
performed on the data 
shown in Table 12.10 


Source 


Sum of Squares 


Mean Square 


F 


Factor/^ 
Factor B 


232.50 
421.00 


3 
2 


77.50 
210.50 


2.78 
7.56 




Interaction, AB 


2155.00 


6 


359.17 


12.91 




Error 


334.00 


12 


27.83 






Total 


3142.50 


23 
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FIGURE 12.3 




Interaction between levels 
of factor A (thickness) and 
factor 6 (alloy type) is pre- 
sent. Thicknesses 1 and 2 
seem best for alloy 2, but 
thickness 4 seems best for 
alloy 3. 



1 2 

Level of factor B (alloy type) 



To obtain further insight into the nature of the interactions between the factor lev- 
els, let's first consider the plots shown in Figure 12.3. In this graph, the vertical axis rep- 
resents the average number of flexes for the r = 2 test units within each cell. The hori- 
zontal axis represents the levels for factor B, alloy type. Thicknesses 1 and 2 seem 
relatively durable for alloy 2, but these thicknesses lead to early failure for alloys 1 and 
3. According to this figure, the longest-lasting combination is thickness 2 and alloy 2. 

Figure 12.4 (page 450) is similar to Figure 12.3, except that we now have the 
levels of factor A (thickness) as the horizontal axis and three lines that represent lev- 
els 1, 2, and 3 of factor B (alloy type). Keeping in mind that the test unit becomes 
thicker from thickness = 1 to thickness = 4, some interesting observations can be 
made. Referring to the lines in the plot, alloy 2 lasts longest when manufactured at 
thickness 2 but fails more quickly when it is at thickness levels 3 and 4. Similarly, 
alloy 1 turns in the best performance when it is made at thickness 3. On the other 
hand, the durability of alloy 3 progressively increases as it is made in greater thick- 
nesses. Figure 12.4 suggests that alloy 3 might last even longer if it were produced 
at a greater level of thickness than the levels that were included in this experiment. 

You can use Seeing Statistics Applet 16, at the end of the chapter, to see how 
graphs like Figures 12.3 and 12.4 respond to changes in the row, column, and 
interaction effects. 

Computer Outputs and Interpretation 

The Excel portion of Computer Solutions 12.3 shows the Excel data configura- 
tion and partial output for the two-way ANOVA we've just carried out. Excel 
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FIGURE 12.4 



Another perspective 
regarding interaction 
between levels of factor A 
(thickness) and factor 6 
(alloy type). The thicker the 
test unit, the longer alloy 3 
seems to last, but the 
reverse seems true for 
alloy 2. 




Level of factor A (thickness) 



provides the count, sum, mean, and variance for each factor level and cell, but 
only the means are shown in our printout. 

Note that Excel has identified factor A as "Sample" (rows) and factor B as 
"Columns." Although this is consistent with the format of the data in Table 12.10, 
it's not carved in stone that tabular displays of the data must have the levels of fac- 
tor A represented by rows and the levels of factor B represented by columns, so be 
careful. As long as you use and interpret the subscripts consistently, it doesn't re- 
ally matter whether the levels of a given factor are associated with the rows or with 
the columns in a given display of the data. 

The Excel printout in Computer Solutions 12.3 also shows exact p-values 
for the respective hypothesis tests that were carried out earlier. In testing for the 
main effects of factor A, we were unable to reject H 0 : all a i = 0 at the 0.05 level 
of significance. This is consistent with the p-value of 0.0864 reported by Excel 
for this test. For the test of the main effects of factor B, H Q : all B- = 0 can be 
rejected at the 0.0075 level. For the test of the interaction effects between levels of 
factors A and B, H Q : all (a/3L = 0 can be rejected at an even more significant level, 
0.0001. 

The Minitab portion of Computer Solutions 12.3 shows results comparable to 
the Excel printout. As an option, Minitab can also display the 95% confidence in- 
tervals for the population means associated with the factor levels. These intervals 
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Two-Way Analysis of Variance 
EXCEL 





A 


□ 

D 




U 


b 


r- 

r 




1 




Alloy 1 


Alloy 2 


Alloy o 








o 


Thicknessl 


804 


836 


oU4 








o 
o 




816 


828 


OUo 








A 

4 


Thickness 2 


819 


844 


OU/ 








0 




813 


836 


81 Q 








0 


Thickness 3 


820 


814 


o i y 








"7 
/ 




821 


811 


o^.y 








Q 


Thickness 4 


806 


811 










Q 

y 




805 


806 


OOD 








10 
















11 


Anova: Two-Factor With Replication 










12 
















13 


ANOVA 














14 


Source of Variation 


SS 


df 


MS 


F 


P- value 


Fcrit 


15 


Sample 


232.50 


3 


77.500 


2.784 


0.0864 


3.490 


16 


Columns 


421.00 


2 


210.500 


7.563 


0.0075 


3.885 


17 


Interaction 


2155.00 


6 


359.167 


12.904 


0.0001 


2.996 


18 


Within 


334.00 


12 


27.833 








19 
















20 


Total 


3142.50 


23 











Excel two-way ANOVA 

1 . For the sample data in Table 12.10 (file CX12WING.XLS), with the data and labels in columns A through E, as shown 
above: Click Tools. Click Data Analysis. Click Anova: Two-Factor With Replication. Click OK. 

2. Enter A1:D9 into the Input Range box. Enter 2 into the Rows per sample box. Specify the significance level for the 
test by entering 0.05 into the Alpha box. Select Output Range and enter A11 into the box. Click OK. The data and a 
portion of the printout are shown above. 

MINITAB 

Minitab two-way ANOVA 



Two-way ANOVA: Flexes versus Thicknes, Alloytyp 



Source 


DF 


SS 


MS 


F 




P 


Thi cknes 


3 


232 


.5 


77.500 2. 


78 


0 


086 


Alloytyp 


2 


421 


.0 


210.500 7. 


56 


0 


007 


Interaction 


6 


2155 


.0 


359.167 12 


90 


0 


.000 


Error 


12 


334 


0 


27.833 








Total 


23 


3142 


.5 










S = 5.276 


R-Sq 


= 89 


.37% 


R-Sq(adj) 




79. 


63% 



1. For the sample data in Table 12.10 (file CX12WING.MTW), with the performances (number of flexes) in C 1 , the thick- 
ness identifications in C2, and the alloy identifications in C3: Click Stat. Select ANOVA. Click Two-way. 

2. Enter C1 into the Response box. Enter C2 into the Row factor box. Enter C3 into the Column factor box. To get a 
display of the 95% confidence intervals for the population means associated with the factor levels (not shown here), 
click to select the Display means boxes. Click OK. 
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are calculated using a procedure similar to that described in Section 12.3. Each 
confidence interval is constructed by employing the t distribution, with s p (com- 
puted as the square root of MSE) used as an estimate of a. These confidence inter- 
vals are calculated as follows: 

Two-way ANOVA, confidence interval for the mean, level k of factor M: 
The confidence interval for /jL kM is 




where s 



P 



VMSE, the pooled estimate of a 

the total number of persons or test units 

receiving level k of factor M 

the mean for all persons or test units 

receiving level k of factor M 

from the t distribution, the t value for 

the desired confidence level, using the 

same df as associated with MSE 



For example, the mean for level 1 of factor A (thickness) was 816.00 flexes, 
and a total of 6 test units was subjected to this level of factor A. The pooled es- 
timate for the population standard deviation is s p = VM5£ = V27.8, or 
Sp = 5.27. Using the t distribution with the df = 12 associated with MSE, the 
95% confidence interval for ju 1A , the population mean associated with level 1 of 
factor A (thickness) is 



1 1A 



816.00 
816.00 



5-27 where df = 12 degrees of freedom associated with 
V6 | MSE, and t = 2.179 for a 95% interval 

2.179(2.15), or from 811.32 to 820.68 flexes 



We have 95% confidence that the population mean for level 1 of factor A is 
somewhere from 811.32 to 820.68 flexes. The other confidence intervals for the 
population means associated with the factor levels can be similarly calculated. 



exercises 



Note: The computer is highly recommended when using 
ANOVA techniques. For exercises involving raw data, 
data files are shown in parentheses. 

12.60 What is the purpose of two-way analysis of 
variance? 

12.61 What assumptions are required in using the two- 
way ANOVA? 

12.62 Why are there more sets of null and alternative 
hypotheses that can be tested in two-way ANOVA com- 
pared to the one-way and randomized block designs? 




12.63 How is two-way ANOVA similar to the random- 
ized block design? How does it differ? 

12.64 What are main effects and interactive effects in the 
two-way ANOVA? 

12.65 In the two-way ANOVA, what is meant by the 
term replications'! 

12.66 In a two-way ANOVA experiment, factor A is op- 
erating on 4 levels, factor B is operating on 3 levels, and 
there are 3 replications per cell. How many treatments 
are there in this experiment? Explain. 
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12.67 For the two-way ANOVA model, x ; ^= /x + ot j + 
B- + (aft)- + e^, explain the meaning of each of the six 
terms in the equation, then use the appropriate terms from 
the equation in presenting and explaining the three sets of 
null and alternative hypotheses that can be tested. 

12.68 In a two-way ANOVA experiment, factor A is 
operating on 3 levels, factor B is operating on 2 levels, 
and there are 2 replications per cell. If MSA/MSE = 
5.35, MSB/MSE = 5.72, and MSAB/MSE = 6.75, and 
using the 0.05 level of significance, what conclusions 
would be reached regarding the respective null hypothe- 
ses for this experiment? 

12.69 In a two-way ANOVA experiment, factor A is op- 
erating on 4 levels, factor B is operating on 3 levels, and 
there are 3 replications per cell. If MSA/MSE = 3.54, 
MSB/MSE = 5.55, and MSAB/MSE = 12.40, and using 
the 0.025 level of significance, what conclusions would 
be reached regarding the respective null hypotheses for 
this experiment? 

12.70 For a two-way ANOVA in which factor A operates 
on 3 levels and factor B operates on 4 levels, there are 2 
replications within each cell. Given the following sum of 
squares terms, construct the appropriate table of ANOVA 
summary findings and use the 0.05 level in examining the 
null and alternative hypotheses associated with the 
experiment. 

SSA = 89.58 SSB = 30.17 
SSAB = 973.08 SSE = 29.00 

12.71 Given the following data for a two-way ANOVA, 
identify the sets of null and alternative hypotheses, then use 
the 0.05 level in testing each null hypothesis. (Use data file 
XR12071.) 

Factor fi 



1 



Factor A 



12.72 Given the following data for a two-way ANOVA, 
identify the sets of null and alternative hypotheses, then 
use the 0.05 level in testing each null hypothesis. (Use data 
file XR1 2072.) 

Factor B 



Factor A 



1 


2 


3 


152 
151 




158 
154 




160 
160 








158 
154 




164 
158 




152 
155 








160 
161 




147 
150 




147 
146 



12.73 Given the following data for a two-way ANOVA, 
identify the sets of null and alternative hypotheses, then use 
the 0.05 level in testing each null hypothesis. (Use data file 
XR12073.) 

Factor B 



Factor A 2 



12.74 Given the following summary table for a two-way 
ANOVA, fill in the missing items (indicated by asterisks), 
identify the sets of null and alternative hypotheses to be 
examined, then use the 0.05 level in reaching a 
conclusion regarding each null hypothesis. 

Degrees 



Variation 


Sum of 


of 


Mean 




Source 


Squares 


Freedom 


Square 


F 


Factor A 


40.50 


*** 


* * * 


* * * 


Factor B 


154.08 


1 


* * * 


* * * 


Interaction, 










AB 


70.17 


2 


* -k * 


* * * 


Error 


31.50 


6 


** * 




Total 


296.25 


11 







12.75 A study has been undertaken to examine the effect 
of background music and assembly method on the 
productivity of workers on a production line for 
electronic circuit boards. Each of 8 workers has been 
randomly assigned to one of 4 cells, as shown here. So 
that each worker will hear only the intended type of 
music, headphones are used for the background music in 
the experiment. Data are the number of circuit boards 
assembled in 1 hour. Using the 0.05 level of significance 
in testing for the main and interactive effects, what 
conclusions should be drawn from this experiment? 
Determine the 95% confidence interval for the 
population mean for each factor level. (Use data 
file XR12075.) 



Music 



Classical 



Rock 



Assembly Method 



35 
32 



41 
40 
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12.76 For Exercise 12.75, and using the levels of the as- 
sembly method factor as the horizontal axis and "units 
produced" as the vertical axis, plot and connect the cell 
means for the cells associated with the "classical music" 
level of the background music factor. On the same graph, 
plot and connect the cell means for the cells associated 
with the "rock music" level of the background music fac- 
tor. Do the plots indicate the presence of interaction 
between the factor levels? Explain. 

12.77 Each of 12 undergraduate students has been 
randomly assigned to one of the 6 cells shown here. The 
purpose of the study is to test whether factor A (if a 
shopping bag is being carried) and factor B (mode of 
dress) have main and interactive effects regarding the 
number of seconds it takes to get waited on at a jewelry 
counter of an upscale department store. Students have 
entered the store at times that have been randomly 
scheduled for the day of the experiment. Using the 
0.025 level of significance in testing for the main and 
interactive effects, what conclusions should be drawn 
from this experiment? Determine the 95% confidence 
interval for the population mean for each factor level. 
(Use data file XR12077.) 

Mode of Dress 



Sloppy Casual Dressy 



Shopping 
Bag 

Carried? 



Yes 



No 



"minutes" as the vertical axis, plot and connect the cell 
means for the cells associated with the "sloppy" level of 
the dress factor. On the same graph, plot and connect the 
cell means for the cells associated with the "casual" level 
of the dress factor. Do a third plot, connecting the cell 
means for cells associated with the "dressy" level of the 
dress factor. Do the plots indicate the presence of interac- 
tion between the factor levels? 

12.79 An experiment has been conducted to examine main 
and interactive effects of factor A (keyboard configuration, 
3 levels) and factor B (word processing package, 2 levels). 
Each cell consists of two replications, representing the num- 
ber of minutes each of two secretaries randomly assigned to 
that cell required to type a standard document. Interpret 
the accompanying computer output for this study, including 
your conclusion (use the 0.05 level) for each null hypothesis 
associated with the experiment. 

Two-Way Analysis of Variance 
Analysis of Variance for MINUTES 

Source DF SS MS F P 

KEYBOARD 2 0.6450 0.3225 5.53 0.043 

W0RDPACK 1 0.2133 0.2133 3.66 0.104 

Interaction 2 0.9117 0.4558 7.82 0.021 

Error 6 0.3500 0.0583 
Total 11 2 . 1200 

Individual 95% CI 
KEYBOARD Mean -+ + + + + 

1 2.83 C * ) 

2 2.38 C * ) 

3 2.90 C * ) 

2.10 2.40 2.70 3.00 3.30 
Individual 95% CI 
W0RDPACK Mean + + + + 

1 2.567 C * ) 

2 2.833 C * ) 

2.400 2.600 2.800 3.000 



12.78 For Exercise 12.77, and using the levels of the 
"shopping bag carried" factor as the horizontal axis and 



12.80 Through separate calculations, verify the 
confidence interval for each of the factor-level means 
shown in the output for Exercise 12.79. 



1 2.6 SUMMARY 



• ANOVA and experiments 

The chapter goes beyond the two-sample tests of Chapter 11 to introduce analy- 
sis of variance (ANOVA), a procedure capable of comparing the means of two or 
more independent samples at the same time. The starting point for ANOVA is 
often an experiment in which the goal is to determine whether various levels of an 
independent variable, or factor, might be exerting different effects on the depen- 
dent variable, or measurement. When there is only one factor in an experiment, 
each factor level can be referred to as a treatment. 

• The ANOVA procedure 

Basic to ANOVA is the comparison of variation between samples versus the amount 
of variation within the samples. The test statistic is an P-ratio in which 
the numerator reflects variation between the samples and the denominator reflects 
the variation within them. If the calculated F-statistic exceeds the critical F for a 
given test, the null hypothesis of equal population means is rejected. The numerator 
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and denominator of the F-statistic are separate estimates of the variance that the 
populations are assumed to share. Further assumptions are that the samples have 
been independently selected and that the population distributions are normal. 

• One-way ANOVA 

In the one-way ANOVA, there is just one factor, and the null hypothesis is that the 
population means are equal for the respective treatments, or factor levels. Treat- 
ments are randomly assigned to the persons or test units in the experiment, so this 
method is also referred to as the completely randomized ANOVA. The null hy- 
pothesis can also be expressed in terms of a model where each measurement is 
assumed to be the sum of three components: an overall mean, the effect of a treat- 
ment, and a random error due to sampling. When there are only two treatments, 
the one-way ANOVA is equivalent to the nondirectional pooled-variances i-test 
of Chapter 11. 

• Randomized block ANOVA 

In the randomized block design, persons or test units are first arranged into simi- 
lar groups, or blocks, then the treatments are randomly assigned. Blocking helps 
reduce error variation by ensuring that the treatment groups are comparable in 
terms of the blocking variable. This design assumes that no interactions exist be- 
tween the blocks and the treatments. Although the blocking variable can be 
viewed as representing a second independent variable, it is introduced solely for 
the purpose of controlling error variation, and the major purpose continues to be 
the examination of the effects of the factor levels, or treatments. When there are 
just two treatments, the randomized block design is equivalent to the nondirec- 
tional, dependent-samples £-test of Chapter 11. 

• Two-way ANOVA 

In two-way ANOVA, there are two factors, each of which operates on two 
or more levels. In this design, it is no longer appropriate to refer to the factor lev- 
els as treatments, since each combination of their levels constitutes a separate 
treatment. The two-way ANOVA examines the main effects of the levels of both 
factors as well as interactive effects associated with the combinations of their lev- 
els. Accordingly, significance tests include those for the main effects of each factor 
and for interactive effects between the combinations of factor levels. 

• Computer-assisted ANOVA 

A computer statistical package is especially useful in ANOVA procedures, since 
calculations become quite extensive for even small amounts of data. Along with 
small-scale examples for demonstration purposes, the chapter includes the corre- 
sponding Excel and Minitab printouts and interpretation for each analysis. 



m 



One-Way Analysis of Variance 

This ANOVA procedure assumes that (1) the samples have been independently selected, 

(2) the population variances are equal (oj = cr| = • • • = a\) for the t treatments, and 

(3) the population distributions are normal. The null and alternative hypotheses, data for- 
mat, calculation formulas, and decision rule appear in Table 12.2. 
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One-Way Analysis of Variance, Confidence Interval for a Treatment Mean 

For treatment /, the confidence interval for /jl - is 

x- ± tl— 4=1 where s p = VMSE, the pooled estimate of <x 
\ n j\ n- = number of persons or test units 

in treatment group /' 
X- = sample mean for treatment group ;' 
t = from the t distribution, the t value for 
the desired confidence level, using the 
same df associated with MSE 



Randomized Block Design 

The randomized block design introduces a blocking variable to reduce error variation, 
but the intent of the design is not to measure the effect of this variable. Accordingly, the 
significance test usually considers only the treatment variable. Assumptions for the ran- 
domized block design are that (1) the one observation in each treatment-block combina- 
tion (cell) has been randomly selected from a normal population, (2) the variances are 
equal for the values in the respective populations, and (3) there is no interaction between 
the blocks and the treatments. The null and alternative hypotheses, data format, calcula- 
tion formulas, and decision rule appear in Table 12.6. The process of blocking persons 
or test units into homogeneous blocks interferes with the randomness of the treatment 
groups, and we are not able to calculate confidence intervals for the treatment means. 
For those who do wish to examine the effectiveness of the blocking variable, the proce- 
dure is discussed in the chapter and briefly summarized below: 

Hypotheses for testing the effectiveness of the blocking variable: 

H Q : The levels of the blocking variable are equal in their effect. 
Hj: At least one level has an effect different from the others. 

Calculated value of F = MSB/MSE, computed as shown in Table 12.6 

Critical value of F = F(a, v v v 2 ) 

where v 1 = (n — 1), ^associated with the numerator 

Vj = (t — l)(n — 1), df associated with the denominator 
and n = number of blocks 

f = number of treatments 

Decision rule: If calculated F > F(a, v v v 2 ), reject H Q at the a level. 
Two-Way Analysis of Variance 

In this design, there are two factors, and the significance tests involve both main effects 
and interactive effects. There are r replications within each of the factor-level combina- 
tions, or cells. It is assumed that (1) the r observations within each cell have been drawn 
from a normal population, and (2) the variances of the respective populations are equal. 
The null and alternative hypotheses, data format, calculation formulas, and decision 
rules appear in Table 12.9. 

Two-Way Analysis of Variance, Confidence Interval for the Mean of a Factor Level 

For level k of factor M, the confidence interval for is 

x kM ± tl X- — | where s p = VMSE, the pooled estimate of a 

\ v n kMj n kM = total number of persons or test units 

receiving level k of factor M 
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x kM = mean for all persons or test units 
receiving level k of factor M 
t = from the t distribution, the t value for the 
desired confidence level, using the same 
df associated with MSE 



chapter exercises ji I ' '^*fflM 



Note: The computer is highly recommended when using 
ANOVA techniques. For exercises involving raw data, 
data files are shown in parentheses. 

12.81 Compare the respective purposes of one-way, 
randomized block, and two-way analysis of variance. In 
general, under what circumstances would each method 
be used? 

12.82 Since the f-tests of Chapter 11 and the ANOVA 
techniques of this chapter are both involved with the 
comparison of sample means, why can't the techniques 
always be used interchangeably? 

12.83 The data in an experiment have been examined 
using ANOVA, and a confidence interval has been con- 
structed for the mean associated with each level for a 
factor. Which ANOVA technique (one-way, randomized 
block, or two-way) has not been used in this analysis? 
Explain. 

12.84 The personnel director for a large firm selects a 
random sample consisting of 100 clerical employees, then 
finds out whether they have been with the firm for more 
than 5 years and how many shares of the company's stock 
they own. The purpose is to see if longevity within the 
firm has an influence over the number of shares of stock 
that are held; ANOVA is to be used in analyzing the data. 
Identify the independent and dependent variables and 
explain whether this is a designed experiment. 

12.85 Three different faceplate designs have been 
selected for the radio intended for a new luxury automo- 
bile, and safety engineers would like to examine the 
extent to which their operation will be relatively intuitive 
to the motorist. Fifteen drivers are selected and randomly 
placed into one of three groups (faceplate 1, faceplate 2, 
and faceplate 3). Each driver is then asked to insert a 
music CD and play track number 18, and engineers mea- 
sure how long it takes for this task to be accomplished. 
ANOVA is to be used in analyzing the data. Identify the 
independent and dependent variables, and explain 
whether this is a designed experiment. 

12.86 Given the situation described in Exercise 12.85, 
which ANOVA procedure from this chapter would be 
appropriate for the analysis of the resulting data? 



12.87 Given the situation described in Exercise 12.85, 
suppose the groups are contrived so that each group 
contains two drivers who are under 21 years of age, 
two drivers who are between 22 and 60, and one 
driver who is over 60. If the three groups were 
constructed in this manner, would the ANOVA 
procedure associated with Exercise 12.86 still be 
applicable? Why or why not? 

12.88 A firm that specializes in preparing recent law 
school graduates for the state bar exam has formulated 
two alternatives to their current preparation course. 
To examine the relative effectiveness of the three possi- 
ble courses, they have set up a study in which they 
randomly assigned a group of recent graduates so that 
five of them take each preparation program. For the 
five who have taken alternative program A, their even- 
tual scores on the bar exam are 82, 75, 57, 74, and 74. 
For the five who have taken alternative program B, 
their eventual scores on the bar exam are 73, 63, 57, 
59, and 76. For the five who have taken the current 
program, their eventual scores on the bar exam are 74, 
71, 66, 78, and 61. (The data are also in file XR12088.) 
At the 0.025 level of significance, can the firm conclude 
that the three preparation programs might be equally 
effective in getting recent graduates ready for the state 
bar exam? 

12.89 An industrial sales manager, testing the effectiveness 
of three different sales presentations, randomly selected a 
presentation to be used when making the next sales call on 
14 customers. The numbers of units purchased as a result 
of each sales call are shown here. Use the 0.05 level in de- 
termining whether the three sales presentations could be 
equally effective. (Use data file XR12089.) 

Presentation 



1 2 3 



7 13 14 

9 10 15 

11 11 12 

7 13 10 

14 13 
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12.90 Four different brands of brake shoes have been 
installed on 12 city transit buses, with each brand 
installed on 3 buses selected at random from the 12. 
The number of thousands of miles before the lining 
required replacement was as follows for each bus. 
(Use data file XR12090.) 

Brand of Brake Lining 



12 3 4 



23 19 23 17 

20 11 25 13 

17 16 30 19 



a. From the variation between the samples, what is the 
estimated value of the common variance, a 2 , that is 
assumed to be shared by the four populations? 

b. From the variation within the samples, what is the 
estimated value of the common variance, a 2 , that is 
assumed to be shared by the four populations? 

c. At the 0.01 level of significance, could the four types 
of brake linings be equally durable? 

12.91 In an attempt to compare the assessments 
provided by the four assessors it employs, a municipal 
official sends each assessor to view the same five homes. 
Their visits to the homes are in a random order, and the 
assessments they provide are as shown here. Use the 
0.025 level in comparing the assessor effects. (Use data 
file XR12091.) 

Assessor Number 
12 3 4 

A 40 48 55 53 thousand dollars 

B 49 46 52 50 

Home C 35 47 51 48 

Assessed D 60 54 70 72 

E 100 81 109 88 

12.92 Given the following data from three independent 
samples, use the 0.025 level in determining whether 
the population means could be the same. (Use data 

file XR12092.) 

Sample 



1 2 3 



6 7 14 

9 20 18 

18 11 23 

12 15 17 

13 23 27 
10 16 



12.93 A testing agency is evaluating three different 
brands of bathroom scales and has selected random sam- 
ples of each brand. For brand A, a test object was found 
to weigh 204, 202, 197, 204, and 205 pounds on the five 



scales sampled. For four scales of brand B, the test object 
weighed 201, 199, 196, and 203 pounds, while for six 
scales of brand C, the object weighed 195, 197, 192, 
196, 198, and 196 pounds. Using the 0.025 level of sig- 
nificance, determine whether the three brands could have 
the same population mean for this test object. (Use data 
file XR12093.) 

12.94 Four different alloy compositions have been used 
in manufacturing metal rods, with the hardness measure- 
ments shown here. Use the 0.05 level of significance in 
testing whether the population means might be equal, 
then construct the 95% confidence interval for each 
population mean. (Use data file XR12094.) 

Alloy Composition 



12 3 4 

42 52 45 46 

51 54 48 54 

46 45 39 63 

50 56 

53 



12.95 An investor has consulted four different financial 
advisors with regard to the expected annual rate of 
return for each of three portfolio possibilities she is 
considering. The financial advisors have been chosen 
because they are known to range from very conservative 
(advisor A) to very optimistic (advisor D). The advisors' 
respective estimates for the three portfolios are shown 
here. Use the 0.05 level in comparing the portfolios. 
(Use data file XR12095.) 



Estimated Annual Rate of Return for Portfolio 







1 


2 


3 




A 


8% 


8 


5 


Advisor 


B 


12 


10 


8 




C 


8 


1 1 


10 




D 


15 


12 


1 1 



12.96 Researchers have obtained and tested samples of 
four different brands of nylon rope that are advertised 
as having a breaking strength of 100 pounds. Given 
the breaking strengths (in pounds) shown here, use the 
0.025 level in comparing the brands. (Use data file 
XR12096.) 



Brand A 


Brand B 


Brand C 


Brand D 


103.1 


111.6 


109.0 


118.0 


108.9 


1 17.8 


111.8 


115.8 


106.7 


109.8 


113.0 


114.2 


1 14.3 


110.1 


109.7 


117.3 


1 13.3 


1 18.3 


108.6 


113.8 


110.5 


116.7 


114.7 


110.6 
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12.97 A state law enforcement agency has come up 
with three different methods for publicizing burglary- 
prevention measures during vacation periods. Recogniz- 
ing that there are more burglaries in larger communities 
than in smaller communities, three communities 
have been randomly selected from each size category, 
then assigned to a treatment group. The accompanying 
results are the number of home burglaries during the 
test month for communities in which the respective 
publicity methods were employed. Use the 0.05 level 
in comparing the publicity techniques. (Use data 
file XR12097.) 

Publicity Method 



Community 
Size 



12.98 Interested in comparing the effectiveness of four 
different driving strategies, a government agency has 
equipped a compact car with a fuel-consumption meter 
that measures every 0.01 gallon of gasoline consumed. 
Each of five randomly selected drivers applies each strat- 
egy in negotiating the same test course. The order in 
which each driver applies the strategies is randomly deter- 
mined, and the fuel-consumption data are shown here. 
Use the 0.05 level in comparing the driving strategies. 
(Use data file XR12098.) 

Driving Strategy 



Type Darkness 





1 


2 


3 


Small 


14 


13 


8 burglaries 


Medium 


17 


15 


13 


Large 


27 


23 


17 





1 


2 


3 


4 


A 


21 


34 


25 


38 


B 


23 


29 


20 


32 


C 


28 


33 


26 


37 


D 


25 


28 


18 


24 


E 


19 


26 


23 


16 



Driver 



hundredths of a gallon 

12.99 A magazine publisher is studying the influence of 
type style and darkness on the readability of her publica- 
tion. Each of 12 persons has been randomly assigned to 
one of the cells in the experiment, and the data are the 
number of seconds each person requires to read a brief 
test item. For these data, use the 0.05 level of significance 
in drawing conclusions about the main and interactive 
effects in the experiment. Determine the 95% confidence 
interval for the population mean for each factor level. 
(Use data file XR12099.) 



Light 



Medium 



Dark 



Type Style 



12.100 Three different heat-treatment methods are being 
tested against four different zinc-coating techniques, with 
data representing the number of pounds required for a 
test probe to penetrate the layer of zinc coating. Metal 
samples have been randomly assigned to the factor levels, 
with two replications in each cell. For the data shown 
here, use the 0.01 level of significance in drawing conclu- 
sions from this experiment. Determine the 95% 
confidence interval for the population mean for each fac- 
tor level. (Use data file XR12100.) 

Zinc-Coating Technique 



663 
695 



736 
740 



736 
675 



747 
736 



Heat Treatment 2 
Method 



726 
759 



778 
759 



690 
650 



697 
710 



774 
745 



671 

690 



756 
774 



774 
813 



12.101 Three different point-of-purchase displays are 
being considered for lottery ticket sales at a chain of con- 
venience stores. Two different locations within the store 
are also being considered. Eighteen stores for which 
lottery ticket sales have been comparable are randomly 
assigned to the six cells shown here, with three stores 
receiving each treatment. Data are the number of lottery 
tickets sold during the day of the experiment. For these 
data, use the 0.025 level of significance in drawing 
conclusions about the main and interactive effects in the 
experiment. Determine the 95% confidence interval for 
the population mean for each factor level. (Use data file 
XR12101.) 

Type of Display 

Position 12 3 



Next to 1 

Cash 

Register 

Mounted 2 
on Entrance 
Door 



43, 39, 40 39, 38, 43 57, 60, 49 



53,46,51 58,55,50 47,42,46 
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Thorndike Sports Equipment (Video Unit Six) 



Ted and Luke Thorndike have been approached by several 
local racquetball players, each of whom would like to be 
sponsored by Thorndike Sports Equipment during regional 
tournaments to be held during the next calendar year. In 
the past, Luke has resisted the sponsorship possibility. His 
philosophy has been, "Our racquets are the best in the 
business. If they're smart, they'll use a Thorndike racquet; 
and if we're smart, we won't give it to them for free." 

It wasn't easy, but Ted finally convinced Luke of the ben- 
efits to be gained by exposing racquet club members and 
tournament spectators to high-caliber players who rely on 
Thorndike racquets and related sports equipment. According 
to Ted, this exposure is easily worth a few racquets, a couple 
of pairs of court shoes, and a half-dozen "Thorndike Power" 
T-shirts. 

The Thorndikes compromise between Luke's original 
idea of sponsoring nobody and Ted's proposal to sponsor 
at least three players. Their eventual selection is Kermit 
Clawson, an outstanding young player who finished third 
in the regional championship tournament last year. Kermit 
and the Thorndikes have reached agreement on all of the 



details of the sponsorship arrangement but must decide on 
which combination of racquet and string would be best. 

Kermit is a power player who relies heavily on the speed 
of his serves and return shots. The Thorndike line currently 
includes four "power" racquets, each of which can be strung 
with one of three types of string most often used by strong 
hitters like Kermit. Ted sets up a test in which Kermit will hit 
two "power serves" with each combination of racquet and 
string. The 24 serves will be made in a random order, and 
each will be clocked with a radar gun. Besides helping Ker- 
mit select the combination he likes best, the test will also help 
the Thorndikes study what Ted has referred to as the "main" 
and "interactive" effects of the racquets and strings. The 
data from the test are shown here and are provided in file 
THORN12. 

From these data, which combination of racquet and 
string would you recommend that Kermit use? Also, em- 
ploy the appropriate ANOVA procedure in helping Ted re- 
spond to Luke's statement that "the racquets are all the 
same, the strings are all the same, and it doesn't matter 
which string goes into which racquet." 



Racquet 



1 



7 105,108 113,109 114,112 109, 108 mph 

String 2 110,108 109,107 118,114 113,113 

3 113,112 114,109 114,114 110,107 



Springdale Shopping Survey 

Item C of the Springdale Shopping Survey, introduced at 
the end of Chapter 2, describes variables 7-9 for the survey. 
These variables represent the general attitude respondents 
have toward each of the three shopping areas, and they 
range from 5 (like very much) to 1 (dislike very much). The 
data are in file SHOPPING. 

Variable 7 = Attitude toward Springdale Mall 
Variable 8 = Attitude toward Downtown 
Variable 9 = Attitude toward West Mall 

The information gained from this database exercise 
would be useful in determining whether the population 
mean attitudes toward the three shopping areas could be 



the same for men and women, for persons from various 
education levels, or for persons who differ on the basis of 
other criteria. Such knowledge could help managers to bet- 
ter understand and serve their customers. 



Part I: Attitude Comparisons Based on the 
Gender of the Respondent 

1. For variable 7 (attitude toward Springdale Mall), 
compare the mean scores for respondents according to 
variable 26 (gender of respondent). At the 0.05 level of 
significance, could the population means of these 
groups be the same? What is the p- value for the test? 
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2. Repeat step 1 for variable 8 (attitude toward Down- 
town). 

3. Repeat step 1 for variable 9 (attitude toward West 
Mall). 

Part II: Attitude Comparisons Based on 
Education Level 

1. For variable 7 (attitude toward Springdale Mall), com- 
pare the mean scores for respondents according to vari- 
able 27 (education category). At the 0.05 level of signif- 
icance, could the population means of these groups be 
the same? What is the p-value for the test? 

2. Repeat step 1 for variable 8 (attitude toward Down- 
town). 

3. Repeat step 1 for variable 9 (attitude toward West 
Mall). 



Part III: Some Further Comparisons 

Given the number of classificatory variables in this 
database, a great many other comparisons can be made 
among sample means. For example, variables 10-17 indi- 
cate which shopping area the respondent feels is best in 
terms of possessing a specific, desirable attribute. As just 
one example, compare the means for variable 7 (attitude 
toward Springdale Mall) for groups classified according to 
variable 11 (shopping area having the highest-quality 
goods). Depending on your time, curiosity, and computer 
access, you may wish to try some of the many other classi- 
ficatory variables in selecting groups and comparing atti- 
tude scores. 




Fastest Courier in the West 



The law firm of Adams, Babcock, and Connors is located in 
the Dallas-Fort Worth metroplex. Randall Adams is the 
senior and founding partner in the firm. John Babcock has 
been a partner in the firm for the past eight years, and 
Blake Connors became a partner just last year. The firm 
employs two paralegal assistants and three secretaries. In 
addition, Bill Davis, the newly hired office manager, is in 
charge of day-to-day operations and manages the financial 
affairs of the law firm. 

A major aspect of the law firm's business is the prepa- 
ration of contracts and other legal documents for its 
clients. A courier service is employed by the firm to deliver 
legal documents to its many clients as they are scattered 
throughout the metroplex. The downtown centers of Dal- 
las and Fort Worth are separated by a distance of approx- 
imately 30 miles. With the large sizes of these cities and 
their associated heavy traffic, a trip by car from the south- 
west side of Fort Worth to the northeast side of Dallas can 
easily take longer than an hour. Due to the importance of 
the legal documents involved, their timely delivery is a 
high priority. At a recent partners' meeting, the topic of 
courier delivery came up. 

Adams: "Recently, we have received a couple of com- 
plaints from some of our best clients about delayed 
contract deliveries. I spent the better part of an hour 
yesterday afternoon trying to calm down old man 
Dixon. He claims that if those contracts had arrived 
any later, his deal with the Taguchi Group would have 
fallen through." 




Connors: "Well, it wasn't our fault. Anne had the contracts 
all typed and proofread before nine in the morning." 

Adams: "No, no. Everything was handled fine on our end. 
The delay was the courier's fault. Something about a 
delay in the delivery. ..." 

Babcock: "Metro Delivery has always done a good job 
for us in the past. I am sure that these are just a few 
unusual incidents." 

Connors: "On the other hand, it could be an indication 
that their service is slipping." 

Adams: "In any event, we cannot afford to offend our 
clients. No one is perfect, but it only takes one or two 
bad incidents to lose important clients. At least two 
new courier services have opened in the metroplex 
during the last two years. I hear good things about 
them from some of my friends. The question is, Should 
we keep using Metro or consider using one of these 
other services?" 



[continued) 



F Distribution and ANOVA 



When this applet loads, it shows an F distribution diagram 
similar to the one in Figure 12.1. The components include 
the degrees of freedom for the numerator {v } = 2 is shown 
as "df1 = 2"), the degrees of freedom for the denomina- 
tor (v 2 = 7 is shown as "df2 = 7"), and the critical value of 
F (6.54) associated with the specified level of significance 
for the test ("prob = 0.025"). 

By moving the left and right sliders, we can change the 
number of degrees of freedom associated with the numer- 
ator and denominator of the F-ratio, and the shape of the 
curve changes accordingly. Likewise, we can use the text 
boxes to enter a desired value for either the F-ratio or the 
probability. 




Note that the F-ratio and probability values in the text 
boxes will sometimes differ slightly from those we would 
look up in our F distribution tables in the back of the book. 
Besides being more accurate because they do not rely on 
printed tables, these text boxes can also provide a wider va- 
riety of values for F-ratios and their related upper-tail prob- 
abilities than would be available from our printed tables. 

Applet Exercises 

15.1 Using the left slider, increase the number of degrees of 
freedom for the numerator of the F-ratio to df1 = 5, then 
to df 1 = 1 0. What happens to the shape of the F distribu- 
tion curve? 

15.2 Use the left and right sliders to set the degrees of free- 
dom back to df 1 = 2 and df2 = 7. Next, use the right slider 
to set the number of degrees of freedom for the denomina- 
tor to df2 = 10, then to df2 = 1 5. What happens to the 
shape of the F distribution curve? 

15.3 Use the left and right sliders to set the degrees of free- 
dom back to df 1 = 2 and df2 = 7. Next, use the left text 
box to increase the F value to 9.55. (Be sure to press the 
Return or Enter key after changing the text box entry.) 
What effect does this have on the probability? 

15.4 Use the left and right sliders to set the degrees of free- 
dom back to df 1 = 2 and df2 = 7, and use the left text box 
to return the F value to 6.54. (Be sure to press the Return 
or Enter key after changing the text box entry.) Next, use 
the right text box to change the probability to 0.01 . What 
effect does this have on the F value? 



Connors: "How would you suggest that we make this 
decision?" 

Babcock: "Why not give each one a trial period and 

choose the best performer?" 
Adams: "Great idea! But how would you decide who's 

best?" 

Babcock: "Well, obviously the choice boils down to pick- 
ing the fastest courier service. Given our recent prob- 
lem, we also want to avoid the infrequent, but costly, 
delayed deliveries. Delivery cost is an important sec- 
ondary criterion." 

Connors: "Why not let our new office manager run this 
little 'contest' for a few weeks? Bill normally keeps 
fairly detailed information about contract deliveries 
anyway. As the need for deliveries arises, he can rotate 
through all three couriers." 

Adams: "Let's be sure not to let any of the couriers know 
about the contest; otherwise, we may not see their typ- 



ical performance. We'll take up this topic again after 
Bill has collected and analyzed some data and is ready 
to make a presentation." 

During the past month, Bill Davis has kept detailed 
records of the deliveries made by each of three courier ser- 
vices: DFW Express, Carborne Carrier, and Metro Deliv- 
ery (the courier presently used by the law firm). Due to the 
importance of the documents delivered, a courier is re- 
quired to phone the law office as soon as a delivery has 
been made at its destination. For each delivery, Bill's data 
set contains the courier used, the pickup time, the delivery 
time, the mileage of the delivery, and the cost of the deliv- 
ery. Each of the courier services charges a flat fee plus a 
mileage charge for each delivery. These charges vary from 
courier to courier. 



Interaction Graph in Two-Way ANOVA 



This applet shows interaction graphs similar to those in 
Figures 1 2.3 and 1 2.4, but it gives us the opportunity to see 
how the interaction graphs change in response to changes 
we make in the row, column, and interaction effects. The top 
slider changes the difference between the row means 
("Row"), the middle slider changes the difference between 
the column means ("Col"), and the bottom slider changes 
the interaction effects ("RxC"). 
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Applet Exercises 

16.1 If they are not already in this position, center all three 
sliders so that the value at the far right of each slider scale is 
a zero. Next, slide the top slider to the right and set it at + 1 0 



to increase the difference between the row means. Now slide 
it further to the right, to +20. Describe the nature of the 
graph and what happens when the difference between the 
row means is increased. 

16.2 Center all three sliders so that the value at the far right 
of each slider scale is a zero. Next, slide the middle slider to 
the right and set it at + 1 0 to increase the difference between 
the column means. Now slide it further to the right, to +20. 
Describe the nature of the graph and what happens when 
the difference between the column means is increased. 

16.3 Center all three sliders so that the value at the far right 
of each slider scale is a zero. Next, set each of the top two 
sliders to -20 and describe the appearance of the graph. 
Now set each of the top two sliders to +20. In what way(s) 
has the graph changed, and in what way(s) has it remained 
the same? 

16.4 With each of the top two sliders set at +20, move the 
bottom slider across its range of movement, from far left to 
far right. Describe how the graph responds to this movement. 

16.5 Set the sliders so that the following conditions will exist 
simultaneously: (a) Row 1 will have a greater effect (higher 
mean) than Row 2; (b) Column 2 will have a greater effect 
than Column 1; and (c) the strongest positive interaction 
effect will be from the combination of Row 1 and Column 2. 
(Note: There are a great many possibilities, but just select three 
slider positions of your choice that will cause all three of these 
conditions to be present at the same time.) 



Assignment 

As office manager, Bill Davis is responsible for making the 
decision as to which courier service will be given the exclu- 
sive contract. Using the data set stored in the file COURIER 
on the text CD, assist Bill in choosing among the three 
courier services and defending his choice to the partners of 
the firm by performing a statistical analysis of the data. 
Write a short report to the partners giving your final rec- 
ommendation. The case questions will assist you in your 
analysis of the data. Use important details from your anal- 
ysis of the data to support your recommendation. 



Data Description 

For each delivery, Bill Davis' data set contains the courier 
used, the pickup time, the delivery time, the mileage of the 
delivery, and the cost of the delivery. The first few entries in 
the database are shown here. The entire database is in the 
file COURIER on the text CD, and it contains information on 
182 courier deliveries. 



Courier Pickup Time Delivery Time Mileage Cost 

1 13 14 7 16.55 
3 20 51 20 26.50 

2 22 33 12 19.00 

3 11 47 19 25.60 
3 17 18 8 15.70 



(continued) 
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The variables are defined as follows: 



Courier: 



1 = DFW Express 

2 = Carborne Carrier 

3 = Metro Delivery 

Pickup Time: Time in minutes from when the 
order is phoned in until a courier 
agent arrives 

Delivery Time: Time in minutes that it takes for the 
documents to be delivered to the 
destination from the firm 

Mileage: Distance in miles from the law firm 

to the destination 

Cost: Charge for the delivery. Each of the 

courier services charges a flat fee 
plus a mileage charge. These charges 
vary from courier to courier. 

1. Based on the sample data, is it possible that the popula- 
tion mean delivery times for all three couriers might be 
the same? Using a level of significance of your choice, 
perform an appropriate ANOVA statistical analysis 
and attach any supporting computer output. What is the 



p-value for the test? What recommendation would you 
make to the company based on this analysis? 

2. Based on the sample data, is it possible that the popu- 
lation mean delivery distances for all three couriers 
might be the same? Using a level of significance of 
your choice, perform an appropriate ANOVA statisti- 
cal analysis and attach any supporting computer out- 
put. What is the p-value for the test? Has the result of 
this test altered in any way the recommendation you 
made in response to question 1 ? 

3. Create a new variable called "Average Speed" by di- 
viding "Mileage" by "Delivery Time." For each trip, 
this will be the average speed during the trip. Based on 
the sample data, is it possible that the population 
mean average speeds for all three couriers might be the 
same? Using a level of significance of your choice, per- 
form an appropriate ANOVA statistical analysis and 
attach any supporting computer output. What is the p- 
value for the test? On the basis of this analysis, what 
recommendation would you make to the firm? 

4. Overall, considering the analyses performed above, do 
you feel that the evidence is sufficiently conclusive that 
the firm should switch from their present courier? Pre- 
sent a brief written recommendation to the firm. 



Proportions Testing and the Restroom Police 



On the basis of observations in selected restrooms across the nation, researchers have 
discovered that many Americans don't wash their hands after using the facilities. 
Among their findings, obtained by peering from stalls or pretending to be combing 
their hair, are that 64% of 814 baseball fans at an Atlanta Braves home game, 
71 % of 1251 people observed in a New Orleans casino, and 69% of 888 persons 
at San Francisco's Golden Gate Park washed their hands. 

Could these three percentages — 64%, 71 %, and 69% — differ from each other 
just by chance (sampling error), or can we conclude that the populations involved 
really don't exercise the same level of restroom fastidiousness? Actually, there's only 
a 0.004 probability (the p-value) that such great differences among expected 
the sample percentages could have occurred 0 ften diff er ^ffjathan's 

simply by chance. In Section 1 3.5 of this observed da ^ of the 2°°^ waS the 

chapter, you'll learn about the test we used data- T « Dog Eating ^ hQ ate more 



Note: The study was sponsored by the 
American Society for Microbiology, a group that 
feels very strongly about washing our hands to 
reduce the spread of those little micro-critters that 
give us flu, colds, and other ailments. Another note: 
It's not at all unusual for organizations to package 
a rather straightforward message in the context of 
reporting the results of an interesting survey or other 
research effort to which newspaper editors are more 
likely to be attracted. 



to get this result, and you'll be able to verify it 
for yourself. 




Source: Anita Manning, "Caught Dirty-Handed: Many Fail to Wash 
When They Should," USA Today, September 17, 1996, p. 10D. 
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Explain the nature of the chi-square distribution. 

List and understand the general procedures involved in chi-square testing. 

Apply the chi-square distribution in testing whether a sample could have come 
from a population having a specified probability distribution. 

Apply the chi-square distribution in testing whether two nominal-scale 
(category) variables could be independent. 

Apply the chi-square distribution in comparing the proportions of two or more 
independent samples. 

Determine the confidence interval for, and carry out hypothesis tests for a 
population variance. 



13.1 INTRODUCTION 



The chapter introduces a new probability distribution called the chi-square distri- 
bution. Using this distribution, along with sample data and frequency counts, we 
will be able to examine ( 1 ) whether a sample could have come from a given type 
of population distribution, (2) whether two nominal variables could be indepen- 
dent of each other, and (3) whether two or more independent samples could have 
the same population proportion. The chi-square distribution will also be used in 
constructing confidence intervals and carrying out hypothesis tests regarding the 
value of a population variance. As in previous chapters, relevant computer out- 
puts for examples will be presented and interpreted. 



1 3.2 BASIC CONCEPTS IN CHI-SQUARE TESTING 



The Chi-Square Distribution 

Like the t and F distributions covered previously, the chi-square (\ 2 , "chi" rhymes 
with "sky") distribution is a sampling distribution. Specifically, when samples of 
size n are drawn from a normal population, the chi-square distribution is the sam- 
pling distribution of x 1 = ( n ~ l)s 2 /o" 2 - 

The chi-square distribution is a family of probability distributions. As with 
the t distribution, the specific member depends on a value for the number of de- 
grees of freedom. (Recall that the F distribution is dependent on two separate df 
values, one associated with the numerator of F, the other with the denominator. ) 
The chi-square distribution is skewed positively, but as (^/increases, it approaches 
the shape of the normal distribution. In Figure 13.1, which shows the chi-square 
curve for selected df values, note the extreme positive skewness that is present 
when df takes on smaller values. Since (n — 1), s 2 , and a 2 will always be s 0, x 2 
will always be s 0 as well. By using Seeing Statistics Applet 17, at the end of the 
chapter, you can observe how the chi-square distribution shape and probabilities 
change in response to changes in df. 

As with the t distribution, it isn't practical to include a table of areas beneath 
the curve for every possible member of this family of probability distributions. The 
X 1 table in Appendix A lists the values of x 1 corresponding to selected right-tail 
areas for various values of df. A portion of this table is shown as Table 13.1. In 
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The chi-square distribution is really a family of distributions, and the specific member depends on the 
number of degrees of freedom. For small values of df, there is extreme positive skewness, but the 
curve approaches the normal distribution as df becomes larger. 



TABLE 13.1 



A portion of the chi-square 
table from Appendix A. 
Like the f distribution, the 
chi-square distribution is 
really a family of 
continuous distributions, 
but the shape of the 
chi-square distribution 
is positively skewed. 



df 


0.99 


0.975 


0.95 


0.90 


0.10 


0.05 


0.025 


0.01 


1 


0.00016 


0.00098 


0.0039 


0.0158 


2.706 


3.841 


5.024 


6.635 


2 


0.0201 


0.0506 


0.103 


0.211 


4.605 


5.991 


7.378 


9.210 


3 


0.115 


0.216 


0.352 


0.584 


6.251 


7.815 


9.348 


11.345 


4 


0.297 


0.484 


0.711 


1.064 


7.779 


9.488 


1 1.143 


13.277 


5 


0.554 


0.831 


1.145 


1.610 


9.236 


11.070 


12.833 


15.086 


6 


0.872 


1.237 


1.635 


2.204 


10.645 


12.592 


14.449 


16.812 


7 


1.239 


1.690 


2.167 


2.833 


12.017 


14.067 


16.013 


18.475 


8 


1.646 


2.180 


2.733 


3.490 


13.362 


15.507 


17.535 


20.090 


9 


2.088 


2.700 


3.325 


4.168 


14.684 


16.919 


19.023 


21.666 


10 


2.558 


3.247 


3.940 


4.865 


15.987 


18.307 


20.483 


23.209 


11 


3.053 


3.816 


4.575 


5.578 


17.275 


19.675 


21.920 


24.725 


12 


3.571 


4.404 


5.226 


6.304 


18.549 


21.026 


23.337 


26.217 


13 


4.107 


5.009 


5.892 


7.042 


19.812 


22.362 


24.736 


27.688 


14 


4.660 


5.629 


6.571 


7.790 


21.064 


23.685 


26.119 


29.141 


15 


5.229 


6.262 


7.261 


8.547 


22.307 


24.996 


27.488 


30.578 




using the x 2 table, we need only refer to the upper-tail area of interest, then use the 
appropriate df row in identifying the x 2 value. For example, if the right-tail area 
of interest is 0.05 and df = 10, the corresponding x 2 value will be at the 
intersection of the a = 0.05 column and the df = 10 row, x 2 = 18.307. Although 
most of our tests will be right-tail, those in Section 13.6 can be two-tail as well. 
This is why our chi-square table also includes columns for right-tail areas of 0.99, 
0.975, 0.95, and 0.90. 
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There are two general procedures for the chi-square tests of this chapter. As Fig- 
ure 13.2 shows, the tests in the next three sections all involve a calculated x 2 ™at 
reflects the extent to which a table of observed frequencies differs from one con- 
structed under the assumption that H Q is true. The tests in these sections are right- 
tail; i.e., H Q is rejected whenever the calculated x 2 is greater than the critical value. In 
turn, the critical value for the chi-square statistic depends on the level of significance 
selected and on the number of degrees of freedom associated with the test. 



FIGURE 13.2 



General procedures for 
chi-square tests discussed 
in the chapter 



Formulate null hypothesis, H 0 







Specify the level of significance (a) for the test. 



Determine the number of degrees of 
freedom (df ) associated with the test. 



For goodness of fit, testing 
for independence of variables, 
and comparing sample proportions 
(Sections 13.3, 13.4, and 13.5) 






Construct table of observed 
frequencies, 0». 






Assuming Hq to be true, construct 
table of expected frequencies, Ey. 







Calculate the test statistic: 

r k (0..-E.S 2 

Calculated X 2 = I £ \ " 

r = Number of rows 

k = Number of columns in the table 



Is calculated x 2 > critical 

J 



No 



Yes 



J 



Do not reject H 0 Reject H 0 



For tests involving the 
population variance 
(Section 13.6) 



I 



Calculate the test statistic: 

Calculated y 2 = — — 

n = Sample size 
s = Sample standard deviation 
<7q = Hypothesized value for the 
population variance, a 2 



■ 



Is the calculated \ 2 within a 
rejection region for the test? 



No 



Yes 



Do not reject Hq Reject H 0 
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Sections 13.3, 13.4, and 13.5 each present a specific variant of the general 
formula shown in Figure 13.2. The frequency tables in these sections will differ 
in the number of rows they contain, with the tests in Sections 13.3, 13.4, and 
13.5 having 1, > 2, and 2 rows, respectively. 

In using the chi-square distribution to construct confidence intervals and carry 
out hypothesis tests involving a population variance, the topics of Section 13.6, 
there are no observed versus expected tables to be compared. In this application, 
the chi-square statistic is used in a manner that is more analogous to our use of 
the £-statistic in Chapters 9 and 10, where we constructed confidence intervals 
and carried out tests involving a population mean. 






13.1 For what kinds of tests can chi-square analysis be 
used? 

13.2 Is the chi-square distribution a continuous distribu- 
tion or a discrete distribution? Explain. 

13.3 In what way are the chi-square and normal distribu- 
tions related? 

13.4 Sketch the approximate shape of the chi-square 
curve when df = 2 and when df = 100. 

13.5 Why can the chi-square statistic never be negative? 

13.6 For df = 5 and the constant A, identify the value 
of A such that 

a. P(x 2 > A) = 0.90 b. P(x 2 > A) = 0.10 

c. P(x 2 > A) = 0.95 d. P(x 2 > A) = 0.05 

e. Plx 2 < A) = 0.975 f. Pix 2 < A) = 0.025 



13.7 For df = 

A such that 
a. P( X 2 >A) 
c. P(x 2 >A) 
e. P(x 2 <A) 



8 and the constant A, identify the value of 

= 0.90 b. P(x 2 >A) = 0A0 

= 0.95 d. P(x 2 > A) = 0.05 

= 0.975 f. pQ 2 - <A) = 0.025 



13.8 For df = 10 and the constants A and B, identify the 
values of A and B such that the tail areas are equal and 

a. P(A <x 2 <B) = 0.80 

b. P(A < x 2 < B) = 0.90 

c. P(A<x 2 <B) = 0.95 

d. P(A < x 2 < B) = 0.98 

13.9 For df = 15 and the constants A and B, identify the 
values of A and B such that the tail areas are equal and 

a. P(A < x 2 < B) = 0.80 

b. P(A < x 2 < B) = 0.90 

c. P(A <x 2 < B) = 0.95 

d. P(A <x 2 < B) = 0.98 



TESTS FOR GOODNESS OF FIT AND NORMALITY 13.3 



Purpose 

In goodness-of-fit tests, chi-square analysis is applied for the purpose of examining 
whether sample data could have been drawn from a population having a specified 
probability distribution. For example, we may wish to determine from sample data 
(1) whether a random number table differs significantly from its hypothesized 
discrete uniform distribution, (2) whether a set of data could have come from a pop- 
ulation having a normal distribution, or (3) whether the distribution of contributions 
to a local public television station differs significantly from those for such stations 
across the nation. With regard to item 2, we have already discussed a number of 
techniques which assume that sample data have come from a population that is ei- 
ther normally distributed or approximately so. This section provides an Excel-based, 
chi-square goodness-of-fit test designed specifically to examine whether sample data 
could have come from a normally distributed population. 
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Procedure 

The procedure first requires that we arrange the sample data into categories that are 
mutually exclusive and exhaustive. We then count the number of data values in each 
of these categories and use these counts in constructing the table of observed fre- 
quencies. In the context of the general procedure shown in Figure 13.2, this table of 
observed frequencies will have one row and k columns. Since there is just one row, 
we will simply refer to each observed frequency as O-, with / = 1 through k. 

The null and alternative hypotheses are the key to the next step. Again, these 

are 

H 0 ; The sample is from the specified population. 
Hj: The sample is not from the specified population. 

Under the assumption that the null hypothesis is true, we construct a table of 
expected frequencies (£■, with / = 1 through k) that are based on the probability 
distribution from which the sample is assumed to have been drawn. The test statis- 
tic, x 2 -> is a measure of the extent to which these tables differ. From an intuitive 
standpoint, a large amount of discrepancy between the frequencies that are ob- 
served and those that are expected tends to cause us to reject the null hypothesis. 
The test statistic is calculated as follows: 



Chi-square test for goodness-of-fit: 






k (O, - E ; ) 2 
Calculated x 2 = 2 






where k 


= number of categories, 


;'=1 E j 




or cells in the table 




°i 


= observed frequency 




in cell ;' 




E , 


= expected frequency 




in cell / 



and df = k — 1 — m, with m = the number of population parameters (e.g., jx or a) 
that must be estimated from sample data in order to carry out the goodness-of- 
fit test. 

The test is upper-tail, and the critical value of x 1 will be that associated with 
the level of significance (a) and degrees of freedom (df) for the test. Regarding the 
determination of df, the (k — 1) portion of the df expression is required because 
we have already lost one degree of freedom by arranging the frequencies into cat- 
egories. Since we know the total number of observations in all cells (the sample 
size), we need only know the contents of (k — 1) cells to determine the count in the 
kth cell. We must also further reduce df by 1 for each population parameter that 
has been estimated in order to carry out the test. Each such estimation represents 
additional help in our construction of the table of expected frequencies, and the 
purpose of the reductions beyond (k — 1) is to correct for this assistance. This 
aspect of the df determination will be further discussed in the context of the 
examples that follow. 

As a rule of thumb, it is important that the total sample size and selection of 
categories be such that each expected frequency (£■) is at least 5. This is because the 
chi-square distribution is continuous, while the counts on which the test statistic is 
based are discrete, and the approximation will be unsatisfactory whenever one or 
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more of the expected frequencies is very small. There are two ways to avoid this 
problem: (1) use an overall sample size that is large enough that each expected fre- 
quency will be at least 5, or (2) combine adjacent cells so that the result will be a 
cell in which the expected frequency is at least 5. 





example 














Poisson Distribution 



Automobiles leaving the paint department of an assembly plant are subjected to a 
detailed examination of all exterior painted surfaces. For the most recent 380 au- 
tomobiles produced, the number of blemishes per car is summarized here — for 
example, 242 of the automobiles had 0 blemishes, 94 had 1 blemish, and so on. 
The underlying data are in file CX13BLEM. 



Blemishes per car: 
Number of cars: 



0 



3 



242 94 



38 



Using the 0.05 level of significance, could these data have been drawn from a pop- 
ulation that is Poisson distributed? 



SOLUTION 

Formulate the Null and Alternative Hypotheses 

The null hypothesis is H Q : the sample was drawn from a population that is 
Poisson distributed. The alternative hypothesis is that it was not drawn from such 
a population. 



Specify the Level of Significance (a) for the Test 

The a = 0.05 level has been specified. 



Construct the Table of Observed Frequencies, O- 

The observed frequencies are as listed earlier and are repeated below. 

Blemishes per car: 0 1 2 3 4 

Observed frequency, O : 



242 94 38 



380 



Assuming the Null Hypothesis to Be True, Construct the Table of Expected 
Frequencies, E } 

To construct the table of expected frequencies, we must first use the mean of the sam- 
ple data to estimate the mean of the Poisson population that is being hypothesized. 
The sample mean is simply the total number of blemishes divided by the total num- 
ber of cars, or: 

_ _ 0(242) + 1(94) + 2(38) + 3(4) + 4(2) blemishes 
X ~ 380 cars 

and x = 190/380 = 0.5 blemishes per car 

Using x = 0.5 blemishes per car as our estimate of the true population mean (A), 
we now refer to the "A = 0.5" portions of the individual and cumulative tables of 
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Poisson probabilities in Appendix A. For a Poisson distribution with A = 0.5, and 
x = the number of blemishes per car: 

x = number Expected number 

of blemishes P(x) • n of cars with x blemishes 



0 0.6065 ■ 380 = 230.470 

1 0.3033-380 = 1 15.254 

2 0.0758 ■ 380 = 28.804 

3 0.0126-380 = 4.788 

4 0.0016-380 = 0.608 

5 or more 0.0002 ■ 380 = 0.076 



1 .0000 380.000 



The final three categories (x = 3, x = 4, and x ^ 5 blemishes) have expected fre- 
quencies that are less than 5.0. To satisfy the requirement that all expected frequen- 
cies be at least 5.0, we will combine these into a new category called "3 or more 
blemishes," and it will have an expected frequency of 4.788 + 0.608 + 0.076, or 
5.472 cars. After this merger, the expected frequencies will be as shown below. 

Blemishes per car: 0 12 >3 

Expected frequency, E-: 



230.470 115.254 28.804 5.472 



380.000 



The observed frequencies must be based on the same categories used in the ex- 
pected frequencies, so we need to express the observed frequencies in terms of 
the same categories (0, 1, 2, and &3) used above. The observed frequencies can 
now be as shown below. 



Blemishes per car: 0 12 >3 

Observed frequency, O : 



242 94 38 



380 



Determine the Calculated Value of the \ 2 Test Statistic 

Over the k = 4 cells, the calculated chi-square is 7.483, computed as follows: 



* (Of ~ Ej) 1 
Calculated x 2 = 2 p 

/ = ! E j 



(242 - 230.470) 2 (94 - 115.254) 2 
230.470 " 115.254 

(38 - 28.804) 2 (6 - 5.472) 2 
28.804 5.472 

7.483 



Identify the Critical Value of the Chi-Square Statistic 

The number of degrees of freedom is df = k — 1 — m, with k = 4 categories and 
m = 1 parameter (the mean) being estimated from sample data, and df = 4 — 1 — 1, 
or 2. For the 0.05 level of significance and df= 2, the critical chi-square value is 5.991. 

Regarding tests involving the Poisson distribution, in which the mean and the vari- 
ance are equal, df = k — 1 — m will have either m = 0 or m = 1 parameters being 
estimated. If A is given in the null hypothesis (e.g., H Q : "The sample was drawn 
from a Poisson distribution having A = 1.8"), m will be 0. If it is necessary to esti- 
mate A based on your sample data (e.g., H Q : "The sample was drawn from a popu- 
lation that is Poisson distributed"), m will be 1. Just because A and a 1 are equal in 



Chapter 13: Chi-Square Applications 



473 



Hq. Sample was drawn from a 

Poisson population. 
H j : Sample was not drawn from a 
Poisson population. 




FIGURE 13.3 



Given the frequency 
distribution for the number 
of paint blemishes on 
380 automobiles, the null 
hypothesis is that the 
sample actually was drawn 
from a Poisson population. 
Using the chi-square 
goodness-of-fit test, we 
are able to reject the null 
hypothesis at the 0.05 level 
of significance. 



^ 2 = 5.991 

Test statistic: 
X 1 = 7.483 



this distribution doesn't mean that they must both be counted in the df expression 
if A has to be estimated based on the sample data. 

Compare the Calculated and Critical Values of Chi-Square 

The calculated chi-square (7.483) exceeds the critical value (5.991), and the null 
hypothesis can be rejected at the 0.05 level. At the 0.05 level of significance, we con- 
clude that the sample was not drawn from a population that is Poisson distributed. 

The calculated chi-square also exceeds 7.378, the critical value for the 0.025 
level of significance. However, it does not exceed 9.210, the critical value for the 
0.01 level. Based on the chi-square table, the most accurate statement we can 
make about the exact p-value for the test is that it is somewhere between 0.025 
and 0.01. The test is summarized in Figure 13.3. 



Computer Solutions 13.1 shows how we can use Excel to carry out goodness-of- 
fit tests like the one summarized in Figure 13.3. Keep in mind that Excel's CHITEST 
function should be used only if m = 0 population parameters have been estimated 
on the basis of sample data, so it would not be applicable to this example. 



computer solutions 13.1 



Chi-Square Test for Goodness of Fit 
EXCEL 



■ 





A 


B 


c 


D 


E 


1 


Chi-Square Goodness-of-Fit Test 


no. of cells, k = 


4 


2 


Cell Frequencies: 




no. of parameters estimated, m = 


1 


3 


Observed (Oj): 


Expected (Ej): 


(Oj-Ej)A2/Ej: 


df = k - 1 - m = 


2 


4 


242 


230.470 


0.577 


calculated chi-square = 


7.483 


5 


94 


115.254 


3.919 


p-value = 


0.024 


6 


38 


28.804 


2.936 






7 


6 


5.472 


0.051 







[continued) 
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Excel Chi-Square Test for Goodness of Fit 

1 . Given the observed and expected frequencies, as in the paint-blemish example in this section: Open Excel worksheet 
template TMCHIFIT.XLS, provided with the text. The directions are within the template. There are 4 categories in this 
analysis, so we must reduce the number of categories in the template from 1 0 to 4. Click on the button for Excel row 
7 and drag to select rows 7 through 12. Right-click on any one of the selected row buttons, then click Delete. Enter 
into E2 the number of parameters being estimated based on sample data — in this case, m = 1 because we used sam- 
ple data to estimate the Poisson mean. 

2. Enter the observed frequencies, starting with cell A4. Enter the expected frequencies, starting with cell B4. Click at the 
lower right corner of C4 and drag downward to autofill. The calculated chi-square and the p-value will be in cells E4 
and E5, as shown in the accompanying printout. 

If we have only raw data and the data values are integers, as in this example, we can use Excel's COUNTIF func- 
tion to count how many times each value appears. For example, using CX13BLEM.XLS, the file containing 380 values 
consisting of 0, 1, 2, 3, or 4: Click on any empty cell. Click f x , then select Statistical and COUNTIF. Click OK. Enter 
A2:A381 into the Range box. Enter 0 into the Criteria box. Click OK. The number of 0 values (242) will appear. Re- 
peat the preceding for each of the other codes (1 , 2, 3, and 4). 

Note: Excel offers a function called CHITESTthat applies only when m = 0 parameters are being estimated from sample data. 
If this had been the case here, we could have applied CHITEST as follows: With the actual and expected values as shown 
above, click on an empty cell. Click f x . Select Statistical and CHITEST. Click OK. Enter A4:A7 into the Actual_range box. 
Enter B4:B7 into the Expected_range box. The p-value is provided. 

(Minitab does not provide this good ness-of -fit test.) 



Example 



Test for Normality 

A researcher in the department of education has collected data consisting of a 
simple random sample of 300 SAT scores of high school seniors in her state who 
took the college entrance examination last year. For this sample, x = 945.04, 
s = 142.61, and the underlying data are in file CX13SAT. Based on these sample 
data, use the 0.01 level of significance in determining whether the sample could 
have been drawn from a population in which the scores are normally distributed. 

SOLUTION 

Although this test could be performed with tables and pocket calculator, we will 
use the procedure described in Computer Solutions 13.2, and we will discuss it in 
terms of the printout shown there. The null hypothesis is that the sample came 
from a normal distribution, and the test will be conducted using the 0.01 level of 
significance. Whether the test is carried out by hand calculations or using the 
computer, the key components will be those shown in Computer Solutions 13.2: 

1. The expected frequencies can be defined in terms of area under the normal 
curve. For example, as shown in cells B8 and C8, 2.28% of the observations 
(or 300[0.0228] = 6.825 observations) would be expected to be at least two 
standard deviations below the mean. 

2. If we were to arrange the data in order of size, we would find that 8 observa- 
tions were actually at least two standard deviations below the mean. 

3. Using the same procedure as in steps 1 and 2, we could obtain the probabilities, 
expected frequencies, and actual frequencies for the other five groups (intervals). 
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The expected frequencies are shown in cells C8:C13 and the observed frequen- 
cies are shown in cells D8:D13. 

4. We can use the same formula as in Figure 13.2 and the previous example to ob- 
tain the calculated value of the chi-square statistic, or calculated chi-square = 
15.24. This could be calculated as 

UO-E) 2 (8 -6.825) 2 (28-40.772) 2 (11 -6.825) 2 _ „ 

V, — — = + + ••• + = 15.24 

p x E y 6.825 40.772 6.825 

5. We have used the sample mean and standard deviation to estimate their popula- 
tion counterparts, so the value of df is k — 1 — m, or 6 — 1— 2 = 3. The critical 
value of chi-square for the 0.01 level of significance and df = 3 is 11.345. 

6. The calculated chi-square (15.24) exceeds the critical value (11.345) and, at 
the 0.01 level of significance, we reject the null hypothesis that these data were 
drawn from a normal distribution. At this level of significance, we conclude 
that the disparity between the expected and observed frequencies is simply too 
great to have happened by chance. The exact p-value for the test is shown in 
cell B17as 0.0016. 
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Chi-Square Goodness-of-Fit Test for Normality 



EXCEL 





A B 


C 


D 


1 


Chi-Squared Test of Normality 






2 




SAT 






3 


Mean 


945.04 






4 


Standard deviation 


142.61 






5 


Observations 


300 






6 










7 


Intervals 


Probability 


Expected 


Observed 


8 


(z <= -2) 


0.0228 


6.825 


8 


9 


(-2<z<= -1) 


0.1359 


40.772 


28 


10 


(-1 < z <= 0) 


0.3413 


102.404 


127 


11 


(0<z<=1) 


0.3413 


102.404 


94 


12 


(1 < z <= 2) 


0.1359 


40.772 


32 


13 


(z>2) 


0.0228 


6.825 


11 


14 










15 


chi-squared Stat 


15.24 






16 


df 


3 






17 


p-value 


0.0016 






18 


chi-squared Critical 


1 1 .345 







Excel Chi-Square Goodness-of-Fit Test for Normality (requires n > 32) 

1 . For example, testing the data in file CX13SAT.XLS, which contains 300 Scholastic Aptitude Test (SAT) scores with label 
and data values in A1 :A301 : Click Tools. Click Data Analysis Plus. Click Chi-Squared Test of Normality. Click OK. 

2. Enter A1:A301 into the Input Range box. Click Labels. Enter 0.01 into the Alpha box. Click OK. The observed and 
expected frequencies, the chi-square statistic, and the p-value will be as shown above. The p-value is quite small, and 
we can conclude that the sample data did not come from a normally distributed population. 

(Minitab does not provide the chi-square good ness-of -fit test for normality. Other Minitab tests for normality are presented 
in Chapters 14 and 15.) 
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13.10 In carrying out a chi-square goodness-of-fit test, 
what are the "k" and "m" terms in the "df= k — 1 — m" 
expression and why is each term present? 

13.11 If a table of expected frequencies differs very little 
from the frequencies that were observed, would the calcu- 
lated chi-square be large or small? Why? 

13.12 For the null hypothesis, H Q : "The data were drawn 
from a Poisson distribution with A = 7.0," how many de- 
grees of freedom would be associated with the test if there 
are 8 cells in the table of frequencies to be analyzed? 

13.13 For the null hypothesis, H Q : "The data were drawn 
from a uniform continuous distribution," how many 
degrees of freedom would be associated with the test 

if there are 5 cells in the table of frequencies to be analyzed? 

13.14 Sample data have been collected, and the null 
hypothesis to be tested is, H Q : "The sample was drawn 
from a normal population." If the analysis is based on a 
categorization that includes 5 cells: 

a. How many degrees of freedom will be associated 
with the test? 

b. For a test at the 0.05 level, what is the critical value 
of chi-square? 

c. If the calculated value of the chi-square statistic is 
8.13, what conclusion would be reached regarding 
the null hypothesis? 

13.15 Sample data have been collected, and the null hy- 
pothesis to be tested is H 0 : "The sample was drawn from 
a normal population in which /jl = 130." If the analysis is 
based on a categorization that includes 8 cells: 

a. How many degrees of freedom will be associated with 
the test? 

b. For a test at the 0.05 level, what is the critical value 
of chi-square? 

c. If the calculated value of the chi-square statistic is 
11.25, what conclusion would be reached regarding 
the null hypothesis? 

13.16 According to the Bureau of the Census, 19.0% of 
the U.S. population lives in the Northeast, 22.9% in 
the Midwest, 35.6% in the South, and 22.5% in the 
West. In a random sample of 200 recent calls to a 
national 800-number hotline, 39 of the calls were from 
the Northeast, 55 from the Midwest, 60 from the South, 
and 46 from the West. At the 0.05 level, can we 
conclude that the geographical distribution of hotline 
callers could be the same as the U.S. population 
distribution? source: Bureau of the Census, Statistical Abstract 

of the United States 2006, p. 28. 

13.17 From the one-day work absences during the past 
year, the personnel director for a large firm has identified 



the day of the week for a random sample of 150 of the ab- 
sences. Given the following observed frequencies, and for 
a = 0.01, can the director conclude that one-day absences 
during the various days of the week are not equally likely? 



Monday Tuesday Wednesday Thursday Friday 



Absences 



42 



24 



27 39 150 



13.18 It has been reported that 10.3% of U.S. households 
do not own a vehicle, with 34.2% owning 1 vehicle, 
38.4% owning 2 vehicles, and 17.1% owning 3 or more 
vehicles. The data for a random sample of 100 households 
in a resort community are summarized in the frequency 
distribution below. At the 0.05 level of significance, can 
we reject the possibility that the vehicle-ownership 
distribution in this community differs from that of the 
nation as a whole? source: Bureau of the Census, Statistical 
Abstract of the United States 2006, p. 864. 



Number of 
Vehicles Owned 

0 

1 

2 

3 or more 



Number of 
Households 

20 
35 
23 
22 
100 



13.19 For the data provided in Exercise 13.18, use the 
0.05 level in testing whether the sample could have been 
drawn from a Poisson population with A = 1.5. 

13.20 Employees in a production department assemble 
the products, then submit them in batches of 25 for qual- 
ity inspection. The number of defectives in each batch 

of a random sample of 50 recently produced batches is 
shown below. At the 0.01 level, test whether the popula- 
tion of x = the number of defectives in a batch could be 
Poisson distributed with A = 2.0. 



x = Number 
of Defectives 



Number of 
Batches 



5 
15 
14 
10 

4 

50 



13.21 Given the information in Exercise 13.20, use the 
0.05 level in testing whether the number of defectives in 
each batch of n = 25 could be binomially distributed 
with tt = 0.1. 



Chapter 13: Chi-Square Applications 



477 



13.22 Approximately 13.3% of U.S. drivers are younger 
than age 25, with 39.2% in the 25-44 age group, and 
47.5% in the 45-and-over category. For a random sample 
of 200 fatal accidents in her state, a safety expert finds 
that 42 drivers were under 25 years old, 80 were 
25-44 years old, and 78 were at least 45 years old. 

At the 0.05 level, test whether the age distribution of 
drivers involved in fatal accidents within the state could 
be the same as the age distribution of all U.S. drivers. 
source: The World Almanac and Book of Facts 2006, p. 117. 

/ data set/ Note: Exercises 13.23-13.26 require a 
computer and statistical software. 

13.23 For a random sample of 200 U.S. motorists, the 
mileages driven last year are in data file XR13023. Use 
the 0.01 level of significance in determining whether 



the mileages driven by the population of U.S. motorists 
could be normally distributed. 

13.24 For the previous 300 plywood panels from a 
lumber company's production line, the data in file 
XR13024 list the number of surface defects per panel. 
Based on this information, use the 0.05 level of 
significance in examining whether the data could have 
come from a Poisson distribution. 

13.25 For Exercise 13.24, use the 0.05 level of 
significance in examining whether the data could have 
come from a normal distribution. 

13.26 The outstanding balances for 500 credit-card 
customers are listed in file XR13026. Using the 0.10 level of 
significance, examine whether the data could have come 
from a normal distribution. 
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Purpose 

When data involve two nominal-scale variables, with each variable having two or 
more categories, chi-square analysis can be used to test whether or not there 
might be some relationship between the variables. For example, a campaign man- 
ager might be interested in whether party affiliation could be related to educa- 
tional level, or an insurance underwriter may wish to determine whether different 
occupational groups tend to suffer different kinds of on-the-job injuries. The pur- 
pose of chi-square analysis in such applications is not to identify the exact nature 
of a relationship between nominal variables; the goal of this technique is simply 
to test whether or not the variables could be independent of each other. 

Procedure 

The starting point for the chi-square test of variable independence is the contin- 
gency table. Introduced in Chapter 2, this is a table where the rows represent cat- 
egories of one variable, the columns represent categories of another variable, and 
entries are the frequencies of occurrence for various row and column combina- 
tions. A contingency table has r rows and k columns, where r^2 and k^2. The 
null and alternative hypotheses are 

H Q ; The variables are independent of each other. 
H^. The variables are not independent of each other. 

As in the preceding section, there will be a table of observed frequencies and a 
table of expected frequencies, and the amount of disparity between these tables will 
be compared during the calculation of the x 1 test statistic. The observed frequencies 
will reflect a cross-classification for members of a single sample, and the table of 
expected frequencies will be constructed under the assumption that the null hypo- 
thesis is true. The test statistic, calculated as shown below, will be compared to a 
critical x 2 for a given level of significance using an appropriate value for df. 
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Chi-square test for the independence of two variables: 

number of rows in the 
contingency table 
number of columns in the 
contingency table 
observed frequency in 
row i, column /' 
expected frequency in 
row i, column /' 
(r-l)(k - 1) 

The rationale for df = (r — l)(k — 1) is that this quantity represents the num- 
ber of cell frequencies that are free to vary. Given that we know the sample size 
(the total number of observations in the table), we need only know the contents 
of (r — 1) rows and (k — 1) columns in order to completely fill in the cells within 
the r X k table. As in the preceding section, we should combine rows or columns 
whenever necessary, so that each £- ; value will be at least five. Again, this is be- 
cause we are using a continuous distribution in analyzing count data, which are 
discrete. 

The calculation of the E- values in the table of expected frequencies relies 
on (1) the assumption that H Q is true and the variables really are independent, and 
(2) the joint and marginal probability concepts presented in Chapter 5. 

Expected frequencies when testing for the independence of variables: 

The expected frequency in row i, column ; is 

E- = £>•(«,•) where p- = for all the observations, the 

proportion that are in column ; 
n i = the number of observations in row i 

In the above expression, p- is the marginal probability that a randomly 
selected sample member will be in column ;'. If the variables are really indepen- 
dent, this probability should be applicable regardless of which row is consid- 
ered, and the expected number of observations in the //' cell will be the product 
of pi times ».. 

/ * 







Examp) 





Test for Independence 



A traffic-safety researcher has observed 500 vehicles at a stop sign in a suburban 
neighborhood and recorded (1) the type of vehicle (car, sport utility vehicle [SUV], 
pickup truck) and (2) driver behavior at the stop sign (complete stop, near stop, 
"ran" the stop sign). Her results are summarized in part A of Table 13.2. At the 
0.05 level of significance, could there be some relationship between driver behavior 
and the type of vehicle being driven? The underlying data are in file CX13STOP. 



r k (0 ;; - £ ;/ ) 2 
Calculated x 2 = 2 2 p where r : 

i=l/=l E ij 

k ■ 

E v 
df 
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A. Observed Frequencies, 0 7 



Behavior at Stop Sign 





Stopped 


Coasted 


Ran It 


Type of 


Car 


183 


107 


60 


Vehicle 


SUV 


54 


27 


19 




Pickup 


14 


20 


16 



251 



154 



95 



350 
100 
50 

500 



TABLE 13.2 



Observed and expected 
frequencies in testing the 
independence of two 
nominal-scale variables 



B. Expected Frequencies, F- 



Type of Car 
Vehicle SUV 

Pickup 



Behavior at Stop Sign 



Stopped 


Coasted 


Ran It 




175.7 


107.8 


66.5 


350.0 


50.2 


30.8 


19.0 


100.0 


25.1 


15.4 


9.5 


50.0 


251.0 


154.0 


95.0 


500.0 



251 

E ™ = 500 X 50 ' ° r251 



SOLUTION 

As part A of Table 13.2 shows, the table of observed frequencies reflects inter- 
sections of the various categories of the two variables. For example, 183 persons 
driving cars were seen to come to a complete stop at the sign. The contingency 
table has r = 3 rows and k = 3 columns. Let's proceed through the steps in the 
hypothesis-testing procedure. 

Formulate the Null Hypothesis 

The null hypothesis is H Q : driver behavior and type of vehicle are independent. 
The alternative hypothesis is that a relationship exists between the variables. 

Specify the Level of Significance (a) for the Test 

The a = 0.05 level has been specified. 

Construct the Table of Observed Frequencies, 0 Vj 

Part A of Table 13.2 shows the observed frequencies recorded by the researcher. 

Assuming the Null Hypothesis to Be True, Construct the Table of Expected 
Frequencies, B v 

Each value is calculated by the formula presented earlier. In the entire sample 
of 500 observations, 251 of the 500 were in column 1 (those who stopped at the 
sign). Under the assumption that the variables are independent, we would expect 
that the proportion of the persons in each row (vehicle type) who stopped at the 
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sign would also be 251/500. For rows 1 through 3, the expected frequencies in 
column 1 can be calculated as 

• Row i = 1, column ;' = 1: 

251 

E a = p i {n i ] = W 350) and £ n = 175,7 



Row i = 2, column / = 1: 



= = |^(100) and E 21 = 50.2 
• Row i = 3, column / = 1: 

251 

= p,(n,) = — (50) and E 31 = 25.1 

The expected frequencies within columns 2 and 3 are calculated using the 
same approach. For example, the expected frequency for the "car — ran it" cell 
(1st row, 3rd column) is (95/500) X 350, or 66.5. If the variables were really 
independent, and 95/500 = 19.0% of all drivers "ran" the stop sign, it would 
be expected that 19.0% of those driving each type of vehicle would have run 
through the stop sign. 

Determine the Calculated Value of the \ 2 Test Statistic 

The calculated chi-square is 12.431, computed as follows: 

r t (O i; - E tj ) 2 
Calculated x 1 = 22 p 

, = i, = i h ,j 

(183.0 - 175. 7) 2 + (107.0 - 107.8) 2 + (60.0 - 66.5) 2 



175.7 107.8 66.5 

(54.0 - 50.2) 2 (27.0 - 30.8) 2 (19.0 - 19.0) 2 
+ 50.2 + 30.8 + 19.0 

(14.0 - 25.1) 2 (20.0 - 15.4) 2 (16.0 - 9.5) 2 
25.1 15.4 9.5 

12.431 



Identify the Critical Value of the Chi-Square Statistic 

The table includes r = 3 rows and k = 3 columns, so the number of degrees of 
freedom for the test is df = (r - l)(k - 1), or (3 - 1)(3 - 1) = 4. For the 0.05 
level of significance and df = 4, the critical value of chi-square is 9.488. 

Compare the Calculated and Critical Values of Chi-Square 

The calculated chi-square (12.431) exceeds the critical value (9.488), so the null 
hypothesis can be rejected at the 0.05 level. Had the test been done at the 0.025 
level (critical chi-square = 11.143), it would have been rejected at that level as 
well. It would not have been rejected at the 0.01 level (critical chi-square = 
13.277). Based on the chi-square table, the most accurate statement we can make 
about the exact p-value for the test is that it is somewhere between 0.025 and 
0.01. From this analysis it appears that driving behavior at the stop sign is not in- 
dependent of the type of vehicle being driven. 
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Computer Solutions 13.3 shows how we can use Excel and Minitab to 
carry out a test for the independence of variables like the one in this section. The 
analysis can be based on either summary information or raw data. 



CG 


imputer solutions 1 


3.3 







Chi-Square Test for Independence of Variables 



EXCEL 





A 


B 


C 


D 


E 


1 


Contingency Table 








2 




Column 1 


Column 2 


Column 3 


TOTAL 


3 


Row 1 


183 


107 


60 


350 


4 


Row 2 


54 


27 


19 


100 


5 


Row 3 


14 


20 


16 


50 


6 


TOTAL 


251 


154 


95 


500 


7 












8 


chi-squared Stat 




12.431 




9 


df 






4 




10 


p-value 






0.014 




11 


chi-squared Critical 




9.488 





Excel Chi-Square Test for Independence of Variables, from Summary Information 

1 . Enter the observed frequencies into adjacent columns. For example, the 9 observed frequencies in Table 1 3.2 would be 
in cells A1 :C3. Click Tools. Click Data Analysis Plus. Click Contingency Table. Click OK. 

2. Enter A1:C3 into the Input Range box. Do not select Labels. Enter 0.05 into the Alpha box. Click OK. The observed 
frequencies, along with the chi-square statistic and the p-value, will be as shown above. 

Note: As an alternative, you can use Excel worksheet template TMCHIVAR.XLS, provided with the text. The procedure is 
described within the template. 

Excel Chi-Square Test for Independence of Variables, from Raw Data 

1 . The underlying data for Table 13.2 are in file CX13STOP.XLS, with vehicle codes in column A (1 = car, 2 = SUV, and 3 = 
pickup) and behavior codes in column B (1 = stopped, 2 = coasted, and 3 = ran it). The labels are in cells A1 and B1 . 

2. Click Tools. Click Data Analysis Plus. Click Contingency Table (Raw Data). Click OK. 

3. Enter A1:B501 into the Input Range box. Click Labels. Enter 0.05 into the Alpha box. Click OK. The observed 
frequencies, along with the chi-square statistic and the p-value, are provided. 



MINITAB 

Chi-Square Test: CI, C2, C3 

Expected counts are printed below observed counts 

Chi-Square contributions are printed below expected counts 

CI C2 C3 Total 

1 183 107 60 350 

175.70 107.80 66.50 
0.303 0.006 0.635 



(continued) 
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54 27 19 100 

50.20 30.80 19.00 
0.288 0.469 0.000 



3 14 20 16 50 

25.10 15.40 9.50 

4.909 1.374 4.447 

Total 251 154 95 500 

Chi-Sq = 12.431, DF = 4, P-Value = 0.014 



Minitab Chi-Square Test for Independence of Variables, from Summary Information 

1. Enter the 9 observed frequencies into adjacent columns, in the same layout as in Table 13.2 — e.g., the C1 column 
will consist of 183, 54, and 14. Click Stat. Select Tables. Click Chi-Square Test (Table in Worksheet). 

2. Enter C1-C3 into the Columns containing the table box. Click OK. The observed frequencies, expected frequencies, 
chi-square statistic, and p-value will be as shown above. 



Minitab Chi-Square Test for Independence of Variables, from Raw Data 

1 . The underlying data for Table 13.2 are in file CX13STOP.MTW, with vehicle codes in column C1 (1 = car, 2 = SUV, and 
3 = pickup) and behavior codes in column C2 (1 = stopped, 2 = coasted, and 3 = ran it). 

2. Click Stat. Select Tables. Click Cross Tabulation and Chi-Square. 

3. In the Categorical variables menu, enter C1 into the For rows box and C2 into the For columns box. In the Display 
menu, select Counts. 

4. Click Chi-Square. In the Display menu, select Chi-Square analysis and Expected cell counts. Click OK. Click OK. 





13.27 In conducting a chi-square test, why is it 
advisable that each expected frequency be at least 5.0? 
If the expected frequency in a cell happens to be less 
than 5.0, what should be done in order to carry out the 
analysis? 

13.28 In carrying out a chi-square test for the independence 
of variables, what is the procedure for determining the 
number of degrees of freedom to be used in the test? 

13.29 For a contingency table with r rows and k 
columns, determine the df for the test if 

a. r = 3, k = 4 b. r = 2, k = 3 

c. r = 4, k = 5 d. r = 5,k = 3 

e. r = 3, k = 7 f. r = 3, k = 3 

13.30 In testing the independence of two variables 
described in a contingency table, determine the critical 
value of chi-square if the test is to be conducted at the 

a. a = 0.05 level and df = 3 

b. a = 0.01 level and df = 5 



c. a = 0.10 level and df = 2 

d. a = 0.025 level and df=4 

13.31 In testing the independence of two variables 
described in a contingency table, determine the critical 
value of chi-square if the test is to be conducted at the 

a. a = 0.025 level and df = 5 

b. a = 0.05 level and df = 8 

c. a = 0.01 level and df = 6 

d. a = 0.10 level and df = 4 

13.32 In a test of the independence of two variables, one 
of the variables has two possible categories and the other 
has three possible categories. What will be the critical 
value of chi-square if the test is to be carried out at the 
0.025 level? At the 0.05 level? 

13.33 A researcher has observed 100 shoppers from three 
different age groups entering a large discount store and 
noted the nature of the greeting received by the shopper. 
Given the results shown here, and using the 0.025 level 



Chapter 13: Chi-Square Applications 



483 



of significance, can we conclude that the age category of 
the shopper is independent of the nature of the greeting 
he or she receives upon entering the store? Based on the 
chi-square table, what is the most accurate statement that 
can be made about the p-value for the test? 



Shopper Age Category (years) 





21 or 




51 or 






less 


22-50 


more 




Greeting Cool 


16 


12 


5 


33 


Friendly 


8 


20 


6 


34 


Hearty 


6 


14 


13 


33 




30 


46 


24 


100 



13.34 A research organization has collected the following 
data on household size and telephone ownership for 200 
U.S. households. At the 0.05 level, are the two variables in- 
dependent? Based on the chi-square table, what is the most 
accurate statement that can be made about the p-value for 
the test? 



Telephones Owned 





<1 


2 


>3 




Persons ==2 


49 


18 


13 


80 


in the 3-4 


40 


27 


21 


88 


Household >5 


1 1 


13 


8 


32 




100 


58 


42 


200 



13.35 Researchers in a California community have asked 
a sample of 175 automobile owners to select their favorite 
from three popular automotive magazines. Of the 111 
import owners in the sample, 54 selected Car and Driver, 
25 selected Motor Trend, and 32 selected Road & Track. 
Of the 64 domestic-make owners in the sample, 19 selected 
Car and Driver, 22 selected Motor Trend, and 23 selected 
Road & Track. At the 0.05 level, is import/domestic own- 
ership independent of magazine preference? Based on the 
chi-square table, what is the most accurate statement that 
can be made about the p-value for the test? 

13.36 A pharmaceutical firm, studying the selection of 
"name brand" versus "generic equivalent" on 
prescription forms, has been given a sample of 150 recent 
prescriptions submitted to a local pharmacy. Of the 44 
under-40 patients in the sample, 16 submitted a prescrip- 
tion form with the "generic equivalent" box checked. 

Of the 52 patients in the 40-60 age group, 28 submitted 
a prescription form specifying "generic equivalent," 
and for the 54 patients in the 61-or-over age group, 
32 submitted a prescription form specifying "generic 
equivalent." At the 0.025 level, is age group independent 
of name-brand/generic specification? Based on the 
chi-square table, what is the most accurate statement 
that can be made about the p-value for the test? 



13.37 Customers of the Sky Mountain Grocery chain are 
routinely asked at the checkout whether they prefer paper 
or plastic bags for their purchases. In a recent study, 
researchers observed the type of bag specified and surveyed 
the customer for other information, including his or her 
level of education. For the 175 persons in the sample, bag 
selection and education levels were as shown below. At the 
0.01 level, is bag selection independent of education level? 
Based on the chi-square table, what is the most accurate 
statement that can be made about the p- value for the test? 

Education Level 



High Some College Graduate 
School College Grad Study 



Bag Paper 


14 


13 


34 


2 


63 


Selection Plastic 


17 


19 


19 


3 


58 


No Preference 


8 


28 


13 


5 


54 




39 


60 


66 


10 


175 



13.38 Upon leaving an assembly area, production items 
are examined and some of them are found to be in need of 
either further work or total scrapping. Tags on a sample 
of 150 items that failed final inspection show both the 
recommended action and the identity of the inspector 
who examined the item. The summary information for 
the sample is shown below. At the 0.10 level, is the recom- 
mended action independent of the inspector? Based on the 
chi-square table, what is the most accurate statement that 
can be made about the p-value for the test? 



Inspector 







A 


B 


C 


Recommended 


Major Rework 


20 


14 


13 


Action 


Minor Rework 


18 


16 


23 




Scrap 


16 


21 


9 






54 


51 


45 



150 



13.39 Collecting data for eight games, a basketball coach 
has compiled the following contingency table for the 
quarter of the game versus the result of "one-and-one" 
free-throw attempts in which a second shot can be 
attempted if the first one is made. At the 0.025 level, can 
he conclude that the quarter of the game is independent 
of the result of the free-throw opportunity? 



Quarter 





1 


2 


3 


4 


Points Scored 0 


6 


12 


10 


17 


1 


21 


17 


19 


25 


2 


26 


24 


1 1 


15 




53 


53 


40 


57 



/ data Set / Note: Exercises 13.40 and 13.41 require 
a computer and statistical software. 

13.40 The manager of a television station employs three 
different weather reporters (coded as 1-3) and has 
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surveyed viewers to find out whether education level 
(coded as 1-4) might be related to the weather reporter 
the viewer prefers. For the results listed in data file 
XR13040, use the 0.05 level of significance in concluding 
whether weather reporter preference and education level 
might be related. 

13.41 For a random sample of returns audited by the IRS, 
the data in file XR13041 describe the results according 
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to the income category of the audited party (coded as 
1 = low, 2 = medium, 3 = moderate, and 4 = high) 
and the type of IRS personnel who examined the 
return (coded as 1 = revenue agent, 2 = tax auditor, 
and 3 = service center). At the 0.05 level, are we able 
to conclude that the category of return is independent 
of the type of IRS examiner to which a return was 
assigned? 



13.5 COMPARING PROPORTIONS FROM k 

INDEPENDENT SAMPLES 



Purpose 

In Chapter 11, we presented a test for comparing two sample proportions to 
examine whether or not the population proportions could be the same. By using 
chi-square analysis, we can compare two or more sample proportions at the same 
time. For example, a politician might like to determine whether persons in differ- 
ent age groups are equally likely to have voted in the most recent election, or a 
production manager could be interested in whether the percentage of defective 
products differs significantly among workers in a given department. 

Procedure 

For all practical purposes, the test for equality of proportions, the procedure in this 
section, is really just a special case for the independence of two variables. The only 
differences are that here (1) each column consists of observations for an indepen- 
dent sample, while in the previous section the entire table represented just one 
sample; and (2) each table (observed and expected) will always have r = 2 rows. 
The O » and E- tables, the calculated and the determination of df are all han- 
dled the same way as in Section 13.4. The null and alternative hypotheses are 

H Q : 7r 1 = 7r 2 = 7r 3 = ••• = Tr k , for the / = 1 through k populations. 
H,: At least one of the tt- values differs from the others. 



Example .% 



Comparing Proportions 

According to the Bureau of the Census, 17% of Americans moved to a different 
house during the 1980-1981 period. This compares to 16% during 1990-1991 
and 14% during 2000-2001. 1 For our example, we will assume that 1000 persons 



Bureau of the Census, Statistical Abstract of the United States 2006, p. 36. 
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were surveyed during the two earlier studies and that 2000 persons were surveyed 
in the most recent one. Using the 0.05 level of significance, is it possible that the 
percentage of Americans who moved in each of these three periods was really the 
same, but the three surveys showed differences only because of sampling error? 

SOLUTION 

As mentioned previously, the appropriate procedure is just a special case of the 
contingency table method in the preceding section, and we proceed in the same 
manner. Although the test can be performed by tables and pocket calculator, we 
will use the procedure described in Computer Solutions 13.4, and we will discuss 
it in terms of the printout shown there. The null hypothesis is H 0 : it 1 = tt 7 = tt 3 , 
that the population proportions are equal. Whether the test is carried out by hand 
calculations or using the computer, the key components are shown below and in 
the Excel printout in Computer Solutions 13.4: 

1. The column totals are 1000, 1000, and 2000, respectively, as shown in cells 
B5:D5. 

2. For each column, we obtain the number of "movers" by multiplying the col- 
umn total by the percentage of people who moved that year. These frequen- 
cies are shown in row 3 of the printout. 

3. For each column, we obtain the number of "didn't move" observations by 
subtracting the number of "movers" from the column total. 

4. Overall, 15.25% of the respondents had moved. This is obtained by 
(170 + 160 + 280)/4000 = 0.1525, and 0.1525 is our estimate of the com- 
mon proportion for all three groups that is expressed in the null hypothesis. 
Based on what we've done so far, we can construct the following tables for the 
observed and expected frequencies: 





1980-1981 


1990-1991 


2000-2001 




Observed Frequencies 










Moved 


170 


160 


280 


610 


Didn't move 


830 


840 


1720 


3390 


Total 


1000 


1000 


2000 


4000 


Expected Frequencies 










Moved 


152.5 


152.5 


305.0 


610 


Didn't move 


847.5 


847.5 


1695.0 


3390 


Total 


1000 


1000 


2000 


4000 



5. Using the same formula as for the contingency table analyses in the previous 
section, we obtain the calculated value of the chi-square statistic: 

r k (0 ;/ - - E lt ) 2 _ (i7Q _ 152.5) 2 (160 - 152.5) 2 (1720 - 1695) 2 

•ti^ E t/ ~ 152.5 + 152.5 + "' + 1695 

= 5.223 

6. The calculated chi-square (5.223) does not exceed the critical value (5.992 for 
the 0.05 level and df = 2) and, at the 0.05 level of significance, we are not able 
to reject the null hypothesis that the proportions of Americans who moved dur- 
ing these time periods were the same. As shown in Computer Solutions 13.4, 
the exact p- value for the test is 0.073. 
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computer solutions 13.4 



Chi-Square Test Comparing Proportions from Independent Samples 
EXCEL 



7,1 





A 


B 


C 


D 


E 


1 


Contingency Table 








2 




1980-81 


1990-91 


2000-01 


TOTAL 


3 


moved 


170 


160 


280 


610 


4 


didn't 


830 


840 


1720 


3390 


5 


TOTAL 


1000 


1000 


2000 


4000 


6 












7 


chi-squared Stat 




5.223 




8 


df 






2 




9 


p-value 






0.073 




10 


chi-squared Critical 




5.992 





Whether using Excel or Minitab, this procedure is the same as that for testing the independence of two variables, de- 
scribed in Computer Solutions 13.3, except that at least one of the two variables will have just two categories. Using Excel 
and Data Analysis Plus, the results are as shown above. 

Note: As an alternative, you can use Excel worksheet template TMCHlPRO.XLS, provided with the text. The procedure is 
described within the template. 




13.42 For the following data obtained from three 
independent samples, use the 0.05 level in testing 

H Q : T7j = T7 2 = tt 3 versus Hj: "At least one population 
proportion differs from the others." 

n x = 100, p 1 = 0.20 
n 2 = 120, p 2 = 0.25 
« 3 = 200, p 3 = 0.18 

13.43 For the following data obtained from four 
independent samples, use the 0.025 level in testing 

H Q : ir x = 7r 2 = tt 3 = tt 4 versus H t : At least one population 
proportion differs from the others. 



Wj = 


150, 


Pi = 


0.30 


"2 = 


80, 


Pi = 


0.20 


»3 = 


140, 


Pi = 


0.25 


« 4 = 


50, 


Pa = 


0.26 



13.44 For three independent samples, each with 
n = 100, the respective sample proportions are 0.30, 
0.35, and 0.25. Use the 0.05 level in testing whether the 
three population proportions could be the same. 



13.45 An investment firm survey included the finding 
that 52% of 150 clients describing themselves as "very 
aggressive" investors said they were optimistic about the 
near-term future of the stock market, compared to 46% 
of 100 describing themselves as "moderate" and 38% of 
100 describing themselves as "conservative." Use the 
0.01 level in testing whether the three population propor- 
tions could be the same. 

13.46 A Scenic America study of billboards found that 
70% of the billboards in a sample observed in Baltimore 
advertised alcohol or tobacco products, compared to 
50% in Detroit and 54% in St. Louis. Using the 0.05 
level of significance, and assuming that a random sample 
of 200 billboards was observed in each city, could the 
population proportions be equal for billboards advertis- 
ing alcohol or tobacco products in the three cities? 
source: Elys McLean-Ibrahim, "Targeting the Message," USA Today, 
January 11, 1990, p. 1A. 

13.47 It has been reported that 18.3% of all U.S. 
households were heated by electricity in 1980, compared 
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to 26.5% in 1993 and 30.7% in 2001. At the 0.05 level, 
and assuming a sample size of 1000 U.S. households for 
each year, test whether the population percentages could 
be equal for these years, source: Statistical Abstract of the 
United States 2002, p. 604; Statistical Abstract of the United States 1996, 
p. 722. 

13.48 In analyzing the consumption of cottage cheese by 
members of various occupational groups, the United 
Dairy Industry Association found that 326 of 837 profes- 
sionals seldom or never ate cottage cheese, versus 220 of 
489 white-collar workers and 522 of 1243 blue-collar 
workers. Assuming independent samples, use the 0.05 
level in testing the null hypothesis that the population 
proportions could be the same for the three occupational 
groups, source: United Dairy Industry Association, Cottage Cheese 
Attitudes & Usage, p. 25. 

/ data Set / Note: Exercises 13.49 and 13.50 require 
a computer and statistical software. 



13.49 The movie complex at a shopping mall shows 
three movies simultaneously on the same evening. On a 
recent Friday evening, each movie drew a capacity crowd. 
A sample of the evening's movie patrons have been coded 
according to whether they purchased snacks ( 1 = pur- 
chased snacks, 2 = did not purchase snacks) and which 
movie they attended (1 = the "G" movie, 2 = the "PG" 
movie, and 3 = the "R" movie). The data are in file 
XR13049. At the 0.025 level, could the percentage buying 
snacks be the same for all three movies? 

13.50 An experiment has been conducted to compare the 
ease of use of several pocket calculators, with subjects ran- 
domly provided with one of four calculator designs. The 
subjects have been coded according to which one of the 
four calculators they tried (codes = 1-4) and whether they 
thought the calculator was "easy to use" (1 = easy to use, 
2 = not easy to use). The data are in file XR13050. At the 
0.01 level of significance, test whether the calculators could 
be equally easy to use. 



ESTIMATION AND TESTS REGARDING 13.6 
THE POPULATION VARIANCE 



Besides enabling us to carry out the tests presented in earlier sections of the chap- 
ter, the chi-square distribution is the basis for both estimation and hypothesis test- 
ing regarding the population variance. Like the population mean, the population 
variance will typically be unknown; it must be estimated by a sample statistic, and 
there will be uncertainty with regard to its actual value. In this section, we will 
quantify this uncertainty by using sample information and the chi-square distri- 
bution in (1) constructing a confidence interval for a 2 and (2) conducting hypoth- 
esis tests regarding its value. 

The Confidence Interval for a Population Variance 

The sample variance (s 2 ) is a point estimate of the population variance, and as 
such it is our single best estimate as to the value of a 2 . Just as it can be desirable 
to have an interval estimate for /a, it can also be useful to have an interval estimate 
for a 2 . For example, a production machine may be readily adjustable with regard 
to the mean of its output (e.g., a diameter or weight), but due to mechanical wear 
on bearings and other components, the amount of variability in its output may 
both increase and accelerate as the "wearing-out" process persists. At some point, 
a decision may have to be made regarding either the replacement or the overhaul 
of the machine, and such a decision could rely heavily on the variability that is 
present in a sample of output. 

The chi-square distribution was described earlier as the sampling distribution 
for x 2 = (« — lJsVcr 2 when random samples of size n are repeatedly drawn from 
a normal population. Of particular importance is the fact that we can transform 
the preceding equation and have a 2 = (n — l)s 2 /x 2 - This will be the statistic of 
interest in our determination of the confidence interval for a 2 . An important 
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assumption for both the construction of confidence intervals and the carrying out 
of hypothesis tests for a 2 is that the population must be normally distributed. 

While s 2 is the point estimate of a 2 , and a quantity from which we proceed to 
determine the lower and upper confidence limits, the confidence interval is not 
s 2 ± a fixed quantity. This is because, unlike the t and z distributions, the chi- 
square distribution is asymmetric. As Figure 13.1 showed, it is positively skewed. 
Thus, the description of the confidence interval for a 2 requires separate calcula- 
tions for its upper and lower limits: 



Confidence interval limits for the variance of a normal population: 




There is 100(1 — a)% confidence that 


o 2 is within the interval 




(n - l)s 2 , (n - l)s 2 
2 - <r ~ 2 


where s 2 = sample variance 




xh xi 


n = sample size 






Xij = chi-square value 


where 




upper-tail area = 


= a/2 




x£ = chi-square value 


where 




lower-tail area = 


a/2 




df = n - 1 





For the 100(1 — a)% confidence interval and df = n — 1, x\\ is tne chi-square 
value for which the area in the lower, or left, tail is a/2, while X\j is the chi-square 
value for which the area in the upper, or right, tail is a/2. Notice that xl is within 
the expression for the upper confidence limit, and Xjj is within that for the lower 
confidence limit. Because they are in the denominators of their respective expres- 
sions, their roles are the reverse of what intuition might suggest. 

For example, if df = 5 and we are constructing a 90% confidence interval, the 
lower confidence limit for a 2 would have X\) = 11.070 in its denominator, and the 
upper confidence limit for a 2 would have xl = 1-145 in its denominator. Shown 
in Figure 13.4, along with their respective tail areas, these values would be used 
in determining the confidence limits for a 2 . 



FIGURE 13.4 



Chi-square values used in 
determining the 90% 
confidence limits for a 2 
when df = 5. Because they 
appear in the denominators 
of their respective 
confidence limit 
expressions, xh is 
associated with the lower 
confidence limit for a 2 , 
and x[ ' s associated with 
the upper confidence limit 
for a 2 . 
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Example 




Confidence Interval for a Population Variance 

A total quality management (TQM) supervisor is interested in examining vari- 
ability in the amount of product being inserted by a filling machine. From a sam- 
ple of n = 15 packages, the average weight of the contents was 412.72 grams, 
with a standard deviation of 5.30 grams. The underlying data are in file CX13FILL. 
If the filling weights are from a normally distributed population, what is the 98% 
confidence interval for the population variance? 

SOLUTION 

From this sample, the point estimate of tr 2 is s 2 , or 5.30 2 = 28.09. However, since 
the chi-square distribution is not symmetrical, s 2 = 28.09 will not be at the mid- 
point of the confidence interval. For the 98% confidence interval, a = 0.02, and 
the area in each tail beneath the chi-square curve will be a/2, or 0.01. The next 
step is to determine the value of xl tnat corresponds to a lower-tail area of 0.01 
and the value of xl that corresponds to an upper-tail area of 0.01. These can be 
found by referring to chi-square values listed in Appendix A, or 

• For a lower-tail area of 0.01 and df = 14: 

xl = 4.660 

• For an upper-tail area of 0.01 and df = 14: 

Xl = 29.141 

The 98% confidence interval for a 2 can now be found through separate cal- 
culations for its two limits: 

• The lower 98% confidence limit for a 2 is 

(» - l)s 2 _ (15 - 1)(5.30) 2 
Xl ~ 29.141 

• The upper 98% confidence limit for tr 2 is 

(» ~ l)s 2 = (15 - 1)(5.30) 2 
y? ~ 4.660 



= 13.495 



= 84.391 



The TQM supervisor will have 98% confidence that the population variance 
is somewhere between 13.495 and 84.391. Since <r is the square root of a 1 , we 
can also say that he ha s 98% c onfide nce that the population standard deviation is 
somewhere between V13.495 and V84.391, or from 3.674 to 9.186 grams. 



Hypothesis Tests for the Population Variance 

As with the construction of a confidence interval for a 1 , hypothesis testing also re- 
quires that the population be normally distributed. As with previous hypothesis 
tests, we can carry out either nondirectional or directional tests for a 2 . Regardless 
of the directionality of the test, the test statistic will be the same. 
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Test statistic for tests regarding a population variance: 
(n - l)s 2 

X 2 = 2 where s 2 = sample variance 

a o n = sample size 

(Jq = a hypothesized value for a 2 
df=n-\ 

Although the test statistic is the same for any test, the null and alternative hy- 
potheses, the critical value(s), and the decision rule will depend on the type of test 
being conducted. The possibilities are summarized in Table 13.3, where the level 
of significance for each test has been identified as a. 



A. Two-tail test 




Null and alternative hypotheses: H 0 : a 2 = a 2 and HyO 2 ^ a 2 
Critical values: \\, corresponding to lower-tail area of a/2 
Xy, corresponding to upper-tail area of a/2 
Decision rule: Reject H Q if calculated x 2 is either < x[ or > Xu', 
otherwise, do not reject. 

B. Upper-tail test 




Null and alternative hypotheses: H Q : a 2 < a 2 and Hy o 2 > a- 2 

Critical value: Xu> corresponding to upper-tail area of a 

Decision rule: Reject H 0 if calculated x 2 > Xu', otherwise, do not reject. 

C. Lower-tail test 




xl 



Null and alternative hypotheses: H Q : a 2 > o- 2 and Hy a 2 < a 2 . 

Critical value: x 2 ' corresponding to lower-tail area of a 

Decision rule: Reject H Q if calculated x 2 < X 2 ', otherwise, do not reject. 
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Example 




Hypothesis Test for a Population Variance 

A computer manufacturer requires that springs purchased for installation beneath 
its keyboard keys have no more than a = 5 grams of variability in the amount of 
force required for key depression. From a shipment of several thousand springs, a 
random sample of 20 springs is tested, and the standard deviation is 7.78 grams. 
The underlying data are in file CX13KEYS. Assuming a normal distribution for the 
force required to compress the springs, use the 0.01 level of significance in exam- 
ining whether the shipment meets the manufacturer's specifications. 

SOLUTION 

The sample variance is 7.78 2 = 60.53, and the value of a 2 associated with the test 
is Oq = 5 2 = 25.00. The test is one-tail, since the computer manufacturer 
wishes to know whether the population variance exceeds the amount that is ac- 
ceptable. The null and alternative hypotheses are 

H Q : o- 2 < 25.00 and H x : a 2 > 25.00 

The test statistic is x 2 - For cr 1 = 25.00, s 2 = 60.53, and n = 20, the calculated 
value of the test statistic is 

Calculated , 2 = <^ = (2 ° = 46.00 

Oq 25.0 

As part B of Table 13.3 shows, this combination of null and alternative hypothe- 
ses calls for an upper-tail test. Referring to the df = 19 row of the chi-square table, 
we find that X\j = 36.191 is the chi-square value that corresponds to an upper-tail 
area of 0.01. The calculated value (46.00) falls into the "rejection" region shown in 
Figure 13.5. At the 0.01 level of significance, we reject H 0 and conclude that the ship- 
ment of springs does not meet the computer manufacturer's specifications. 



Do not 
reject H 0 



Reject H 0 



Area = 
0.01 



^2 =36.191 



FIGURE 13.5 



An upper-tail test for a 
population variance. The 
calculated value of the x 2 
test statistic allows us to 
reject the null hypothesis 
that the population 
variance is no more 
than 25.00. 



Test statistic: 
X 2 = 46.00 
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Computer Solutions 13.5 shows the procedure and results when we use 
Excel and Minitab to generate a confidence interval for a population variance. 
In Computer Solutions 13.6, we use Excel and Minitab to carry out a hypothe- 
sis test for a population variance. 





computer solu 


tioi 


is 13.5 





















Confidence Interval for a Population Variance 
EXCEL 





A 


B 


C 


D 


1 


Chi-Squared Estimate of a Variance 




2 










3 


Sample variance 


28.09 


Confidence Interval Estimate 




4 


Sample size 


15 


Lower confidence limit 


13.4950 


5 


Confidence level 


0.98 


Upper confidence limit 


84.3830 



Excel confidence interval for a population variance, based on summary information 

1. The data file for the filling machine example earlier in this section is CX13FILL.XLS. If we were not provided with 
summary information in this example, we would first use Excel to find the sample variance. In this file, the data 
values are in cells A2:A1 6. Just click on any empty cell and enter the formula =VAR(A2:A16). The variance is 5.30 2 , 
or 28.09. 

2. Open the ESTlMATORS.XLS workbook, supplied with the text. 

3. Using the arrows at the bottom left, select the Chi-Squared-Estimate_Variance. Enter the sample variance (28.09), 
the sample size (15), and the desired confidence level as a decimal fraction (0.98) into the appropriate cells. The upper 
and lower confidence limits for the population variance are provided as shown above. (These limits are slightly more 
accurate than our text example, in which we used table-based chi-square values in our calculations.) 

Excel confidence interval for a population variance, based on raw data 

1 . In workbook CX13FILL.XLS, click Tools. Click Data Analysis Plus. Select Chi-Squared Estimate: Variance. Click OK. 

2. Enter A1:A16 into the Input Range box. Click Labels. For a 98% confidence interval, enter 0.02 into the Alpha box. 
Click OK. 



MINITAB 

Test and CI for One Variance: Crams 

Chi -Square Method (Normal Distribution) 

Variable N Variance 98% CI 
Crams 15 28.1 (13.5, 84.4) 

Minitab confidence interval for a population variance, based on raw data 

1 . Open workbook CX13FILL.MTW. Click Stat, select Basic Statistics, and click 1 Variance. 

2. Select Samples in columns and enter C1 into the box. Click Options and enter 98 into the Confidence Level box. 
Click OK. Click OK. The 98% confidence interval will be as shown above. 

Minitab confidence interval for a population variance, based on summary data 

This will be identical to the procedure for raw data, but in step 2, select Summarized data, then enter 15 into the Sam- 
ple size box and 28.09 into the Sample variance box. 
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Hypothesis Test for a Population Variance 



EXCEL 





A B 


C 


D 


1 


Chi-squared Test of a Variance 






2 










3 


Sample variance 


60.53 


Chi-squared Stat 


46.003 


4 


Sample size 


20 


P(CHI<=chi) one-tail 


0.0005 


5 


Hypothesized variance 


25 


chi-squared Critical one-tail 


36.191 


6 


Alpha 


0.01 


P(CHI<=chi) two-tail 


0.001 


7 






chi-squared Critical two-tail 


6.844 


8 








38.582 



Excel hypothesis test for a population variance, based on summary information 

1 . The data file for the keyboard spring force example in this section is CX13KEYS.XLS. If we were not provided with 
summary information in this example, we would first use Excel to find the sample variance. In this file, the data val- 
ues are in cells A2:A21 . Just click on any empty cell and enter the formula =VAR(A2:A21). The variance is 7.78 2 , 
or 60.53. 

2. Open the TEST STATISTICS.XLS workbook, supplied with the text. 

3. Using the arrows at the bottom left, select the Chi-squared Test_Variance. Enter the sample variance (60.53), the 
sample size (20), the hypothesized value for the population variance (25.0) and the level of significance (0.01) for 
the test into the appropriate cells. 

Excel hypothesis test for a population variance, based on raw data 

1. In workbook CX13KEYS.XLS, click Tools. Click Data Analysis Plus. Select Chi-Squared Test: Variance. Click OK. 

2. Enter A1:A21 into the Input Range box. Click Labels. Enter 25.0 into the Hypothesized Variance box and 0.01 into 
the Alpha box. Click OK. 



MINITAB 

Test and CI for One Variance 

Test of sigma squared = 25 vs > 25 

Chi-Square Method (Normal Distribution) 

95% 
Lower 

N Variance Bound Chi-Square P 
20 60.5 38.2 46.00 0.000 

Minitab hypothesis test for a population variance, based on summary information 

1 . Click Stat, select Basic Statistics, and click 1 Variance. 

2. Select Summarized data, then enter 20 into the Sample size box and 60.53 into the Sample variance box. Select 
Perform hypothesis test and enter 25 into the Hypothesized variance box. Click Options and select Greater than 

from the Alternative menu. Click OK. Click OK. The results of the hypothesis test will appear as shown above. 

Minitab hypothesis test for a population variance, based on raw data 

Open workbook CX13KEYS.MTW. The procedure will now be the same as that for summary data but, in step 2, select 
Samples in columns and enter C1 into the box. 
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13.51 In applying the chi-square statistic to estimation 
and tests for a population variance, why must the popula- 
tion be normally distributed? 

13.52 In previous chapters, confidence intervals have 
been expressed in terms of a sample statistic plus or 
minus a given expression — e.g., x ± t(sl\rn). However, 
the confidence interval for cr 2 cannot be expressed as s 2 
plus or minus a comparable term. Why is this not 
possible? 

13.53 A random sample of 30 observations has been 
drawn from a normal population, and the sample 
variance is found to be s 2 = 23.8. Determine the 95% 
confidence interval for a 2 . 

13.54 A random sample of 10 observations has been 
drawn from a normal population, and the sample 
variance is found to be s 2 = 19.5. Determine the 98% 
confidence interval for cr 2 . 

13.55 A random sample of 20 observations has been 
drawn from a normal population, and the sample 
variance is found to be s 2 = 4.53. Determine the 95% 
confidence interval for cr 2 . 

13.56 A pharmaceutical company has specified that 
the variation in the quantity of active ingredient in its 
leading prescription medication must be such that the 
population standard deviation must be no more than 
1.20 micrograms. The ingredient content of the pills is 
normally distributed. In a random sample of 20 pills, the 
standard deviation is found to be s = 1.32 micrograms. 
Based on the sample data, use the a = 0.05 level in 
testing H 0 : cr < 1.20 micrograms versus Hy cr > 1.20 
micrograms. 

13.57 During the early stages of production for a 
single-cylinder engine designed for use in portable gen- 
erators, the manufacturer randomly selects 12 engines 
from the end of the assembly line and subjects them to 
a battery of tests. In the horsepower tests, the engines 
are found to have an output standard deviation of 

s = 0.18 horsepower. Assuming a normal distribution 
of horsepower outputs, determine the 90% confidence 
interval for cr. 

13.58 An instrument maker has randomly selected 
and submitted 20 of its electronic blood-pressure 
measurement units to a medical laboratory for testing. 
At the lab, each unit measures the "blood pressure" of 
a testing device that simulates a human arm; within 
the device the pressure is a known constant. For the 



instruments submitted for testing, the standard 
deviation of the readings is 1.4 millimeters. Assuming 
a normal distribution of measurement values, 
determine the 95% confidence interval for the population 
standard deviation of instruments produced by this 
manufacturer. 

13.59 A random sample of n = 30 is drawn from a 
population that is normally distributed, and the sample 
variance is s 2 = 41.5. Use a = 0.05 in testing H 0 : 

cr 2 = 29.0 versus H v - cr 2 ^ 29.0. 

13.60 A random sample of n = 12 is drawn from a 
population that is normally distributed, and the sample 
variance is s 2 = 19.3. Use a = 0.025 in testing H Q : 

cr 2 < 9.4 versus H y - cr 2 > 9.4. 

13.61 According to industry specifications, the 
standard deviation of gasket thicknesses for a given 
application must be no more than 3 thousandths of 
an inch. In a random sample of 25 gaskets produced 
by one of the firms, the sample standard deviation is 
found to be 3.7 thousandths of an inch. Assuming the 
population of thicknesses is normally distributed, 
and using a = 0.025, examine whether the firm 
could be in violation of the industry specification. 
Based on the chi-square table, what is the most 
accurate statement that can be made about the p-value 
for the test? 



/ data Set / Note: Exercises 13.62 and 13.63 require 
a computer and statistical software. 

13.62 In past studies, a fast-food franchise has found 
the standard deviation of weights for quarter-pound 
hamburgers to be approximately 0.28 ounces after 
cooking. A consultant has suggested a new procedure 
for hamburger preparation and cooking that she believes 
will reduce the variability in the cooked weight of the 
hamburgers. In a test of the new method, 61 hamburgers 
are prepared, cooked, and weighed, with the resulting 
cooked weights listed in data file XR13062. At the 

0.05 level, and assuming a normal distribution for the 
weights, examine the possibility that the new procedure 
is actually no better than the old one. 

13.63 Given the information in Exercise 13.62 and data 
file XR13062, determine the 98% confidence interval for the 
population variance of the cooked weights for hamburgers 
prepared using the consultant's recommended cooking 
procedure. 
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• The chi-square distribution 

The chapter introduces a probability distribution called the chi-square distribu- 
tion. When samples of size n are drawn from a normal population, the chi-square 
distribution is the sampling distribution of X 1 = ( n ~ l)s 2 /°" 2 - The chi-square dis- 
tribution is also a family of distributions, with the specific member of the family 
depending on the number of degrees of freedom, and with the determination of df 
depending on the particular test or procedure being carried out. The chi-square 
distribution has extreme positive skewness for smaller values of df, but 
approaches the normal distribution as df becomes larger. 

• Applications and procedures for chi-square tests 

The chi-square distribution, along with sample data and frequency counts, is used 
to examine ( 1 ) whether a sample could have come from a given type of population 
distribution (e.g., the normal distribution), (2) whether two nominal variables 
could be independent of each other, and (3) whether two or more independent 
samples could have the same population proportion. In these applications, 
chi-square analysis involves the comparison of a table of observed frequencies 
(Ojj) with a table of expected frequencies (£■•) that has been constructed under the 
assumption that the null hypothesis is true. 

The calculated value of the chi-square statistic represents the extent to which 
these tables differ. If the calculated chi-square exceeds the critical value, H 0 is re- 
jected. In these tests, it is important that the total sample size and the categories 
be such that each £•■ value is at least 5. This is because the chi-square distribution 
is continuous and is being used to analyze discrete (counts) data. If one or more 
expected frequencies do not meet this minimum requirement, we should either in- 
crease the total sample size or combine adjacent rows or columns in the table. 

• Confidence intervals and testing for a population variance 

The chi-square distribution is also used in constructing confidence intervals and 
carrying out hypothesis tests regarding the value of a population variance. In 
these applications, an important assumption is that the population is normally 
distributed. 



Chi-Square Test for Goodness of Fit 



k 



(O, - E ? .) 



Calculated x 2 = 2 ~£ where k = number of categories, 

/=i / or cells in the table 

O ■ = observed frequency in cell /' 
E- = expected frequency in cell / 

In this test, df = k — 1 — m, with m = the number of population parameters (e.g., /a or a) 
that must be estimated from sample data in order to carry out the goodness-of-fit test. The 
test is upper-tail, and the critical value of x 2 will be that associated with the level of signif- 
icance (a) and degrees of freedom (df) for the test. Each £• must be at least 5. If not, either 
the total sample size should be increased, or cells with E, < 5 can be combined with adja- 
cent cells. 
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Chi-Square Test for the Independence of Two Variables 

number of rows 
in the contingency table 
number of columns in 
the contingency table 
observed frequency in 
row i, column / 
expected frequency in 
row i, column j 

The expected frequency in row i, column /' is 

£ (/ = pj{n t ) where p- = for all the observations, the 

proportion that are in column ;' 
rij = number of observations in row i 

In this test, df '= (r — l)(k — 1), the number of cell frequencies that are free to vary. Given 
that we know the sample size (the total number of observations in the table), we need 
only know the contents of (r — 1) rows and (k — 1) columns in order to completely fill 
in the cells within the r X k table. Each E» must be at least 5. If not, either the total sam- 
ple size should be increased, or adjacent rows or columns can be combined until each 
is > 5. 

Chi-Square Test for the Equality of Population Proportions 

This procedure is a special case of the test for independence above, but the columns repre- 
sent independent samples from the respective populations, and there will be just two rows 
in the contingency table. 

Confidence Interval Limits for the Variance of a Normal Population 

There is 100(1 — a)% confidence that a 2 is within the interval 

sample variance 
sample size 

chi-square value where 
upper-tail area = a/2 
chi-square value where 
lower-tail area = a/2 

In this test, df = n — 1. The population must be normally distributed. 

Test Statistic for Hypothesis Tests Regarding a Population Variance 

X 2 = where s 2 = sample variance 

a ° n = sample size 

o- 2 , = a hypothesized value for a 1 

Regardless of the test, df = n — \. The test statistic is the same for any test, but the null and 
alternative hypotheses, the critical value(s), and the decision rule will depend on the type 
of test being conducted. The possibilities are summarized in Table 13.3, and the population 
must be normally distributed. 



Calculated x 1 = 22 jt where r 



,= i,= i E ij 



k = 



°M = 



(n ~ l)s 2 „ , _ (n - \)s 2 

2 — °" — 2 where s = 

Xu Xl n = 

xh = 
xl = 
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13.64 The personnel director of a large firm has summa- 
rized a random sample of last year's absentee reports in 
the accompanying contingency table. At the 0.01 level of 
significance, test the independence of age versus length 
of absence. 

Number of Days Absent 



Age Group 





1 


2-4 


5-7 


>7 




<25 years 


30 


8 


3 


4 


45 


26-40 years 


15 


12 


5 


7 


39 


>40 years 


18 


22 


1 1 


10 


61 




63 


42 


19 


21 


145 



13.65 Frank Gerbel, a local insurance representative, has 
hired a part-time assistant to make "cold" calls on potential 
customers. The assistant has been given a quota of 20 such 
calls to make each day, and a call is deemed a success if the 
potential customer agrees to set up an appointment to dis- 
cuss his or her insurance needs with Mr. Gerbel. Over the 
past 50 days, the distribution of the number of successful 
calls made by the assistant is as shown below. Using the 
0.05 level of significance, test the goodness of fit between 
these data and a Poisson distribution with A = 1.4. 



Number of 
Successful Calls 



Number of 
Days 



0 


17 


1 


15 


2 


9 


3 


4 


4 


1 


5 


3 


6 


1 




50 



13.66 For the most recent year reported, the Federal 
Bureau of Investigation has listed the following data for 
robberies in the United States, source: Statistical Abstract of the 
United States 2006, p. 201. 



Street or highway 
Bank or commercial 

establishment 
Gas station or 

convenience store 
Residence 



179,000 robberies 

70,000 

37,000 
57,000 



The police chief in a medium-sized community has data 
indicating that the respective numbers of these robberies in 
his community during the same year were 23, 9, 7, and 11. 
At the 0.05 level, can the distribution of robberies in this 
community be considered as having differed significantly 
from that experienced by the United States as a whole? 



13.67 The following data are the number of persons 
who were waiting in line at a checkout counter at 100 
randomly selected times during the week. At the 0.05 
level, test whether the population of these values could 
be Poisson distributed. 



x = Number 
of Persons 



Number of 
Observations 



24 
24 
18 
20 
10 
4 
100 



13.68 A newspaper reporter, collecting data for a feature 
article on her state lottery system, has found the 200 digits 
most recently selected to be distributed as shown below. 
Based on this information, and using the 0.10 level of sig- 
nificance, can she conclude that the digits have not been 
drawn from the appropriate discrete uniform distribution? 



Digit 

Frequency 



0 


1 2 


3 4 


5 


6 


7 


8 9 


17 1 


8 24 


25 19 


16 


14 


23 


24 20 



200 



13.69 The American Council of Life Insurance lists the 
following breakdown of ordinary life insurance policies 
purchased in the United States during the most recent 
reporting year. If a random sample of the ordinary life 
insurance policies from a given state were to result in the 
table of observed frequencies shown in the rightmost col- 
umn, could we conclude that the amounts for ordinary 
life insurance policies sold within that state have the same 
distribution as policies on the national level? Use the 0.05 
level of significance in reaching a conclusion, source: 
American Council of Life Insurance, Life Insurance Fact Book 1999, p. 14. 



Size of 
Policy 



National Level, 
Percentage 
of Policies 



State Sample, 
Number 
of Policies 



under $10,000 

$10,000-$14,999 

$15,000-$24,999 

$25,000-$49,999 

$50,000-$99,000 

$100,000-$500,000 

over $500,000 



9 
7 
13 
19 
40 
3 



34 
39 
17 
38 
56 
105 
1 1 



100% 



300 



13.70 A national public television network has found 
that 35% of its contributions are for less than $20, with 
45% for $20-$50, and 20% for more than $50. In a 
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random sample of 200 of the contributions to a local 
station, 42% were for less than $20, 43% were for 
$20-$50, and 15% were over $50. At the 0.05 level, 
does the local station's distribution of contributions differ 
significantly from that experienced nationally? 

13.71 Twenty-five percent of the employees of a large 
firm have been with the firm less than 10 years, 40% 
have been with the firm for 10 to 20 years, and 35% have 
been with the firm for more than 20 years. Management 
claims to have randomly selected 20 employees to partici- 
pate in a drawing for a new car. Of the 20 employees 
chosen, 3 have fewer than 10 years with the firm, 5 have 
been there between 10 and 20 years, and 12 have been 
with the firm for over 20 years. At the 0.025 level of sig- 
nificance, evaluate management's claim of random selec- 
tion. Using the chi-square table in Appendix A, what is 
the most accurate statement that can be made about the 
exact p- value for the test? 

13.72 The following contingency table describes types of 
collisions versus driving environments for a random sample 
of two-vehicle accidents that occurred in a given region last 
year. At the 0.01 level of significance, can we conclude that 
the type of collision is independent of whether the accident 
took place in a rural versus an urban setting? 



Type of Collision 





Angle 


Rear-end 


Other 


Driving 


Urban 


40 


30 


72 


Environment 


Rural 


6 


12 


15 






46 


42 


87 



13.73 Of the students at a large university, 30% are fresh- 
men, 25% are sophomores, 25% are juniors, and 20% are 
seniors. The representation of the four classes in the 100- 
member marching band is found to be 24%, 28%, 19%, 
and 29%. At the 0.05 level of significance, does the com- 
position of the marching band differ significantly from the 
class distribution of the university? Using the chi-square 
table in Appendix A, what is the most accurate statement 
that can be made about the exact p-value for the test? 

13.74 Three different instructional techniques are being 
considered for training mechanics to perform a difficult 
emissions-reduction adjustment on fuel-injection engines. 
Each of 60 mechanics is randomly assigned to receive one of 
the training techniques. For the 20 trained by technique A, 
70% are able to perform the adjustment successfully on the 
first try. For mechanics trained by techniques B and C, the 
respective percentages are 50% and 65%. At the 0.05 level, 
can we conclude that the three techniques are equally 
effective? 

13.75 Taking air samples at random times during a 
2-month period, an environmental group finds the stan- 
dard deviation of ozone concentration measurements in 



the vicinity of a chemical plant to be 0.03 parts per million 
(ppm). The group believes the population of ozone 
measurements is normally distributed, and they have 
decided to use the 0.03 ppm figure as a baseline for the 
variability in the values. They will continue taking air sam- 
ples, with 12 samples taken at random times during each 
week. If they find that the standard deviation of observa- 
tions for any given week is abnormally higher or lower 
than 0.03 ppm, they will conclude that something unusual 
is happening at the plant. Given that the group is estimat- 
ing a as 0.03, and assuming that they are relying on a 98% 
confidence interval, how high or low would a weekly stan- 
dard deviation have to be for them to conclude that 
unusual events are taking place at the plant? 

13.76 A local bank has recently installed an automatic 
teller machine (ATM). At a regional meeting, an ATM 
specialist tells the bank manager that usage times will be 
normally distributed and the variability in the amount of 
time customers require to use the machine will start out 
relatively high, then eventually settle down to a standard 
deviation of 0.8 minutes. The manager, skeptical that the 
variability could be this great, has a study done in which 
a random sample of 30 persons is timed while using the 
machine. Analysis of the data shows a standard deviation 
of 0.61 minutes. At the 0.05 level, test whether the stan- 
dard deviation of usage times for this bank's customers 
might actually be no less than the amount predicted at 
the regional meeting. 

13.77 According to the Bureau of the Census, 39% of 
U.S. adults in the $20,001-$35,000 income group partic- 
ipated in adult education during the most recent report- 
ing year. Data for this and other groups are shown here. 

source: Statistical Abstract of the United States 2006, p. 189. 

Percentage Who Participated 
Income Group in Adult Education 

$20,001-$35,000 39% 
$35,001-$50,000 48% 
$50,001-$75,000 56% 

Assuming independent samples of 1000 for each of these 
income groups, use the 0.10 level of significance in testing 
the null hypothesis that tt 1 = tt 2 = tt-x. 

13.78 Over the past year, an audio store has sold stereo 
systems produced by three different manufacturers. Of 
the 400 units from company A, 12% were returned for 
service during the warranty period. This compares to 
15% of 300 units from company B, and 16% of 500 
from company C. Assuming that the store's sales are a 
representative sample of the stereo systems produced by 
the three manufacturers, use the 0.025 level of signifi- 
cance in testing whether the population percentages of 
warranty service returns could be equal. 

13.79 Union members in a large firm work in three depart- 
ments: assembly, maintenance, and shipping/receiving. 
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Twenty of a random sample of 30 assembly workers say 
they intend to ratify the new contract negotiated by 
union leaders and management. This compares to 30 
of 40 maintenance workers and 25 of 50 shipping/ 
receiving workers who say they will ratify the contract. 
At the 0.025 level of significance, test the null hypothesis 
that the population proportions are the same for all three 
departments. 

13.80 An aspirin manufacturer has designed the child- 
proof cap on its bottles so that it must be pressed down 
before it can be turned and removed. A consumer maga- 
zine has randomly purchased 15 bottles of the company's 
aspirin for analysis in a feature article on pain relievers 
and their packaging. Among the analyses is a test of how 
much force is required to depress the cap on each bottle. 
The standard deviation for the bottles made by this man- 
ufacturer is found to be 7.2 grams. From the magazine's 
findings, and assuming a normal distribution, determine 
the 95% confidence interval for the population standard 
deviation of the force required to depress the firm's 
aspirin bottle caps. 

13.81 A campus newspaper is examining the extent to 
which various segments of the university community favor 
the Student Government Association's recommendation 
that the school make the transition from Division II to 
Division I NCAA football. A random sample of 100 
students is chosen from each of the major colleges. 

Of persons surveyed from the College of Business, 
47% favor the move. Corresponding percentages for 
the College of Fine Arts, the College of Science and 
Technology, and the College of Humanities and Social 
Sciences are 62%, 45%, and 39%. At the 0.05 level, 
test whether the population percentages could be the 
same for students from these four colleges within the 
university. 

13.82 A casting machine is supposed to produce units 
for which the standard deviation of weights is 0.4 ounces. 
According to the maker of the machine, it should be shut 
down and thoroughly examined for wear if the standard 
deviation becomes greater than the 0.4 ounces that has 
been specified. In a random sample of 30 units recently 
produced by the machine, the standard deviation is 
found to be 0.82 ounces. Given this information, and 
assuming a normally distributed population of weights, 
would you feel very secure in advising management that 
the machine should be shut down? Use the approximate 
p-value of the test for assistance in formulating your 
answer. 

/ data Set / Note: Exercises 13.83-13.88 require a 
computer and statistical software. 



13.83 At the beginning of last year, a power company 
asked each of its industrial customers to estimate how 
much electricity they would require during the year. 

At the end of the year, the power company recorded the 
extent to which a customer's estimate was over or 
under the amount of electricity actually used. In data 
file XR13083, customers are described according to their 
size (1 = small, 2 = medium, 3 = large) and the extent 
to which their usage differed compared to their estimate 
(1 = used more than 110% of estimate, 2 = within 
10% of estimate, and 3 = used less than 90% of 
estimate). At the 0.025 level of significance, test whether 
customer size might be independent of estimation 
accuracy. 

13.84 The data in file XR13084 represent the Scholastic 
Aptitude Test (SAT) scores for a random sample of 

200 high school seniors who have taken the exam. Using 
the 0.025 level of significance, test whether the sample 
scores could have come from a population that is 
normally distributed. 

13.85 For a random sample of households in a county, 
file XR13085 lists the number of TV sets owned by each of 
the households in the study. Using the 0.025 level of sig- 
nificance, test whether x = the number of television sets 
per household could be Poisson distributed in the 
county's households. 

13.86 Researchers have coded a random sample of 
households according to household income (1 = under 
$30,000, 2 = $30,000-$49,999, 3 = $50,000-$69,999, 
and 4 = $70,000 or more) and number of vehicles owned 
(1 = 0, 2 = 1 or 2, and 3 = 3 or more). The data are in 
file XR13086. Using the 0.05 level of significance, test 
whether household income is independent of the number 
of vehicles owned. 

13.87 Safety researchers in a government agency believe 
that too much variability in the speeds of vehicles on 
urban sections of interstate highways can contribute to 
accidents by causing a greater level of interaction between 
vehicles traveling in the same direction. They believe that 
a standard deviation in excess of 5 mph is undesirable. 
Observing a random sample of vehicles on an urban por- 
tion of the interstate highway in their locale, they find the 
speeds to be as listed in file XR13087. At the 0.025 level, 
and assuming a normal distribution of vehicle speeds, 
could they be mistaken in their conclusion that too much 
variability exists in the speeds of vehicles passing this 
location? 

13.88 Given the information in Exercise 13.87 and data 
file XR13087, determine the 95% confidence interval for 
the population variance of the speeds of vehicles passing 
this location on the urban interstate. 
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Thorndike Sports Equipment 

Ted Thorndike is trying to close a sale with the Alvindale 
Chipmunks, a minor-league professional baseball team. 
Although his grandfather isn't very thrilled about any kind 
of deal with somebody with a name like "Chipmunks," 
Ted feels this could mark the beginning of some really 
high-level business with professional baseball teams, all 
the way up to the majors. 

The Chipmunks, who are relatively high tech for a 
minor-league team, have been using a Thorndike competi- 
tor's automatic pitching machine for batting practice and 
are looking for a replacement. The problem is that the 
machine is very erratic. Although it has not "beaned" any 
players yet, they are concerned about the amount of verti- 
cal variability it exhibits in its pitches. Tom Johnson, prin- 
cipal owner of the Chipmunks, did a study in which he set 
up a target at home plate, then measured the heights at 
which the balls hit the target. 

Mr. Johnson found the heights were normally dis- 
tributed, as he has discovered is the case for pitching ma- 
chines throughout the industry. However, he feels that a 
standard deviation of 3.5 inches in the heights of the 
pitches is just too "wild," even for the minor-league level. 
He has offered Ted Thorndike the opportunity to supply 
a replacement machine. The only catch to the sale is 
that Ted must prove that the Thorndike "Rapid Robot" 
machine is significantly better than the machine the 
Chipmunks are now using. Mr. Johnson has specified 
that the Thorndike pitcher will be purchased only if it 
passes the following test: Based on 30 pitches by Rapid 



Robot, the standard deviation of the vertical heights 
must be significantly less than 3.5 inches, and at the 0.01 
level of significance. 

All of this talk about vertical standard deviation is over 
Luke's head, but he tells Ted to go ahead and see if he can 
swing a deal with the Chipmunks. Ted is confident that the 
Thorndike Rapid Robot can meet Mr. Johnson's specifica- 
tions, so he and Mr. Robot make the trip to Alvindale. 

Ted and Mr. Johnson set up the Thorndike Rapid 
Robot machine at the team's practice facility, along with a 
target at home plate that will reveal exactly where each 
pitch lands. Measured from ground level at home plate, 
the 30 pitches are the following distances from the ground. 
The data are also in file THORN13. 



26.1 


26.2 


27.2 


26.9 


25.8 


26.8 


25.3 


29.9 


27.1 


26.3 


28.1 


31.4 


27.3 


23.5 


27.0 


29.0 


31.3 


24.2 


28.7 


30.6 


26.9 


28.9 


25.4 


26.9 


22.7 


32.1 


30.1 


26.6 


25.5 


27.5 inches 



1. Given Mr. Johnson's decision rule, is this a two-tail or a 
one-tail hypothesis test? What are the null and alterna- 
tive hypotheses? 

2. When Mr. Johnson has completed his analysis of the 
data represented by the pitches in the test session, will 
he sign Thorndike's Rapid Robot as a mechanical mem- 
ber of the Chipmunks? 



Springdale Shopping Survey 

These analyses involve the following variables from the data 
file SHOPPING, the Springdale Shopping Survey introduced in 
Chapter 2: 

1, 2, and 3 = Frequency of shopping at Springdale Mall, 
Downtown, and West Mall, respectively 

4, 5, and 6 = Typical amount spent during a trip to 

Springdale Mall, Downtown, and West Mall, 
respectively 

17 = The area identified as best for bargain sales 
26 = Gender of the respondent 




27 = Education level of the respondent 

28 = Marital status of the respondent 

Information like that gained from this database exer- 
cise could provide helpful insights into the nature of the 
respondents themselves as well as the possibility that some 
respondent characteristics might not be independent of 
other characteristics. Only a few of the many possible 
comparisons have been specified in this exercise. 
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Part I: Respondent Characteristics and the 
Area Judged Best for Bargains 

1. Construct and use the 0.05 level in testing the contin- 
gency table for variables 26 and 17. Can we conclude 
that the gender of the respondent is independent of the 
shopping area judged best for bargains? 

2. Repeat step 1 for variables 27 and 17. Can we conclude 
that the education level of the respondent is independent 
of the shopping area judged best for bargains? 



Part II: Respondent Characteristics 

1. Construct and use the 0.05 level in testing the contin- 
gency table for variables 26 and 27. Can we conclude 
that the gender of the respondent is independent of the 
education level of the respondent? 

2. Construct and use the 0.05 level in testing the contin- 
gency table for variables 26 and 28. Can we conclude 
that the gender of the respondent is independent of the 
marital status of the respondent? 




Baldwin Computer Sales (C) 



In the Baldwin Computer Sales case, visited previously in 
Chapters 3 and 5, one of the areas of concern regarding the 
student purchase program was whether defaulting on the 
computer payments is independent of the university at- 
tended by the student. Using the BALDWIN data file on the 
text CD, apply the appropriate chi-square test from this 
chapter to determine whether these variables could be inde- 
pendent. Use a level of significance of your choice in reach- 
ing a conclusion, and identify and interpret the p-value for 
the test. What, if any, recommendations would you make 
to Baldwin management on the basis of this test? 




Chi-Square Distribution 



When this applet loads, it shows a chi-square distribution 
similar to those in Figure 13.1. By using the slider at the 
right, we can change the degrees of freedom for the distri- 
bution and observe how this changes the distribution 




shape. We can also use the text boxes to enter either a 
chi-square value or a probability value, thus providing re- 
sults similar to those in Table 13.1 . Note that the chi-square 
and probability values in this applet will sometimes differ 
slightly from those in Table 13.1 and the corresponding 
chi-square table in the back of the text. Besides being more 
accurate because they do not rely on printed tables, these 
text boxes can also provide a wider variety of chi-square 
and probability values than are available from our printed 
tables. 

Applet Exercises 

17.1 Using the slider, increase the number of degrees of 
freedom. What happens to the shape of the distribution? 

17.2 Using the "ChiSq" text box, enter a larger chi-square 
value. (Be sure to press the Return or Enter key after chang- 
ing the text box entry.) What effect does this have on the 
probability? 

17.3 Using the "Prob" text box, enter a numerically higher 
probability. (Be sure to press the Return or Enter key after 
changing the text box entry.) What effect does this have on 
the chi-square value? 



Presidential Surnames: A Nonrandom Sequence? 

In a series of numbers, names, or other data, is it possible that the series might exhibit 
some nonrandom tendencies? This chapter provides a procedure that allows us to test 
just that: the runs test for randomness. A "run" occurs when data of one type (e.g., 
above the median versus below the median) happens one or more times in a row; 
and if we have either too many or too few runs, this suggests the data might be 
other than random. 

This technique is a handy way to examine whether a series of data might have 
come under the influence of one or more outside forces. It doesn't tell us exactly 
what the forces might be; it just sends a message that some force, sinister or 
otherwise, appears to be at work on the data. 

Have there been nonrandom tendencies in the sequence of surnames for U.S. 
presidents? In the series below, also listed in file CX14PRES, the last names of the first 
43 presidents are converted to a "1" (name begins with A-M) or a "2" (name begins 
with N-Z): 

George Washington 

211111121222121111111 
1111222111221112112111 

George W. Bush 

Using Minitab and applying the runs test for , met hods, a verSc 

randomness, we find 1 6 streaks or runs in the n onP aralT ' etr ' C 
observed series, compared to 19.14 runs that 5e tof to0 ' s ' ' 

would be expected from a random series. 
Although our series does appear to have some 
long "clusters," the p-value is 0.249 and we are not 
able to reject the null hypothesis that the series is 
random. 

You can verify these results for yourself when you 
get to the runs test later in the chapter. While you're at 
it, check out Exercise 14.60. It gives you a chance to 
statistically test the series of Super Bowl winners (Ameri- 
can Conference versus National Conference), a series 
that many football fans are convinced is decidedly non- 
random. Is it really? 
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learning 
objectives 

After reading this 
chapter, you should 
be able to: 



Differentiate between nonparametric and parametric hypothesis tests. 

Explain the advantages and disadvantages of nonparametric versus parametric 
testing. 

Determine when a nonparametric hypothesis test should be used instead of its 
parametric counterpart. 

Apply each of the nonparametric methods in the chapter to tests of 
hypotheses for which they are appropriate. 



14.1 INTRODUCTION 



Chapters 10 and 11 discussed £-tests for examining hypotheses about either one 
or two samples. Chapter 12 extended this idea to include one-way analysis of 
variance and more than two samples, and also introduced the randomized block 
ANOVA design. This chapter gives alternative approaches to these tests, describes 
the conditions under which such alternatives are necessary, and examines each 
test through the use of a computer statistical package. 

Nonparametric Testing 

A nonparametric test is one that makes no assumptions about the specific shape 
of the population from which a sample is drawn. This is unlike most of our pre- 
vious tests, which assumed that a population was either normally distributed or 
approximately so. When two or more populations were being compared, another 
typical assumption was that their population variances were equal. Also, most 
parametric tests require data to be of the interval or ratio scale of measurement, 
while many nonparametric techniques have no such requirement. In Chapter 13, 
the chi-square tests for goodness of fit, independence, and comparison of sample 
proportions were nonparametric methods having a variety of applications, but 
there are many more procedures in the realm of nonparametric testing. 

A nonparametric test should be used instead of its parametric counterpart 
whenever 

1. data are of the nominal or ordinal scale of measurement, or 

2. data are of the interval or ratio scale of measurement but one or more other 
assumptions, such as the normality of the underlying population distribution, 
are not met. 

Both nonparametric and parametric testing rely on the basic principles of hy- 
pothesis testing. Figure 14.1 shows five of the nonparametric tests in this chapter, 
along with their parametric counterparts from earlier in the text. 

Advantages and Disadvantages 
of Nonparametric Testing 

Compared to parametric tests, nonparametric testing has the following advan- 
tages and disadvantages: 

• Advantages 

1. Fewer assumptions about the population. Most important, the population 
need not be normally distributed or approximately so. Nonparametric tests do 
not assume the population has any specific distribution. 
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FIGURE 14.1 
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Corresponding parametric test: 

r-test, 
one sample 



f-test, 
paired samples 



r-test, two 
independent 
samples 



Randomized block 
analysis of 
variance 



One-way 
analysis of 
variance 



Five of the nonparametric tests discussed in the chapter, with the parametric counterpart for each 



2. The techniques can be applied when sample sizes are very small. 

3. Samples with data of the nominal and ordinal scales of measurement can be 
tested. 

• Disadvantages 

Compared to a parametric test, the information in the data is used less effi- 
ciently, and the power of the test will be lower. For this reason, a parametric 
test is preferable whenever its assumptions have been met. 
Nonparametric testing places greater reliance on statistical tables, if a computer 
statistical package or spreadsheet is not being used. 



1. 



2. 



exercise: 




- 1 

'"Mr. 



■V; 



14.1 What key feature distinguishes nonparametric 
from parametric tests? 

14.2 What are some advantages and disadvantages of 
nonparametric testing? 



14.3 In general, when should a nonparametric technique 
be used instead of its parametric counterpart? 

14.4 Referring to Chapter 1, as necessary, differentiate 
among the nominal, ordinal, interval, and ratio scales 
of measurement. 
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1 4.2 WILCOXON SIGNED RANK TEST FOR ONE SAMPLE 



Description and Applications 

For one sample, the Wilcoxon signed rank method tests whether the sample could 
have been drawn from a population having a hypothesized value as its median. 
As shown in Figure 14.1, its parametric counterpart is the one-sample i-test of 
Chapter 10. 

No assumptions are made about the specific shape of the population except 
that it is symmetric or approximately so. Data are assumed to be continuous and 
of the interval or ratio scale of measurement. When data are not of the interval or 
ratio scale of measurement, an alternative technique called the sign test can be 
used. It is described in Section 14.7. 



The Wilcoxon signed rank test for one sample: 

• Null and alternative hypotheses 

Two-tail Left-tail Right-tail 

test test test 

Null hypothesis, H Q : m = m Q m > m 0 m < m 0 

Alternative hypothesis, H 1 : m + m Q m < m 0 m > m 0 

where m = the population median 

m 0 = a value that has been specified 

• Test statistic, W 

1. For each of the observed values, calculate d t = x t — m Q . 

2. Ignoring observations where d i = 0, rank the \d-\ values so the smallest 
\d;\ will have a rank of 1. If there are ties, assign each of the tied rows the 
average of the ranks they are occupying. 

3. For observations where jc- > m 0 , list the rank in the R+ column. 

4. The test statistic is the sum of the R+ column, W = ^R+. 

• Critical value of W The Wilcoxon signed rank table in Appendix A lists lower 
and upper critical values for various levels of significance, with n = the number 
of observations for which d- ¥= 0. The rejection region will be in either one or 
both tails, depending on the null hypothesis being tested. For n values beyond 
the range of this table, a z-test approximation can be used. 



Example 



Wilcoxon Signed Rank Test, One Sample 

An environmental activist believes her community's drinking water contains at least 
the 40.0 parts per million (ppm) limit recommended by health officials for a certain 
metal. In response to her claim, the health department samples and analyzes drinking 
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Observed 
Concentration 



sehold 


X; 


d ; = X; m„ 
II 0 


Id,. | 


Rank 


R+ 


R 


A 


39.0 ppm 


-1.0 


1.0 


2 




2 


B 


20.2 


-19.8 


19.8 


10 




10 


C 


40.0 


0.0 


0.0 


— 






D 


32.2 


-7.8 


7.8 


6 




6 


E 


30.5 


-9.5 


9.5 


7 




7 


F 


26.5 


-13.5 


13.5 


9 




9 


G 


42.1 


2.1 


2.1 


3.5 


3.5 




H 


45.6 


5.6 


5.6 


5 


5 




I 


42.1 


2.1 


2.1 


3.5 


3.5 




J 


29.9 


-10.1 


10.1 


8 




8 


K 


40.9 


0.9 


0.9 


1 


1 





TABLE 14.1 



Data and calculations 
for Wilcoxon signed 
rank test. The median of 
the hypothesized popula- 
tion distribution 
is m 0 = 40.0 ppm. 



13.0 42.0 



W= £/?+ = 13.0 



water from a sample of 1 1 households in the community. The results are residue lev- 
els of 39.0, 20.2, 40.0, 32.2, 30.5, 26.5, 42.1, 45.6, 42.1, 29.9, and 40.9 ppm. The 
data are also in file CX14WAT. At the 0.05 level of significance, can we conclude that 
the community's drinking water might equal or exceed the 40.0 ppm recommended 
limit? 

SOLUTION 

We will use a left-tail test in evaluating the activist's statement. The null and alter- 
native hypotheses are 

H Q : m ■ 40.0 ppm 
H x : m < 40.0 ppm 

Table 14.1 shows the data and the calculations needed to conduct the 
Wilcoxon signed rank test. For each household, d i = x ; — m 0 is the difference be- 
tween the observed concentration and the specified population median 
(m 0 = 40.0 ppm) in the null hypothesis. The absolute values of these differences 
are placed in the | J-| column, then the nonzero \dj\ values are ranked. 

The smallest nonzero \d:\ value is the 0.9 for household K. This household 
gets a rank of 1 in the "Rank" column. Household A has the next smallest <i-|, 
so it gets a rank of 2. Households G and I are tied, each having \ d t \ = 2.1. Since 
these two of- values are taking up ranks 3 and 4, each is assigned the average of 
the two ranks, or (3 + 4)/2 = 3.5. For household C, |<f.| = 0.0, and this row is 
omitted from further analysis. 

For those households having jc. > 40.0, the rank is placed in the R+ column. 
Although it isn't used in the analysis, the corresponding R— column is also listed, 
since its inclusion can help us avoid errors in the assignment of the ranks. The 
sum of both columns is shown, and the test statistic W is the sum of the R+ col- 
umn, or W = 13.0. 

For n = the number of observations for which d i ¥= 0, the Wilcoxon signed 
rank statistic will be between W = 0 (if all of the d i values are negative) and 
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TABLE 14.2 



A portion of the table of 
critical values for the 


Two-Tail Test: 
One-Tail Test: 


a — 
a — 


0.20 
0.10 


a = 0.10 
a = 0.05 


a = 0.05 
a = 0.025 


a = 0.02 
a = 0.01 


a = 0.01 
a = 0.005 


Wilcoxon sicjned rank test 
















in Appendix A. 


n = 4 


1, 


9 


0, 10 


0, 10 


0, 10 


0, 10 


Lower and upper critical 


5 


3, 


1 2 


1,14 


0, 1 5 


0, 1 5 


0, 1 5 


values are provided. 


6 


4, 


1 7 


3, 18 


1 , 20 


0, 21 


0, 21 




7 


6, 


22 


4, 24 


3, 25 


1 , 27 


0, 28 




Q 

o 


Q 


Z / 


o, jU 


4, oZ 


Z, 34 


I , D D 




9 


11, 


34 


9, 36 


6, 39 


4, 41 


2, 43 




10 


15, 


40 


11,44 


9, 46 


6, 49 


4, 51 




11 


18, 


48 


14, 52 


11, 55 


8, 58 


6, 60 




12 


22, 


56 


18, 60 


14, 64 


10, 68 


8, 70 



Sources: Adapted from Roger C. Pfaffenberger and James H. Patterson, Statistical Methods for Business and 
Economics, (Homewood, III.: Richard D. Irwin, Inc., 1987), p. 1 176; and R. L. McCornack, "Extended Tables of 
the Wilcoxon Matched Pairs Signed Rank Statistics," Journal of the American Statistical Association, 60 (1965), 
pp. 864-871. 



W = n(n + l)/2 (if all of the d t values are positive). The largest possible value, 
n(n + l)/2, is the formula for the sum of consecutive integers from 1 to n. In this 
test, n = 10 nonzero differences, and W must have a value between 0 and 55.0. 
(Note that = 13.0 plus ~£R- = 42.0 is 55.0, the maximum value that W 

could have attained.) W = 13.0 is relatively low within these possible values. 

The critical value of W can be found from the table of critical values for the 
Wilcoxon signed rank test in Appendix A. A portion of this table is shown in Table 
14.2. Upper and lower critical values are listed, but we need only a lower critical 
value for our left-tail test. For n = 10 nonzero differences and a = 0.05, the criti- 
cal value is 11. The test statistic, W = 13.0, exceeds 11, and the null hypothesis 
cannot be rejected at the 0.05 level. At the 0.05 level, we are unable to reject the 
possibility that the city's water supply might have at least 40.0 ppm of the metal. 



The Normal Approximation 

When the number of observations for which d i # 0 is n > 20, a z-test will be a 
close approximation to the Wilcoxon signed rank test. This is possible because 
the W distribution approaches a normal curve as n becomes larger. The format for 
this approximation is shown below. 

Z-test approximation to the Wilcoxon signed rank test: 
Test statistic: 



where W = sum of the R+ ranks 

n = number of observations for 
which d- # 0 



W 



n(n + 1) 



n(n + l)(2n + 1) 
24 
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Although our example has only n = 10 nonzero differences, we'll use it to 
demonstrate the procedure for using the normal approximation. Because the sam- 
ple size is smaller than the more desirable n > 20, this approximation will be 
"rougher" than if n were 20 or larger. 

Substituting W = 13.0 and n = 10 into this expression, z = —14.5/9.81, or 
Z = —1.48. For a left-tail test at the 0.05 level, the critical value of z will be 
— 1.645 (from the normal distribution table). Since z = —1.48 falls to the right of 
this critical value, the null hypothesis cannot be rejected. We are unable to reject 
the possibility that the city's water supply might have at least 40.0 ppm of 
the metal. 

Computer Solutions 14.1 shows how we can use Excel and Minitab in apply- 
ing the Wilcoxon signed rank test to the data in Table 14.1. The Excel method 
uses the z-test approximation, and is best used when there are at least 20 nonzero 
differences. The "estimated median" shown by Minitab is computed through a 
special algorithm, and is not necessarily the same as the median for the sample. 




14.5 What is the parametric counterpart to the Wilcoxon 
signed rank test for one sample? Compare the 
assumptions involved in using the respective tests. 

14.6 Given the following randomly selected observations, 
use the Wilcoxon signed rank test and the 0.05 level of 
significance in examining whether the population median 
could be equal to 4.9. Using the appropriate statistical 
table, what is the most accurate statement that can be 
made about the p-value for the test? 

2.9 2.7 5.2 5.3 4.5 3.2 3.1 4.8 

14.7 Given the following randomly selected observations, 
use the Wilcoxon signed rank test and the 0.01 level of 
significance in examining whether the population median 
could be greater than 37.0. Using the appropriate statisti- 
cal table, what is the most accurate statement that can be 
made about the p-value for the test? 

34.6 40.0 33.8 47.7 41.4 40.2 47.0 
39.5 36.1 48.1 39.1 45.0 45.7 46.6 

14.8 According to the director of a county tourist 
bureau, there is a median of 10 hours of sunshine per day 
during the summer months. For a random sample of 20 
days during the past three summers, the number of hours 
of sunshine has been recorded as shown below. Use the 
0.05 level in evaluating the director's claim. 

89 8 10 977 97 7 

9 8 11 9 10 7 8 11 8 12 hours 

14.9 As a guideline, a local Internal Revenue Service of- 
fice recommends that auditors try to spend 15 minutes or 



less with taxpayers who are called in for audit interviews. 
In a recent random sample of 22 interviews, the interview 
times were as shown below. Using the 0.025 level, do 
these data suggest that the median interview time for 
auditors from this office exceeds the recommended guide- 
line? Using the appropriate statistical table, what is the 
most accurate statement we can make about the p-value 
for the test? 

12 15 16 16 17 10 22 13 20 20 20 
19 24 12 11 18 23 23 13 24 24 12 

/ data Set / Note: Exercises 14.10 and 14.11 require 
a computer and statistical software. 

14.10 Fhe placement director for a university claims that 
last year's graduates had a median starting salary of 
$35,000. For a random sample of graduates, the starting 
salaries, in thousands of dollars, are listed in data file 
XR14010. At the 0.05 level, could the director's claim be 
true? Identify and interpret the p-value for the test. 

14.11 The median age for a male at the time of his divorce 
has been reported as 37.4 years. In a given region, a 
random sample of divorces showed the ages for the males 
as listed in data file XR14011. Using the 0.05 level of sig- 
nificance, test whether the median age at which married 
males in this region are divorced could be the same as that 
for the nation as a whole. Identify and interpret the p- 
value for the test, source: Bureau of the Census, Statistical Abstract 
of the United States 1999, p. 1 1 1 . 
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computer solutions 14.1 


1 3^HrrlSB9<ai»xlrSci " 





Wilcoxon Signed Rank Test for One Sample 

This hypothesis test is for the median of a population. 

EXCEL 





A B 


C 


D 


1 


Wilcoxon Signed Rank Sum Test 




2 










3 


Difference 




ppm - medianO 


4 


T+ 




13 




5 


T- 




42 




6 


Observations (for test) 


10 




7 


zStat 




-1.478 




8 


P(Z<=z) one-tail 


0.0697 




9 


z Critical one-tail 


1.645 




10 


P(Z<=z) two-tail 


0.1394 




11 


z Critical two-tail 


1.960 





Excel Wilcoxon Signed Rank Test for One Sample, Using the z-Test Approximation 

1. The data in Table 14.1 are also in file CX14WAT.XLS, with the label (ppm) in A1 and the data in A2:A12. Enter the me- 
dian associated with H Q (40) into B2. Click at the bottom right corner of B2, then drag and fill through B1 2 so that 40 
will be in all of these cells. Enter the label (medianO) into B1. 

2. Click Tools. Click Data Analysis Plus. Click Wilcoxon Signed Rank Sum Test. Click OK. 

3. Enter A1:A12 into the Variable 1 Range box. Enter B1:B12 into the Variable 2 Range box. Click Labels. Enter 0.05 
into the Alpha box. Click OK. The printout includes the number of nonzero differences, the sum of the ranks for the 
positive differences and the negative differences, the z-statistic, and the p-value for this one-tail test (0.0697). 

The z-test approximation here is applicable when there are at least 20 nonzero differences. For the data in file 
CX14WAT.XLS, this requirement is not met, but we have applied the approximation here to demonstrate the steps involved. 

Note: As an alternative, you can use Excel worksheet template TMWILSR1.XLS, provided with the text. The procedure is 
described within the template. 

MINITAB 

Minitab Wilcoxon Signed Rank Test for One Sample 
Wilcoxon Signed Rank Test: ppm 

Test of median = 40.00 versus median < 40.00 

N for Wilcoxon Estimated 
N Test Statistic P Median 

ppm 11 10 13.0 0.077 35.85 

1. With the Table 14.1 data values in column C1 of file CX14WAT.MTW: Click Stat. Select Nonparametrics. Click 1-Sample 
Wilcoxon. 

2. Enter C1 into the Variables box. Select Test median and enter 40 into the box. Select less than within the Alterna- 
tive box. Click OK. The printout includes the p-value (0.077) as well as a restatement of the null and alternative 
hypotheses. Unlike the z-test method, this Minitab test can be used even with small data sets. 
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WILCOXON SIGNED RANK TEST FOR 
COMPARING PAIRED SAMPLES 



Description and Applications 

The Wilcoxon signed rank test can also be used for paired samples. In this context, 
it is the nonparametric counterpart to the paired-samples i-test of Chapter 11. 

As in Section 14.2, the technique assumes the data are continuous and of the 
interval or ratio scale of measurement. In this application, the measurement of in- 
terest is the difference between paired observations, or d { = x, — y.. Depending 
on the type of test, the null and alternative hypotheses will be as follows: 

Two-tail Left-tail Right-tail 

test test test 

Null hypothesis, H 0 : m d = 0 m d > 0 m d < 0 

Alternative hypothesis, Hy m d + 0 m d <0 m d >0 

where m d = the population median of d j = x ( . - y ( . 

The population of d values is assumed to be symmetric or nearly so, but need 
not be normally distributed or have any other specific shape. Applying the 
Wilcoxon signed rank test to paired samples is nearly identical to its use for one 
sample, discussed in the preceding section. Because the two approaches are so sim- 
ilar, this one will be described through the use of an example problem. 



cxamp 


If 







Wilcoxon Signed Rank Test, Paired Samples 



Two computer software packages are being considered for use in the inventory 
control department of a small manufacturing firm. The firm has selected 12 dif- 
ferent computing tasks that are typical of the kinds of jobs such a package would 
have to perform, then recorded the number of seconds each package required to 
complete each task. The results are shown in Table 14.3 (page 512), which includes 
the difference {d-j in the number of seconds the two packages required for each task. 
The data are also in file CX14TIME. At the 0.10 level, can we conclude that the 
median difference (mj) for the population of such tasks might be zero? 

SOLUTION 

The background for this test does not imply that one package is any faster or 
slower than the other, so a two-tail test is used. The null and alternative hypo- 
theses are 

^o : m d = 0 The population median of d t = x- — y i is zero. 
Hy- m d ¥= 0 The population median of d- = x i — y- is not zero. 

As Table 14.3 shows, the difference (d-) is calculated for each pair of x i and y ■ 
values, the absolute values for d i are obtained, then the \d^ values are ranked. Fi- 
nally, the ranks associated with positive differences are listed in the R+ column 
and added to get the observed value of the test statistic, W. 



14.3 
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TABLE 14.3 



Data and calculations for 
the application of the 
Wilcoxon signed rank test 
to the comparison of paired 
samples. Prior to the test, 
there was no reason to be- 
lieve one software package 
took more time than the 
other, so the test is two-tail. 



Computing 



Time Required 
for Software 
Packages 
x and y 



Tack 


x i 


Yi 


di ~ x i Yi 




Rani/ 


A r 


D_ 
A 


A 


24.0 


23.1 


0.9 


0.9 


1 


1 




B 


16.7 


20.4 


-3.7 


3.7 


4 




4 


C 


21.6 


17.7 


3.9 


3.9 


5 


5 




D 


23.7 


20.7 


3.0 


3.0 


2.5 


2.5 




E 


37.5 


42.1 


-4.6 


4.6 


6 




6 


F 


31.4 


36.1 


-4.7 


4.7 


7 




7 


G 


14.9 


21.8 


-6.9 


6.9 


10 




10 


H 


37.3 


40.3 


-3.0 


3.0 


2.5 




2.5 


1 


17.9 


26.0 


-8.1 


8.1 


11 




11 


J 


15.5 


15.5 


0.0 


0.0 








K 


29.0 


35.4 


-6.4 


6.4 


9 




9 


L 


19.9 


25.5 


-5.6 


5.6 


8 


8.5 


8 

57.5 



1/1/ = = 8.5 



The smallest nonzero \d t \ value is \d t \ = 0.9 seconds for computing task A, 
and it gets a rank of 1. Tasks D and H are tied, each having \d-\ = 3.0 seconds. 
These two tasks are occupying ranks 2 and 3, so each is assigned the average of the 
two ranks, or 2.5. Because \d-\ = 0.0 for task J, it is dropped from the analysis. 

As Table 14.3 shows, the test statistic is W = = 8.5. For n = 11 nonzero 

differences and a = 0.10, the Wilcoxon signed rank table (see the portion shown in 
Table 14.2) gives lower and upper critical values for W of 14 and 52, respectively. 
The observed value, W = 8.5, falls outside these limits, and the null hypothesis is re- 
jected. Based on these data, the software packages are not equally rapid in handling 
computing tasks like those in the sample. At the 0.10 level, we are able to conclude 
that the population median for d i = x i — y- is not equal to zero and that package x 
is faster than package y in handling computing tasks like the ones sampled. 



The z-test approximation described in the preceding section can also be used 
for paired samples. Again, it is desirable that the number of nonzero differences 
be at least 20, but we can use this method to get a very rough approximation if n 
is smaller. Using the normal approximation in this case, the observed z is 
Z = —2.18. This is based on W = 8.5, n = 11, and the z expression in the preced- 
ing section. Using the normal distribution table, we find that 0.5000 — 0.4854, or 
0.0146 of the area falls in the left tail. Since the test is two-tail, the approximate 
p-value is 2(0.0146), or 0.0292. 

Computer Solutions 14.2 shows how we can use Excel and Minitab in apply- 
ing the Wilcoxon signed rank test to the paired data in Table 14.3. The Excel 
method uses the z-test approximation, and is best used when there are at least 



Chapter 14: Nonparametric Methods 



513 



• • - i 

computer solutions 14.2 




Wilcoxon Signed Rank Test for Comparing Paired Samples 

This hypothesis test applies to paired samples, such as before-and-after comparisons for the same 
items or subjects. 

EXCEL 





A 


B 


C 


1 


Wilcoxon Signed Rank Sum Test 


2 








3 


Difference 




X- Y 


4 


T+ 




8.5 


5 


T- 




57.5 


6 


Observations (for test) 


11 


7 


zStat 




-2.178 


8 


P(Z<=z) one-tail 


0.0147 


9 


z Critical one-tail 


1.282 


10 


P(Z<=z) two-tail 


0.0294 


11 


z Critical two-tail 


1.645 



Excel Wilcoxon Signed Rank Test for Paired Samples, Using the z-test Approximation 

1. The data in Table 14.3 are in adjacent columns in file CX14TIME.XLS, with the labels in A1 and B1, and the data in 
A2:B1 3. Click Tools. Click Data Analysis Plus. Click Wilcoxon Signed Rank Sum Test. Click OK. 

2. Enter A1:A13 into the Variable 1 Range box. Enter B1:B13 into the Variable 2 Range box. Click Labels. Enter 0.10 
into the Alpha box. Click OK. The printout includes the number of nonzero differences, the sum of the ranks for the 
positive differences and the negative differences, the z-statistic, and the p-value for this two-tail test (0.0294). 

The z-test approximation here is applicable when there are at least 20 nonzero differences. For the data in file 
CX14TIME.XLS, this requirement is not met, but we have applied the approximation here to demonstrate the steps involved. 

Note: As an alternative, you can use Excel worksheet template TMWILSR1.XLS, provided with the text. The procedure is 
described within the template. 



MINI TAB 

Minitab Wilcoxon Signed Rank Test for Paired Samples 
Wilcoxon Signed Rank Test: DIFF 

Test of median = 0.000000 versus median not = 0.000000 

N for Wilcoxon Estimated 
N Test Statistic P Median 

DIFF 12 11 8.5 0.033 -3.100 

1 . With the Table 14.3 data values in columns C1 and C2 of file CX14TIME.MTW, we must first calculate the differences 
and insert them into C3: Click Calc. Click Calculator. Enter C1-C2 into the Expression box. Enter C3 into the Store 
result in variable box. Click OK. Type the label (DIFF) at the top of column C3. 

2. With the differences in C3, click Stat. Select Nonparametrics. Click 1-Sample Wilcoxon. 

3. Enter C3 into the Variables box. Select Test median and enter 0 into the box. Select not equal within the Alterna- 
tive box. Click OK. Unlike the z-test method, this Minitab test is not an approximation, and it can be used even with 
small data sets. 
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20 nonzero differences. The "estimated median" shown by Minitab is computed 
through a special algorithm, and is not necessarily the same as the median for the 
sample. The Excel and Minitab p-values differ slightly from each other, as well as 
from our p-value that was based on the normal distribution table and its inherent 
gaps between listed z values. 



■p" exercises 




14.12 What is the parametric counterpart to the 
Wilcoxon signed rank test for paired samples? Compare 
the assumptions involved in using the respective tests. 

14.13 For nine paired samples, the data are as listed 
below. For d = the difference between paired 
observations (x 1 — x 2 ), use the 0.10 level of significance 
in testing H Q : m d < 0 versus fL: m d > 0. 

Sample: 123456789 



7.8 4.0 
9.7 2.1 



4.2 9.8 
3.7 8.2 



8.7 4.3 
6.6 5.7 



5.6 7.7 



8.9 
8.6 



14.14 For ten paired samples, the data are as listed here. 
For d = the difference between paired observations 
(x 1 — x 2 ), use the 0.10 level of significance in testing 



H Q : m d = 0 versus H^: m d + 0. 



Sample: 1 2 3 4 5 



8 9 10 



x, 12.5 11.2 14.4 8.9 10.5 12.3 11.8 12.4 13.9 8.8 

x 2 12.7 11.2 15.6 8.7 11.8 12.2 11.3 13.4 14.48.3 

14.15 Tom Frederick is the computer support manager for 
a large company whose employees have been complaining 
about "spam," which many of us know as unwanted 
e-mail to solicit our money or our attention. Tom asked a 
sample of 9 employees to keep track of the number of 
spam messages they received during the previous week. 
He then installed a spam "filter" into the e-mail system in 
attempt to block some of the spam by identifying key 
words that often appear in such messages. During the 
week following installation of the filter, the number of 
spam messages received by each of the 9 employees was 
again counted, with the results shown here. At the 0.05 
level of significance, can we conclude that Tom's filtering 
system is effective in reducing the weekly number of spam 
messages an employee receives? 



Employee: 



123456789 



Spam Messages, 

Before Filter 28 27 24 18 23 28 25 20 28 
Spam Messages, 

After Filter 19 25 25 19 20 21 18 21 22 



14.16 A researcher studying the purchase habits of eight 
married couples has found they spent the amounts shown 
below for clothing. At the 0.05 level, can we conclude 
that the population of husbands and wives do not spend 
equally in this purchase category? 



1 



8 



Husband $300 240 230 250 230 300 470 230 
Wife $380 375 180 390 400 480 470 320 

14.17 A new procedure is being tested for the assembly of 
electronic circuit boards. For each of five employees, the 
time required to assemble a board is measured for both the 
current procedure and the method that has been proposed. 
Given the times listed in the table, use the 0.025 level of 
significance in testing whether the new method could be an 
improvement over the one currently used. Using the appro- 
priate statistical table, what is the most accurate statement 
that can be made about the p-value for the test? 

Current Proposed 



Person 



66.4 
39.8 
50.5 
65.1 
55.6 



70.0 
35.4 
43.7 
59.0 

49.9 seconds 



/ data Set / Note: Exercises 14.18 and 14.19 require 
a computer and statistical software. 

14.18 Early in the 1995-1996 NHL hockey season, play- 
ers and coaches observed with great interest an apparent 
drop in the number of goals scored so far during that sea- 
son, compared to scoring over the same number of games 
during the previous season. For each team, the number of 
goals in the 1995-1996 season versus the same stage of 
the 1994-1995 season are listed in data file XR14018. At 
the 0.05 level, could the drop in goals be simply due to 
chance, or does it appear that something may have hap- 
pened between seasons to reduce teams' scoring power? 
Identify and interpret the p-value for the test, source: Kevin 
Allen, "Some Players, Coaches Applaud Trend Toward Lower-Scoring 
Games," USA Today, November 12, 1996, p. 5C. 
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14.19 A company has randomly selected a number of em- 
ployees to test the effectiveness of a speed-reading course. 
If the course is found to significantly increase reading 
speed, it will be offered to all of the firm's clerical employ- 
ees. For each of the employees selected for the test, read- 



ing speeds (words per minute) before and after the course 
are listed in data file XR14019. Using the 0.01 level of sig- 
nificance in evaluating the effect of the course, should the 
firm offer the program to all of its clerical employees? 
Identify and interpret the p-value for the test. 



WILCOXON RANK SUM TEST FOR COMPARING 1 4.4 

TWO INDEPENDENT SAMPLES 



Description and Applications 

The Wilcoxon rank sum test compares two independent samples and is a non- 
parametric counterpart to the two-sample pooled £-test of Chapter 11. The test 
assumes that the data are at least of the ordinal scale of measurement, that the 
samples are independent and randomly selected, and that the populations have 
approximately the same shape. 

When doubts exist as to whether data meet the requirements of the interval 
scale of measurement, or if it cannot be assumed that the populations are normal, 
with equal variances, the Wilcoxon rank sum test should be used instead of the 
pooled-variances i-test for independent samples. The sample sizes need not be 
equal, and the test is as shown: 

Wilcoxon rank sum test for comparing two independent samples: 

• Null and alternative hypotheses 

Two-tail Left-tail Right-tail 

test test test 

H 0 : m 1 = m 2 m 1 > m 2 m 1 < m 2 

Hy m x + m 2 m 1 < m 2 m 1 > m 2 

where m 1 and m 2 are the population medians. 

• Test statistic, W 

1. Designate the smaller of the two samples as sample 1. If the sample sizes 
are equal, either sample may be designated as sample 1. 

2. Rank the combined data values as if they were from a single group. The 
smallest data value gets a rank of 1, the next smallest, 2, and so on. In 
the event of a tie, each of the tied values gets the average rank that the 
values are occupying. 

3. List the ranks for data values from sample 1 in the R 1 column and the 
ranks for data values from sample 2 in the R 2 column. 

4. The observed value of the test statistic is W = ^R r 

• Critical value of W The Wilcoxon rank sum table in Appendix A lists lower 
and upper critical values for the test, with n x and n 2 as the number of 
observations in the respective samples. The rejection region will be in either 
one or both tails, depending on the null hypothesis being tested. For and « 2 
values beyond the range of this table, a z-test approximation is used in carry- 
ing out the test. 



516 



Part 4: Hypothesis Testing 



example 



Wilcoxon Rank Sum Test 

In evaluating the flexibility of rubber tie-down lines, an inspector selects a random 
sample of the straps and counts the number of 360-degree twists each will with- 
stand before breaking. For 7 lines from one production lot, the number of turns 
before breaking is 112, 105, 83, 102, 144, 85, and 50. For 10 lines from a second 
production lot, the number of turns before breaking is 91, 183, 141, 219, 105, 
138, 146, 84, 134, and 106. The data are also in file CX14TIE. At the 0.05 level, can 
it be concluded that the two production lots have the same median flexibility? 

SOLUTION 

Prior to the testing, there was no reason to believe that one lot was either more 
flexible or less flexible than the other, so the test is two-tail. The null and alterna- 
tive hypotheses are H 0 : m 1 = m 2 and H^, m l + m 2 , where m x and m 2 are the me- 
dians of the respective populations. 

Combining the data values and ranking them, the rankings are shown in Table 

14.4. The smallest value, 50, gets a rank of 1 and the next smallest, 83, gets the 
rank of 2. This continues until the largest value, 219, has been assigned the rank 
of 17. Two values are tied at 105, and each is given the rank of 7.5. 

The sum of ranks of values from sample 1 is W = = 44.5, the observed 
value of the test statistic. The smaller of the two samples has already been desig- 
nated as sample 1. If this had not been done, the identification of the samples 
would have to be reversed in order to carry out the analysis. 

A portion of the Wilcoxon rank sum table in Appendix A is provided in Table 

14.5. For n 1 = 7, n 2 = 10, and a two-tail test at the 0.05 level, the lower and 
upper critical values of W arc 43 and 83, respectively. Since the calculated value, 
W = 44.5, falls between these limits, the null hypothesis cannot be rejected at the 
0.05 level. Our conclusion is that the median flexibility of the two lots could be 
the same. 



TABLE 14.4 



Data and calculations for 
Wilcoxon rank sum 
comparison of two 
independent samples. The 
sample with the smaller 
number of observations is 
designated as sample 1 . 



Sample 1 and Ranks 



Sample 2 and Ranks 



112 


10 


91 


5 


105 


7.5 


183 


16 


83 


2 


141 


13 


102 


6 


219 


17 


144 


14 


105 


7.5 


85 


4 


138 


12 


50 


1 


146 


15 




44.5 


84 


3 




i 




134 
106 


11 
9 

108.5 




w-- 


= 2«, = 44.5 
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a = 0.025 (One-Tail) or a = 0.05 (Two-Tail) 



n 1 




3 


4 




5 


6 


7 




8 


9 


10 


3 


5, 


16 


6, 


18 


6 


21 


7, 23 


7, 


26 


8, 


28 


8, 31 


9, 33 


4 


6, 


18 


11, 


25 


12 


28 


12, 32 


13, 


35 


14, 


38 


15, 41 


16, 44 


5 


6, 


21 


12, 


28 


18 


37 


19, 41 


20, 


45 


21, 


49 


22, 53 


24, 56 


6 


7, 


23 


12, 


32 


19 


41 


26, 52 


28, 


56 


29, 


61 


31, 65 


32, 70 


7 


7, 


26 


13, 


35 


20 


45 


28, 56 


37, 


68 


39, 


73 


41, 78 


43, 83 


8 


8, 


28 


14, 


38 


21 


49 


29, 61 


39, 


73 


49, 


87 


51, 93 


54, 98 


9 


8, 


31 


15, 


41 


22 


53 


31, 65 


41, 


78 


51, 


93 


63, 108 


66, 114 


10 


9, 


33 


16, 


44 


24 


56 


32, 70 


43, 


83 


54, 


98 


66, 114 


79, 131 



TABLE 14.5 



A portion of the Wilcoxon 
rank sum table from 
Appendix A. Lower and 
upper limits are provided, 
and, if sample sizes are 
unequal, sample 1 will be 
the one with the smaller 
number of observations. 



Note: is the smaller of the two samples — i.e., n ] a n 2 . 

Sources: F. Wilcoxon and R. A. Wilcox, Some Approximate Statistical Procedures (New York: American 
Cyanamid Company, 1964), pp. 20-23. 



The Normal Approximation 

When n 1 > 10 and « 2 — 10, a normal approximation to the Wilcoxon rank sum 
test can be used. The format is as follows: 



Z-test approximation to the Wilcoxon rank sum test: 
Test statistic: 



nAn + 1) 
W - 



z = — -, where W = sum of the R 1 ranks 

j ni n 2 {n + 1) n = ni + n 2 

12 



Although our example falls short of the n 1 > 10, « 2 — 10 rule of thumb, it 
will be used to demonstrate the normal approximation. The results will be more 
"approximate" than if the rule of thumb had been met. 

Substituting W = 44.5, n 1 = 7, n 2 = 10, and n = 17 into the above expres- 
sion, we find that z = —1.81. For a two-tail test at the 0.05 level, the critical z val- 
ues are —1.96 and +1.96. The calculated z is between these limits, and the null 
hypothesis cannot be rejected. 

The approximate p-value for this two-tail test is twice the area under the 
normal curve to the left of z = —1.81. Using the normal distribution table, 
this area is found to be 0.5000 - 0.4649, or 0.0351. The p-value is 2(0.0351), 
or 0.0702. 

Computer Solutions 14.3 shows how we can use Excel and Minitab in ap- 
plying the Wilcoxon rank sum test to compare the independent samples in 
Table 14.4. The Excel method uses the z-test approximation, and is best used 
when each sample size is at least 10. Note that the Minitab procedure calls for 
the Mann-Whitney test, which is equivalent to the Wilcoxon test described in 
the text. 
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computer solutions 14.3 



IfflHK 

Wilcoxon Rank Sum Test for Two Independent Samples 

This hypothesis test compares the medians of two independent samples. 

EXCEL 






A 


B C 


D 


1 


Wilcoxon Rank Sum Test 




2 










3 






Rank Sum 


Observations 


4 


LOT1 




44.5 


7 


5 


LOT2 




108.5 


10 


6 


z Stat 




-1.81 




7 


P(Z<=z) one-tail 


0.0355 




8 


z Critical one-tail 


1.645 




9 


P(Z<=z) two-tail 


0.071 




10 


z Critical two-tail 


1.960 





Excel Wilcoxon Rank Sum Test for Two Independent Samples, Using the z-test Approximation 

1. The data in Table 14.4 are in adjacent columns in file CX14TIE.XLS, with labels in A1 and B1, and data in A2:A8 and 
B2:B1 1 . Click Tools. Click Data Analysis Plus. Click Wilcoxon Rank Sum Test. Click OK. 

2. Enter A1:A8 into the Variable 1 Range box. Enter B1:B11 into the Variable 2 Range box. Click Labels. Enter 0.05 
into the Alpha box. Click OK. The printout includes the p-value for this two-tail test (0.071). 

The z-test approximation here is applicable when each sample size is at least 1 0. For the data in file CX14TIE.XLS, this re- 
quirement is not met, but we have applied the approximation here to demonstrate the steps involved. 

Note: As an alternative, you can use Excel worksheet template TMWILRS2.XLS, provided with the text. The procedure is 
described within the template. 



MINITAB 

Minitab Wilcoxon Rank Sum Test for Two Independent Samples 

Mann-Whitney Test and CI: LOT 1, LOT 2 

N Median 
LOT 1 7 102.00 
LOT 2 10 136.00 

Point estimate for ETA1-ETA2 is -34.00 

95.5 Percent CI for ETA1-ETA2 is (-78.00,2.99) 

W = 44.5 

Test of ETA1 = ETA2 vs ETA1 not = ETA2 is significant at 0.0790 
The test is significant at 0.0788 (adjusted for ties) 

1. With the Table 14.4 data values in columns C1 and C2 of file CX14TIE.MTW: Click Stat. Select Nonparametrics. 
Click Mann-Whitney, the equivalent test that Minitab performs. 

2. Enter C1 into the First Sample box. Enter C2 into the Second Sample box. Select not equal within the Alternative 

box. Click OK. Unlike the z-test method, this Minitab test can be used even with small data sets. 
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14.20 Differentiate between the Wilcoxon signed rank 
test and the Wilcoxon rank sum test, and explain the 
circumstances under which each should be applied. 

14.21 What is the parametric counterpart to the 
Wilcoxon rank sum test? Compare the assumptions 
involved in using the respective tests. 

14.22 For the following random samples from two 
populations, use the 0.05 level of significance in testing 



m \ = m l versus H^: 



Sample 1: 40 34 53 28 41 
Sample 2: 29 31 52 29 20 



31 26 



14.23 For the following random samples from two popu- 
lations, use the 0.025 level of significance in testing 

H Q : Wj & m 2 versus Hj: < m 2 - 

Sample 1: 46.8 48.0 72.6 55.1 43.6 

53.0 41.8 45.5 

Sample 2: 56.4 59.3 59.1 58.4 57.9 

46.4 49.6 50.2 55.8 53.8 

14.24 Twelve family pets have been randomly placed into 
two groups. Group 1 receives the standard version of a 
flea powder, while group 2 receives a modified formula. 
Four days following treatment, researchers count the 



number of fleas collected from each pet during a 2-minute 
period. The results are shown below. At the 0.025 level, 
is the new formula more effective than the old? 

Number of fleas after old formula: 
17 16 17 4 8 15 
Number of fleas after new formula: 
6 112 7 5 

/ data Set / Note: Exercises 14.25 and 14.26 require 
a computer and statistical software. 

14.25 A shop owner has collected random samples 
of sales receipts for credit purchases made with 
MasterCard and Visa. The amounts are listed in data file 
XR14025. At the 0.05 level, can we conclude that the 
shop's median sale for MasterCard is the same as that 
for Visa? Identify and interpret the p- value for the test. 

14.26 A word-processing firm has narrowed down its 
choice of a printer to two major brands. Using a 20-page 
document that is typical of the firm's work, a number of 
printers of each brand are timed while they generate a 
letter-quality copy of the document. The times are listed 
in data file XR14026. At the 0.05 level, can we conclude 
that the two brands are equally fast? Identify and 
interpret the p-value for the test. 



KRUSKAL-WALLIS TEST FOR COMPARING MORE 1 4.5 

THAN TWO INDEPENDENT SAMPLES 



Description and Applications 

An extension of the Wilcoxon rank sum technique of the previous section, the 
Kruskal-Wallis test compares more than two independent samples. It is the non- 
parametric counterpart to the one-way analysis of variance. However, unlike one- 
way ANOVA, it does not assume that samples have been drawn from normally 
distributed populations with equal variances. The scale of measurement of the 
data must be at least ordinal, and the samples are assumed to be randomly 
selected from their respective populations. 

In this test, the null hypothesis is that the medians of the k populations are the 
same, or H 0 : m l = m 2 = ■ • ■ = m k . The test is one-tail and is carried out as follows: 

Kruskal-Wallis test for comparing more than two independent samples: 
• Null and alternative hypotheses 

H 0 : m 1 = m 2 = ••• = m k for the / = 1 through k populations. 

(The population medians are equal.) 
Hj: At least one m- differs from the others. 

(The population medians are not equal.) 



{continued) 
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• Test statistic, H 

1. Rank the combined data values as if they were from a single group. The 
smallest data value gets a rank of 1, the next smallest, 2, and so on. In 
the event of a tie, each of the tied values gets their average rank. 

2. Add the ranks for data values from each of the k groups, obtaining *£Rp 
2# 2 , through ^R k . 

3. The calculated value of the test statistic is 



H 12 



nln + 1) 



3(n + 1) 



'1 "2 n k 

where « l5 n 2 , = the respective sample sizes 

for the k samples 
n = n 1 + n 2 + ••• + 

Critical value of H The distribution of H is closely approximated by the chi- 
square distribution whenever each sample size is at least 5 and, for a = the 
level of significance for the test, the critical H is the chi-square value for which 
df = k — 1 and the upper-tail area is a. If the calculated H exceeds the criti- 
cal value, the null hypothesis is rejected. Otherwise, it cannot be rejected. If 
one or more of the sample sizes is < 5, either a computer statistical package 
or special tables must be used for the test. 



example 



Kruskal-Wallis Test 

Each of three aerospace companies has randomly selected a group of technical staff 
workers to participate in a training conference sponsored by a supplier firm. The 
three companies have sent 6, 5, and 7 employees, respectively. At the beginning of 
the session, a preliminary test is given, and the scores are shown in Table 14.6. The 
data are also in file CX14TECH. At the 0.05 level, can we conclude that the median 
scores for the three populations of technical staff workers could be the same? 



SOLUTION 

As in the Wilcoxon rank sum test, the combined data values are ranked, with the 
smallest value receiving a rank of 1. The process is summarized in Table 14.6, 
which also shows the sum of the ranks for each sample. There is one tie, and each 
person who scored 67 on the test receives a rank of 7.5. 

The sums of the ranks are = 32, ^l,R 2 = 45, and = 94. There is a 
total of n = 18 observations in the k = 3 samples, with n x = 6, « 2 = 5, and 
« 3 = 7. Substituting these into the expression for the test statistic, H, we have 



H 



12 [(S^j) 2 (2K 



n(n + 1) 
12 



+ 



n 



1 

_ 322 
18(18 + 1) [~6~ 



45 2 942 
+ + 

5 7 



+ 



3(18 + 1) 



3(« + 1) 



7.490 
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Technical Staff Technical Staff Technical Staff ^^^^^^j^^^^^^^ 

from Firm 1 from Firm 2 from Firm 3 Exam scores and ranks for 

members of the three 



Test Score 


Rank 


Test Score 


Rank 


Test Score 


Rank 


groups of technical staff 


67 


7.5 


64 




75 


15 


workers. The Kruskal-Wallis 


5 


test is an extension of the 


57 


1 


73 


13 


61 


3 


Wilcoxon rank sum test, 


62 


4 


72 


12 


76 


16 


and the ranks are assigned 


59 


2 


68 


9 


71 


11 


the same way. 


70 


10 


65 


6 


78 


17 




67 


7.5 




= 45 


74 


14 






= 32 






79 


18 












2*3 


= 94 





The critical value of H is the chi-square statistic corresponding to an upper 
tail area of a = 0.05 and df = k — 1 = 2. Referring to the chi-square table in Ap- 
pendix A, this is found to be 5.991. The calculated value, H = 7.490, exceeds 
5.991, and the null hypothesis is rejected at the 0.05 level of significance. At this 
level, we conclude that the three populations do not have the same median. 

The calculated H also exceeds the critical value for the 0.025 level of signifi- 
cance (chi-square = 7.378), but does not exceed that for the 0.01 level (chi- 
square = 9.210). Based on the table in Appendix A, the most accurate statement 
we can make about the p-value for the test is that it is between 0.025 and 0.01. 



Computer Solutions 14.4 shows how we can use Excel and Minitab in apply- 
ing the Kruskal-Wallis test to compare the three independent samples in Table 
14.6. Minitab lists the same H value (7.49) we've just obtained, along with an 
"adjusted" H value (7.50) that corrects for the tie that occurred during the rank- 
ing process. Although it would not change our conclusion in this example, it is 
preferable to use the adjusted H value when it is available. 



exercises 




14.27 Differentiate between the Kruskal-Wallis test and 
the one-way analysis of variance in terms of their 
assumptions and the circumstances under which each 
should be applied. 

14.28 For the following independent and random 
samples, use the 0.05 level of significance in testing 
whether the population medians could be equal. 



Sample 1 


Sample 2 


Sample 3 


31.3 


29.4 


36.0 


30.7 


20.8 


37.7 


35.4 


22.2 


31.0 


36.1 


24.9 


28.4 


30.3 


21.4 


31.7 


25.5 




24.1 






32.6 



14.29 For the following independent and random 
samples, use the 0.10 level of significance in testing 
whether the population medians could be equal. 

Sample 1 Sample 2 Sample 3 Sample 4 



313 


361 


197 


382 


240 


316 


203 


302 


262 


149 


244 


260 


218 


325 


108 


370 


236 


171 


102 


190 


253 






382 



14.30 A sample of five law firms has been selected from 
each of three major cities for the purpose of obtaining a 
quote for legal services on a relatively routine business 
contract. The quotes provided by the fifteen firms are as 
shown in the accompanying table. At the 0.05 level of 
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Kruskal-Wallis Test for Comparing More Than Two Independent Samples 

This hypothesis test compares the medians of more than two independent samples. 



EXCEL 





A 


B 


C 


1 


Kruskal-Wallis Test 






2 








3 


Group 


Rank Sum 


Observations 


4 


Firm 1 


32 


6 


5 


Firm2 


45 


5 


6 


Firm3 


94 


7 


7 


H Stat 




7.4896 


8 


df 




2 


9 


p-value 




0.0236 


10 


chi-squared Critical 




5.9915 



Excel Kruskal-Wallis Test for Comparing More Than Two Independent Samples 

1. The data in Table 14.6 are in adjacent columns in file CX14TECH.XLS, with the labels in A1:C1, and the data in A2:C8. 
Click Tools. Click Data Analysis Plus. Click Kruskal Wallis Test. Click OK. 

2. Enter A1:C8 into the Input Range box. Click Labels. Enter 0.05 into the Alpha box. Click OK. 



MINITAB 

Minitab Kruskal-Wallis Test for Comparing More Than Two Independent Samples 
Kruskal-Wallis Test: Score versus Firm 

Kruskal-Wallis Test on Score 



Fi rm 


N 


Median 


Ave Rank 


Z 


1 


6 


64.50 


5.3 


-2.34 


2 


5 


68.00 


9.0 


-0.25 


3 


7 


75.00 


13.4 


2.49 


Overal 1 


18 




9.5 




H = 7.49 


DF = 


2 P = 0 


024 




H = 7.50 


DF = 


2 P = 0 


024 (adjusted 


for tie 



1 . The data in Table 14.6 must be stacked, as in file CX14TECH.MTW. With the test scores in column C1 and the firm 
identification (1, 2, or 3) in column C2: Click Stat. Select Nonparametrics. Click Kruskal-Wallis. 

2. Enter C1 into the Response box. Enter C2 into the Factor box. Click OK. The printout will include two p-values, 
including one that is adjusted for ties. 
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significance, does it seem that the population median bids 
from the cities' firms could be the same? 



City A City B City C 

452 439 477 

469 461 460 

430 487 496 

403 431 484 

456 401 423 



14.31 In testing three different rubber compounds, a tire 
manufacturer finds the tread life of tires made from each 
to be as shown below. At the 0.05 level, could the three 
compounds deliver the same median tread life? 

Design 1: 34 38 33 30 30 thousand miles 
Design 2: 46 43 39 46 36 
Design 3: 48 39 33 35 41 

14.32 In an agricultural test, each of four organic 
compounds is applied to a sample of plants. At the end of 
4 weeks, the heights of the plants are as shown here. At 
the 0.025 level, are the compounds equally effective in 
promoting plant growth? 



Formula 1: 


19 


18 


20 


20 


18 


inches 


Formula 2: 


9 


13 


20 


16 


13 


18 


Formula 3: 


14 


8 


8 


17 


8 




Formula 4: 


10 


13 


12 


19 


18 


11 



/ data Set / Note: Exercises 14.33 and 14.34 require 
a computer and statistical software. 

14.33 For three random samples of employees, each sam- 
ple consisting of employees from a given age group, the 
data in file XR14033 show the number of absences over 
the past 6 months. At the 0.10 level, can we conclude 
that the populations of such employee groups have the 
same median number of days missed? Identify and 
interpret the p-value for the test. 

14.34 The Environmental Protection Agency measures 
city fuel economy by placing the vehicle on a laboratory 
dynamometer and determining the amount of fuel 
consumed during a standard speed-distance schedule that 
simulates an urban trip. File XR14034 lists the miles per 
gallon data for three different automobile models. Can 
we conclude at the 0.10 level that the three models have 
the same median fuel economy? Identify and interpret the 
p-value for the test. 



FRIEDMAN TEST FOR THE RANDOMIZED 1 4.6 

BLOCK DESIGN 



Description and Applications 

The Friedman test is an extension of the Wilcoxon signed rank test for paired 
samples and is the nonparametric counterpart to the randomized block ANOVA 
design of Chapter 12. While randomized block ANOVA requires that observa- 
tions be from normal populations with equal variances, the Friedman test makes 
no such demands. The Friedman test is applicable regardless of the shapes of the 
populations from which the individual data values are obtained. Unlike the ran- 
domized block ANOVA, the Friedman test can also be extended to the examina- 
tion of ordinal data instead of being limited to data from the interval or ratio 
scales of measurement. 

Like the randomized block ANOVA, the Friedman test has the goal of compar- 
ing treatments, though it is concerned with their medians rather than their means, 
and persons or test units are arranged into homogeneous blocks before the treat- 
ments are randomly assigned to members of each. The purpose of the blocks is to 
reduce error variation that might otherwise be present as the result of chance differ- 
ences from one treatment group to the next with regard to the characteristics repre- 
sented by the blocking variable. The randomized block ANOVA and the Friedman 
test differ in their assumptions and procedures, but their purposes and applications 
are similar. 

As in the randomized block ANOVA of Chapter 12, the data can be summa- 
rized in a b X t table, where each column represents one of the t treatment groups 
and each row represents one of the b blocks. In each cell (combination of 
block and treatment), there will be one observation. Within each block (row), the 
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observations are ranked from lowest (rank = 1) to highest (rank = t). If there are 
ties, each of the tied cells is assigned the average of the ranks they are occupying. 
The null and alternative hypotheses, the test statistic, and the decision rule are as 
follows: 

Friedman test for the randomized block design: 

• Null and alternative hypotheses 

fi 0 : m t = m 2 = ■■• = m t for the; = 1 through t treatments. 

(The population medians are equal.) 

H-ji At least one m- differs from the others. 

(The population medians are not equal.) 

• Test statistic 

12 1 

F r = ~, — ; — ^,R} — 3b(t + 1) where b = number of blocks 

r bt(t + 1 )£i ' . , c„ 

v ' /-i t = number of treatments 

Rj = sum of the ranks 

for treatment ; 

• Critical value and decision rule For a test at the a level of significance, the critical 
value is the chi-square statistic with df = (t — 1) and an upper-tail area of a. 
If F r exceeds the critical chi-square, reject H 0 ; otherwise, do not reject. 

For the Friedman test, either the number of blocks or the number of treatments 
should be 5 5 for the chi-square distribution to be a good approximation. If both b 
and t are < 5, it is advisable either to use a computer statistical package or to rely on 
a set of special tables that are available in references such as Siegel (1956). 1 It is also 
desirable that the number of ties be small relative to the total number of observations, 
or cells. This presents no problem if the test is computer assisted, as most computer 
statistical packages provide results that are corrected for ties. 




Friedman Test 

The maker of a stain remover is testing the effectiveness of four different formu- 
lations for a new product. An experiment has been conducted in which each for- 
mula was randomly applied to one of four fabric pieces stained with the same 
household or food product. Six common types of stains were used as the blocks 
in the experiment, and each measurement represents the research director's sub- 
jective judgment on a scale of 1 to 10 using the following criteria: 

123456789 10 

□ □□□□□□□□□ 
"Poor" "Fair" "Good" "Excellent" "Outstanding" 



'Source: See S. Siegel, Nonparametric Statistics for the Behavioral Sciences (New York: McGraw-Hill, 
1956). 
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A. Ratings Data 



Type of Stain 



Stain-Remover Formula 





1 


2 


3 


4 


Creosote 


2 


7 


3 


6 


Crayon 


9 


10 


7 


5 


Motor oil 


4 


6 


1 


4 


Grape juice 


9 


7 


4 


5 


Ink 


6 


8 


4 


3 


Coffee 


9 


4 


2 


6 



TABLE 14.7 



Part A shows the underlying 
data for a Friedman test in 
which the blocks are types 
of stains and the treatments 
are four different formula- 
tions for a stain remover. 
The ratings reflect how 
well the stain was removed, 
with numerically higher 
numbers representing 
greater effectiveness. In 
part B, the scores within 
each block have been 
converted into ranks. 



B. After Converting to Ranks, with 1 = Lowest Rating and 4 = Highest Rating 

Stain-Remover Formula 





1 


2 


3 


4 


Creosote 


1 


4 


2 


3 


Crayon 


3 


4 


2 


1 


Stain Motor oil 


2.5 


4 


1 


2.5 


Grape juice 


4 


3 


1 


2 


Ink 


3 


4 


2 


1 


Coffee 


4 


2 


1 


3 


Sum of the ranks: 


17.5 


21.0 


9.0 


12.5 



The fabric samples having each type of stain are presented to the research direc- 
tor in a random order, and the ratings that resulted are shown in part A of Table 
14.7. The data are also in file CX14FAB. At the 0.05 level, examine whether the 
stain-remover formulas could be equally effective in removing stains from this 
type of fabric. 

SOLUTION 

Although there are equal numerical distances from one rating category to the 
next, the presence of verbal descriptors along the way detracts from our ability to 
assume the resulting data are in the interval scale of measurement. For example, 
although they are shown equidistant along the scale, doubts may exist as to 
whether the distance between "Fair" and "Good" is the same as the distance be- 
tween "Good" and "Excellent." It's therefore best that the ratings in part A of 
Table 14.7 be treated as ordinal (ranks) data. 

The first step in the analysis is to convert the ratings into ranks within blocks. 
The results of this conversion are shown in part B of Table 14.7. For example, 
part A reveals that formula 4 has the lowest rating for removing ink stains, and it 
is assigned a rank value of 1 within that block, or row. Within the same block, 
formula 3 has the next lowest rating for ink stains, and it is assigned a rank value 
of 2. Formula 1 has the next to the highest rating on ink stains, and it is assigned 
a rank value of 3. Finally, formula 2 has the highest rating for ink stains and is as- 
signed a rank value of 4. The ranks within the other blocks are assigned using the 
same procedure. 



526 



Part 4: Hypothesis Testing 



In the motor oil block, formulas 1 and 4 had identical ratings. The rank val- 
ues they are occupying include ranks 2 and 3, so each is assigned the average of 
these ranks or (2 + 3)/2 = 2.5. As part B of Table 14.7 shows, this is the only tie 
that exists in the data. 

Next, we calculate the F r test statistic. The calculation relies on the sum of the 
ranks shown in part B of Table 14.7. These are R 1 = 17.5, R 2 = 21.0, R 3 = 9.0, 
and R 4 = 12.5. For b = 6 blocks and t = 4 treatments, the calculated value of 
the test statistic is 



2 ^Rf ~ 3b(t + 1) 



bt(t + \)p x 
12 

6(4)(4 + 1) 



17.5 2 + 21.0 2 + 9.0 2 + 12.5 2 ) - 3(6)(4 + 1) 



12 
120 



(984.5) - 90 = 8.45 



For a test at the 0.05 level of significance, the critical value of F r is the chi- 
square statistic with df = (t — 1) = 3 and an upper-tail area of 0.05. Referring to 
the chi-square table, we find the critical value to be 7.815. For the 0.05 level of 
significance, the calculated value of F r (8.45) exceeds the critical value (7.815), 
and we are able to reject the null hypothesis that the treatment medians could be 
equal. In other words, at least one of them is better or worse than the others. 

Had the test been carried out at the 0.025 level of significance, the calculated 
value (8.45) would not have exceeded the critical value for this level (9.348), and 
the null hypothesis could not have been rejected. Based on the chi-square table, 
the most accurate statement we can make about the p-value for this test is that it 
is somewhere between 0.05 and 0.025. 



Computer Solutions 14.5 shows how we can use Excel and Minitab in apply- 
ing the Friedman test to the randomized block data in part A of Table 14.7. 
Minitab provides two p-values: one that ignores ties (0.038) and one that is ad- 
justed for the one tie that was present in the ratings (0.035). Two values are sim- 
ilarly given for the calculated test statistic, identified in the printout as "S." 



Exercises 




«';-jf „ EC 




14.35 Compare the Friedman test with the randomized 
block ANOVA in terms of (a) their respective null and 
alternative hypotheses, and (b) the assumptions required 
in order to use each test. 

14.36 The randomized block design has been used in com- 
paring the effectiveness of three treatments, with the data 
as shown at the right. Using the 0.05 level of significance, 
can we conclude that the treatments are equally effective? 



Treatment 



Block 



80 
60 
53 
72 
84 



75 
70 
45 
65 
82 



72 
60 
50 
49 
75 
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Friedman Test for the Randomized Block Design 

This hypothesis test compares the medians of more than two dependent samples. 



EXCEL 





A B 


C 


1 


Friedman Test 




2 








3 


Group 




Rank Sum 


4 


Formula 1 




17.5 


5 


Formula2 




21 


6 


Formula3 




9 


7 


Formula4 




12.5 


8 


Fr Stat 




8.45 


9 


df 




3 


10 


p-value 




0.0376 


11 


chi-squared Critical 


7.815 



Excel Friedman Test for Comparing More Than Two Dependent Samples 

1 . The data in Table 14.7 are in adjacent columns in file CX14FAB.XLS, with the labels in A1 :D1, and the data in A2:D7. 
The columns must represent the treatments and the rows must represent the blocks, as in part A of Table 14.7. Click 
Tools. Click Data Analysis Plus. Click Friedman Test. Click OK. 

2. Enter A1:D7 into the Input Range box. Click Labels. Enter 0.05 into the Alpha box. Click OK. The printout includes 
the calculated test statistic (8.45) and the p-value for the test (0.0376). 

MINITAB 

Minitab Friedman Test for Comparing More Than Two Dependent Samples 
Friedman Test: Rating versus Formula blocked by StainTyp 

S = 8.45 DF = 3 P = 0.038 

S = 8.59 DF = 3 P = 0.035 (adjusted for ties) 









Sum 






Est 


of 


Formula 


N 


Median 


Ranks 


1 


6 


6.438 


17.5 


2 


6 


7.188 


21.0 


3 


6 


3.438 


9.0 


4 


6 


4.688 


12.5 



Grand median = 5.438 

1 . The data in Table 14.7 must be stacked, as in file CX14FAB.MTW. With the formula (coded as 1-4) in C1, the stain 
type (1-6) in C2, and the performance rating in C3: Click Stat. Select Nonparametrics. Click Friedman. 

2. Enter C3 into the Response box. Enter C1 into the Treatment box. Enter C2 into the Blocks box. Click OK. The print- 
out includes a p-value that is adjusted for ties. 
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14.37 The following salary figures have been reported for 
the mayor, police chief, and fire chief for a sample of U.S. 
cities. Assuming these are a random sample of major cities 
in the United States, and using the 0.10 level of significance, 
can we conclude that the median salaries for these positions 
of responsibility could be the same? Using the appropriate 
statistical table, what is the most accurate statement that can 
be made about the p-value for the test? source: City and State 
magazine, as reported in "Top City Workers' Pay from Across the USA," USA 
Today, December 5, 1 989, p. 8A. 





Mayor 


Police 
Chief 


Fire 
Chief 


Atlanta 


$65,000 


$49,356 


$53,516 


Chicago 


80,000 


78,750 


88,476 


Denver 


77,400 


66,408 


75,888 


Jacksonville 


86,991 


55,000 


55,000 


Newark 


80,670 


47,108 


64,961 


San Diego 


60,000 


99,456 


85,626 



14.38 An extermination firm is testing several brands of 
pesticide spray, all of which claim to be effective against 
ants. Under controlled conditions, each spray is used on 
10 ants of the species listed here. The measurement in 
each cell is the number of seconds until all 10 ants are 
dead. Given these data, and using the 0.05 level of 
significance, can the firm conclude the brands are equally 
effective? Using the appropriate statistical table, what is 
the most accurate statement that can be made about the 
p-value for the test? 



Pesticide Spray 





1 


2 


3 


4 


Fire Ants 


10 


14 


13 


17 


Bulldog Ants 


12 


16 


8 


19 


Honey Ants 


17 


14 


15 


20 


Carpenter Ants 


15 


12 


14 


18 


Weaver Ants 


14 


1 1 


10 


15 


Janitor Ants 


13 


15 


17 


17 seconds 



/ data Set / Note: Exercises 14.39 and 14.40 require 
a computer and statistical software. 

14.39 Three movie critics have each rated a number of 
current movies on a scale of 1 = poor to 10 = excellent. 
Given the ratings in file XR14039, use the 0.025 level of 
significance in comparing the critics. Does it appear that 
they may be relying on different value systems in assign- 
ing their ratings? Identify and interpret the p-value for 
the test. 

14.40 To evaluate whether three of its slopes should be 
classified as equally difficult, a ski resort sets up an exper- 
iment in which three skiers from each skill category (A = 
beginner to F = expert) are randomly assigned to make 
one run on the slope to which they have been assigned. 
Given the times (minutes) in file XR14040, and using the 
0.05 level of significance, can it be concluded that the 
slopes could be equally difficult? Identify and interpret 
the p-value for the test. 



1 4.7 OTHER NONPARAMETRIC METHODS 



The nonparametric methods covered earlier are just the tip of the iceberg, as a 
great many other tests have been devised for specific applications. This section 
presents three such techniques. 

Sign Test for Comparing Paired Samples 

The sign test is used for the same purposes as the Wilcoxon signed rank test of 
Sections 14.2 and 14.3, but it assumes that data are ordinal instead of interval 
or ratio. In the sign test, the difference between a data value and the hypothe- 
sized median (one-sample test) or the difference between two data values 
(paired-samples test) is replaced with a plus ( + ) or a minus ( — ) sign indicating 
the direction of the difference. "We'll discuss this test in the context of compar- 
ing paired samples. 

The sign test relies on the binomial distribution and the fact that, if m^, the 
population median for d i = x i — y t , is actually zero, P( + ) will be 0.5 for any pair 
of observations that are not tied. The method we are about to present involves the 
binomial tables, and it is exact. For larger samples, however, it can be approxi- 
mated through the use of the normal distribution. The normal approximation will 
be discussed later. 
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The sign test for comparing paired samples: 

• The test statistic 

1. For each pair of values, calculate d- = x- — y l and record the sign of d-. 

2. n = the number of pairs for which d l ¥= 0. 

3. The test statistic is T = the number of pairs for which d- > 0. 

• Determining the /?-value for the test For n = the number of nonzero differ- 
ences, use the binomial tables (with the number of trials = n, and the proba- 
bility of a "success," tt = 0.5) to identify probabilities associated with s = the 
number of successes for the type of test being conducted: 

• Two-tail test with H Q : m d = 0 versus Hji m d ¥= 0. 

p-value = P(s < T) + P{s > [n - T]) 

• Right-tail test with H 0 : m d < 0 versus H^. m d > 0. 

p-value = P(s > T) 

• Left-tail test with H Q : m d >0 versus Hj! m d < 0. 

p-value = P(s < T) 

• Decision rule For a test at the a level, reject H Q if p-value < a; otherwise, do 
not reject. 

• Carrying out the sign test for one sample In this application, each value in the 
y- column will be ra Q , the value that has been hypothesized for the median. 
Otherwise, the procedure is similar to the procedure for comparing paired ob- 
servations above. 




Sign Test 

To demonstrate how the sign test works, we'll apply it to the paired samples 
described earlier in Table 14.3, and the test will be carried out at the 0.10 level of 
significance. As in our earlier analysis, the data are in file CX14TIME, the test is two- 
tail, and the null and alternative hypotheses are H Q : m d = 0 and H^: m d ¥= 0. 

SOLUTION 

As Table 14.8 (page 530) shows, the sign test converts the d l = x i — y i values to 
plus or minus, depending on whether x i is greater than or less than y i for each pair 
of observations. For computing task J, d t = 0, and this pair of observations is 
omitted from further analysis. For the n = 11 nonzero differences that remain, 
T = 3 of these differences are positive. 

In determining the p-value for the test, we can use the cumulative binomial 
probabilities for n = 11 trials and tt = 0.5 for "success" (a positive difference) on 
each trial. For convenience, these probabilities have been reproduced in 
Table 14.9, which also summarizes the determination of the p-value for the test. If 
the null hypothesis were true, the probability of achieving s ^ 3 successes would be 



P(s < 3) = 0.1133 as shown in Table 14.9 
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TABLE 14.8 



Using the sign test to Time Required for 

compare the paired samples Software Packages 
of Table 14.3 x and y 

Computing The Sign of 



Task 




y, 




A 


24.0 


23.1 


+ 


B 


16.7 


20.4 




c 


21 .6 


17.7 


+ 


D 


23.7 


20.7 


+ 


E 


37.5 


42.1 




F 


31.4 


36.1 




G 


14.9 


21.8 




H 


37.3 


40.3 




1 


17.9 


26.0 




J 


15.5 


15.5 


0 


K 


29.0 


35.4 




L 


19.9 


25.5 





TABLE 14.9 



The underlying binomial 
probabilities for a sign test 
of the data in Tables 14.3 
and 14.8 



Binomial probabilities for 11 nonzero differences 

Cumulative Distribution Function 

Binomial with n = 11 and p = 0.5 





X 


P( X 


<= x) 


0 


00 


0 


0005 


1 


00 


0 


0059 


2 


00 


0 


0327 


3 


00 


0 


1133 - 


4 


00 


0 


2744 


5 


00 


0 


5000 


6 


00 


0 


7256 


7 


00 


0 


8867 - 


8 


00 


0 


9673 


9 


00 


0 


9941 


10 


00 


0 


9995 


11 


00 


1 


0000 



Forn = 1 1,77- = 0.50, 
the exact probability 
of three or fewer " + " 
differences is 0.1 133. 



Forn = 11,77 = 0.50, 
the exact probability 
of eight or more " + " 
differences is 
1 - 0.8867 = 0.1133. 



Since T = 3 and the test is 
two-tail, the exact p-value is 
0.1133 + 0.1 133, or0.2266. 



However, the test is two-tail and P(s < 3) = 0.1133 represents only the left- 
tail component of the p-value. The right-tail component will be its mirror image 
(recall that the binomial distribution is symmetric when tt = 0.5), and this is the 
probability that s ^ (n — T), or P(s a 8). This component can be calculated as 

P(s > 8) = 1 - P(s < 7) 

= 1.0000 - 0.8867 = 0.1133 also shown in Table 14.9 



Chapter 14: Nonparametric Methods 



531 



and the p-value for the two-tail test is 

P{s < 3) + P(s > 8) = 0.1133 + 0.1133 = 0.2266 

This analysis had the goal of using the a = 0.10 level in testing whether m d 
could be zero. Since the p-value (0.2266) is not less than a (0.10), the null hypoth- 
esis cannot be rejected at the 0.10 level. Based on the sign test, we are not able to 
conclude that the software packages differ with respect to the time they require 
for various computing tasks. 



If the number of nonzero differences is n > 10, the normal approximation to 
the binomial distribution can be used in carrying out the sign test. As discussed in 
Chapter 7, the normal distribution is a close approximation to the binomial 
whenever both nir and n(l — it) are ^5. Since the null hypothesis in the sign test 
always assumes that ir = 0.5, the approximation will be satisfactory as long as 
n > 10. The procedure for the normal approximation is described below. 

The sign test, using the normal approximation to the binomial distribution: 

For n = the number of nonzero differences, T = number of ( + ) differences, the 
test statistic is 

T- 0.5n 

Z = -j=~ 

0.5Vn 

The critical value(s) for z will be similar to other z-tests. In the sign-test applica- 
tion, the normal approximation to the binomial distribution may be used when- 
ever n > 10. 

The rightmost term in the numerator of the z-statistic is the expected number 
of successes in n trials, and the denominator is the standard deviation of the num- 
ber of successes. In Chapter 7, the standard deviation was presented as the square 
root of [mr(l — 7r)], and the denominator in the z-statistic is its equivalent when 
7T = 0.5. 

In the example just presented, we carried out the sign test onn = 11 nonzero 
differences by referring to the binomial tables. Since n was s 10, we could have 
used the normal approximation instead. Applying the normal approximation 
method to this two-tail test, the first step is to compare the observed and expected 
values of T. The calculated value of the test statistic will be 

T-0.5n 3.0 - 0.5(11) 

z = t=- 1=- — 1.51 

o.5V» 0.5VTT 

For a = 0.10 and a two-tail test, the critical values of z are z = —1.645 and 
z = +1.645. The calculated value (z = —1.51) falls within the "do not reject" re- 
gion bounded by these values, and H Q : m d = 0 cannot be rejected. This is the same 
conclusion we reached when we used the exact binomial probabilities in carrying 
out the test. When the number of nonzero differences is very large, the normal 
approximation will very closely approach the exact binomial distribution results 
in the sign test. 

The sign test can also be used for tests involving a single sample. In this case, 
we just compare each observed value to the hypothesized value for the median. 
The observations are represented as x { , and the hypothesized value for the median 
is represented as y t . All of the values in the y t column will be identical (and equal 
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b 


6* 



Sign Test for Comparing Paired Samples 

This hypothesis test applies to paired samples, such as before-and-after comparisons for the same items 
or subjects. 



EXCEL 





A 


B 


C 


D 


E 


1 


Sign Test for Testing One Sample or for Comparing Paired Samples 




2 












3 


n, the Number of NonZero Differences: 


11 


Calculated Values: 




4 


T, the Number of Differences with di > 0: 


3 


p-Value If the Test Is: 




5 






Left-Tail 


Two-Tail 


Right-Tail 


6 






0.1133 


0.2266 


0.9673 



Excel Sign Test for Paired Samples 

1 . The data in Table 14.3 (repeated in Table 14.8) are in adjacent columns in file CX14TIME.XLS, with labels in A1 and 
B1, and data in A2:A13 and B2:B13. Enter the label (Difference) into CI. Select C2 and enter the equation, =A2-B2. 
Click on the lower right corner of C2, then autofill downward so the differences will be in C2:C13. Count the number 
of positive, negative, and zero differences. 

2. Open Excel worksheet template TMSIGN.XLS, provided with the text. In this worksheet, enter the number of nonzero 
differences into B3. Enter the number of positive differences into B4. The p-value for the test is based on the binomial 
distribution and is automatically calculated by Excel. For this two-tail test, the p-value is reported as 0.2266. 

Note: As an alternative, the z-test approximation is applicable when there are at least 1 0 pairs of values. This test is avail- 
able via the Tools, Data Analysis Plus, Sign Test sequence of menu selections. (Note: Select one of the cells in the data 
field before proceeding.) 



MINI TAB 

Minitab Sign Test for Paired Samples 
Sign Test for Median: DIFF 

Sign test of median = 0.00000 versus not = 0.00000 

N Below Equal Above P Median 

DIFF 12 8 1 3 0.2266 -4.150 

1 . With the Table 14.3 (and Table 14.8) data values in columns C1 and C2 of file CX14TIME.MTW, we must first calcu- 
late the differences and insert them into C3, as described in step 1 of Computer Solutions 14.2. 

2. Click Stat. Select Nonparametrics. Click 1-Sample Sign. 

3. Enter C3 into the Variables box. Select Test median and enter 0 into the box. Select not equal within the Alterna- 
tive box. Click OK. 



to m 0 , the value that has been hypothesized for the median). The procedure is 
then carried out using the methods just described. If n = the number of nonzero 
differences is < 10, the binomial tables should be used, while if n > 10, it is sat- 
isfactory to use the normal approximation. 

Computer Solutions 14.6 shows how we can use Excel and Minitab in apply- 
ing the sign test to compare the paired samples in Table 14.8. The p-value 
(0.2266) is greater than for the Wilcoxon signed rank test for the same data 
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(0.03). This is because the sign test has made less efficient use of the data, assum- 
ing it to be ordinal instead of interval. 

The Runs Test for Randomness 

The runs test evaluates the randomness of a series of observations by analyzing 
the number of runs it contains. A run is the consecutive appearance of one or 
more observations that are similar. For example, suppose a portion of a random 
number table included the series 2, 7, 6, 8, 9, 6, 7, 6, 6, 4. The first digit (2) is 
below the supposed median of 4.5 and constitutes one run. The middle eight dig- 
its are all above 4.5 and are another run. The final digit (4) is below 4.5 and is a 
third run. For a listing of digits from a supposed random number table, the pres- 
ence of either very few runs or a great many runs would cause us to doubt 
whether the "random" digits are truly random. 

If data are nominal, runs can also be counted. In a series of coin flips, we 
might observe H H T T T H T. In recording the order in which males and females 
arrive at work, part of the sequence could be F M M M F F M. Each of these se- 
quences contains four runs. The runs test is carried out as follows: 



Runs test: 

• Null and alternative hypotheses 

H 0 : The sequence is random, and H*: The sequence is not random. 

• Procedure 

• For nominal data with two categories 

1. Determine n 1 and n 2 , the number of observations of each type. 

2. Count the number of runs, T. 

• For ordinal, interval, or ratio data 

1. Determine the median, m, of the data values. 

2. Identify each data value with a plus ( + ) if x i s m and with a 
minus ( — ) if x- < m. 

3. Determine n 1 and n 2 , the number of ( + ) and ( — ) observations. 

4. Count the number of runs, T. 

• Test statistic 



where T = the number of runs 

= the number of observations 
of the first type 
w ? = the number of observations 

of the second type 
n = the total number of 
observations, n x + n 2 

In the numerator of the expression above, the quantity in parentheses is 
the number of runs that would be expected if the observations were randomly 
arranged. If the number of runs is significantly less than expected, this indicates 
that similar kinds of observations tend to occur in clusters, as would happen if a 



T 



2n 1 n 2 



+ 1 



2n 1 n 2 (2n 1 n 2 — n) 
n 2 (n — 1) 



Part 4: Hypothesis Testing 

sports team tended to win and lose in streaks. This may also occur if there is an 
upward or downward trend from earlier values to later ones. If the number of 
runs is significantly more than expected, this reflects that the data values tend to 
reverse direction more often than chance processes would allow. 

When data are of the interval or ratio scale of measurement, the mean can be 
used instead of the median. Since the median can apply to any data that are not 
nominal, however, we will use it in our presentation and examples. If a sequence 
has a great many values and a computer is not being used, it can be time-saving 
to use the more easily calculated mean instead of the median. 



example 



Runs Test for Randomness 

A political activist claims to have "randomly" stopped persons at a street cor- 
ner and asked them to sign his petition and give their age. During his first hour 
on the street, 30 people signed the document and gave their age, and the order 
is as shown in Table 14.10. The data are also in file CX14AGE. At the 0.05 level 
of significance, evaluate the randomness of the ages for this sequence of 30 
respondents. 

SOLUTION 

In conducting this test, the null hypothesis is H Q : The ages are in a random order. 
The age values, shown in Table 14.10, have a median of 44 years. Each age is con- 
verted to a plus ( + ) if it is 44 or higher and to a minus ( — ) if it is less than 44. This 
process provides the sequence of ( + ) and ( — ) symbols in the lower part of the 
table. For clarity, each of the T = 10 runs is enclosed in a box. There are n 1 = 15 
( + ) symbols, n 2 = 15 { — ) symbols, and the total sample size is n = 15 + 15 = 30. 
The ^-statistic is 



The actual 
number of runs 



T 



The expected 
number of runs 



2n 1 n 2 



+ 1 



2m 1 « 2 (2« 1 «7 — n) 



n 2 (n 



10 - 16 
2.691 



- 1) 
-2.23 



10 - 



2(15)(15) 



30 



+ 1 



2(15)(15) • [2(15)(15) - 30] 



30 2 (30 



For a two-tail test at the 0.05 level of significance, the critical z values are 
z = —1.96 and z = +1.96. The observed value, z = —2.23, is outside these lim- 
its, and the null hypothesis is rejected. At the 0.05 level, the ages do not occur in 
a random order. That the number of runs (10) is much less than the expected 
number (16) is not necessarily the petitioner's fault, since persons of similar ages 
may tend to be found together or to frequent the street at different times of the 
day. For example, older persons may tend to be more prevalent during the early 
morning and noontime hours. 
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The Age of Signers, in the Order in Which They Signed 



30 


33 


15 


59 


35 


29 68 


69 


38 


43 


15 


36 


35 


30 


61 


74 


56 


47 


68 18 


22 


12 


58 


45 


65 


64 


49 


38 


58 


45 



















The Ages Compared to the Median Age, 44 Years [(+) if x > 44, (-) if x < 44]) 

Run No. 1 2 3 4 5 

ED 



+ + 




TABLE 14.10 



In this sequence of persons 
signing a petition, the me- 
dian age is 44 years. The 
sequence contains T = 10 
runs versus an expected 
value of 16 runs, and the 
null hypothesis that the 
order is random is rejected 
at the 0.05 level of 
significance. 



Using the standard normal table, the area to the left of z = —2.23 is 
0.5000 - 0.4871, or 0.0129. Because the test is two-tail, the p-value is twice this 
area, or 2(0.0129) = 0.0258. 



Computer Solutions 14.7 shows how we can use Excel and Minitab in ap- 
plying the runs test for randomness to the sequence in Table 14.10. With Excel, 
we manually count the runs, then apply Excel worksheet template 
TMRUNS.XLS, provided with the text. Minitab reports both the actual and the 
expected number of runs. The actual number of runs differs greatly from the 
expected number, and the p-value for the test is just 0.026. Using the 0.05 level 
of significance that has been specified, the sequence of ages is judged to be 
other than random. 



Kolmogorov-Smirnov Test for Normality 

The Kolmogorov-Smirnov test for normality is an alternative to the chi-square 
test for normality discussed in Chapter 13. Compared to the chi-square method, 
this test does not require any minimum values for expected frequencies and can 
be used with small as well as large sample sizes. The data must be a random sam- 
ple from the population, and the variable being observed or measured has to be 
at least interval in its scale of measurement. 

The population parameters are not specified, and the sample mean and stan- 
dard deviation are used to estimate their population counterparts. The null and 
alternative hypotheses are 

H 0 : The sample was drawn from a normal distribution. 
JL: The sample was not drawn from a normal distribution. 

This test involves two cumulative relative frequencies: one that is derived 
from the actual data observed, the other constructed under the assumption that 
the null hypothesis is true, and relying on the mean and standard deviation of 
the sample data. A comparison is made between these cumulative relative 
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Runs Test for Randomness 
EXCEL 





A 


B 


C 


D 


E 


1 


Z-Test Approximation, Runs Test for Randomness 






2 












3 


Number of Runs, T: 


10 


Calculated Values: 






4 


Number of Observs. of Type 1 , n1 : 


15 


Expected No. of Runs = 


16.0 




5 


Number of Observs. of Type 2, n2: 


15 


z = 


-2.230 




6 


Total Number of Observations, n: 


30 


p-value If the Test Is: 






7 






Left-Tail 


Two-Tail 


Right-Tail 


8 






0.0129 


0.0258 


0.9871 



Excel Runs Test for Randomness 

1. The ages in Table 14.10 are in file CX14AGE.XLS, with the label in A1 and the data values in A2:A31 . Click on cell B2 
and enter the equation =A2-43.999. Click on the bottom right corner of cell B2 and autofill downward to cell B31 . 
An examination of the sequence of values in B2:B31 will reveal the same series of signs shown in the lower portion of 
Table 14.10. Count the number of runs — this will be T = 10. Count the number of " + " values in the sequence — this 
will be n 1 = 1 5. Count the number of " — " values — this will be n 2 = 1 5. 

2. Open Excel worksheet template TMRUNS.XLS, provided with the text. Enter 7" (10) into B3. Enter n 1 (15) into B4. Enter 
n 2 (15) into B5. The expected number of runs, the calculated z-statistic, and the two-tail p-value for the test will be 
displayed. 

MINI TAB 

Minitab Runs Test for Randomness 
Runs Test: Age 

Runs test for Age 

Runs above and below K = 43.999 

The observed number of runs = 10 
The expected number of runs = 16 
15 observations above K, 15 below 
P-value = 0.026 

1. With the Table 14.10 sequence of ages in column C1 of CX14AGE.MTW: Click Stat. Select Nonparametrics. Click 
Runs Test. 

2. Enter C1 into the Variables box. Select Above and below: and enter 43.999 into the box. Click OK. The p-value 
(0.026) is comparable to that obtained with the Excel worksheet template above. 



frequencies, and the test statistic is the maximum amount of divergence 
between them. The procedure and test statistic are as follows: 



Kolmogorov-Smirnov test for normality: 

1. Data values are arranged in order, from smaller to larger values of x. 

2. Calculated test statistic: 

D = max|F- — E,\ 
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where F- = observed cumulative relative frequency for the z'th data value 

E ; = expected observed cumulative relative frequency for the z'th 
data value 

3. If D > the critical value listed in the critical values for the Kolmogorov- 
Smirnov test table in Appendix A for sample size = n and level of significance 
= a, the null hypothesis will be rejected. Otherwise, it cannot be rejected. 



Example 



Kolmogorov-Smirnov Test for Normality 

A random sample consists of just 5 data values: 438, 424, 213, 181, and 137, 
which are also in file CX14XDAT. Using the Kolmogorov-Smirnov test for normal- 
ity and the 0.10 level of significance, could these data have been drawn from a 
population that is normally distributed? 

SOLUTION 

This sample size is much too small for us to apply the chi-square test for normal- 
ity, and it might even seem too small for us to be able to apply any test at all. But 
the Kolmogorov-Smirnov test for normality is applicable even for a sample this 
small. For this test, the null hypothesis is that the sample was drawn from a nor- 
mal population and the alternative hypothesis is that it was not. 

The test is easily carried out. All we have to do is arrange the data in increas- 
ing order, then compare the observed and the expected cumulative relative 
frequencies. 

Table 14.11 summarizes the calculations involved in the test. The leftmost 
column shows the data values arranged in increasing order of size, and the F- col- 
umn lists the observed cumulative relative frequencies for each of the data values. 



Cumulative Relative Frequencies 



Data 


Relative 






Value 


Frequency 


Fj = Observed 


Ei = Expected 


137 


0.2000 


0.2000 


0.1587 


181 


0.2000 


0.4000 


0.2451 


213 


0.2000 


0.6000 


0.3228 


424 


0.2000 


0.8000 


— >~0.8485 


438 


0.2000 


1.0000 


0.8686 



In a normal distribution with fj. = 278.6 and 

a = 141.8, P(x< 424) is found by first calculating 

424 - 278.6 

- = 1.03, 



141.1 

then referring to the standard normal table. The area to 
the left of z = 1 .03 is 0.5000 + 0.3485, or 0.8485. 



0.0413 
0.1549 
0.2772 
0.0485 
0.1314 




TABLE 14.11 



The Kolmogorov-Smirnov 
test for normality can be 
applied even for very small 
samples. In this example, 
there are just 5 data values, 
but we can still test 
whether the sample could 
have been drawn from a 
normal population. The 
sample mean and standard 
deviation are used in 
constructing the expected 
cumulative relative 
frequencies. 
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For example, 3 of the 5 observations (60%, for a cumulative relative frequency of 
0.6000) were no greater than 213. 

The entries in the E t column are based on a normal distribution with i± = 
278.6 and a = 141.8. Each data value is converted to a z-score, then we use the 
standard normal table to find the area to the left. Table 14.11 shows an example 
calculation for the expected cumulative relative frequency for 424, or P(x ^ 424) 
for a normal distribution in which /jl = 278.6 and a = 141.8. The z-score is 
Z = 1.03, and the standard normal table is used in identifying the area to the left, 
which is 0.5000 + 0.3485, or 0.8485. 

The value of the test statistic, D = max|F- — E-|, is 0.2772. Referring to 
Appendix A and the critical values for the Kolmogorov-Smirnov test table, we 
find the critical value to be 0.315 for a test at the 0.10 level with n = 5. Since the 
observed value of D (0.2772) does not exceed the critical value (0.315), we are 
not able to reject the null hypothesis. The data could have been drawn from a nor- 
mal distribution. 



Computer Solutions 14.8 shows how we can use Minitab in applying the 
Kolmogorov-Smirnov test for normality to the data values in Table 14.11. The 
printout includes a calculated D statistic that is comparable to the one in Table 
14.11, but with greater accuracy, because our calculations were based on 
the normal distribution table, which has inherent gaps between the printed 
z values. 

Spearman Coefficient of Rank Correlation 

For paired observations in which the two variables are at least ordinal, we can 
use the Spearman coefficient of rank correlation (r s ) to measure the strength 
and direction of the relationship between them. For example, the Spearman co- 
efficient of rank correlation could be applied if two different investment advis- 
ers were to rank 10 stocks from best to worst. If they provided identical rank- 
ings, r s would be +1. If their rankings were exactly opposite, r s would be — 1. 
For variables x and y, a positive correlation indicates that higher ranks for x 
tend to be associated with higher ranks for y, and a negative correlation indi- 
cates that lower ranks for x tend to be associated with higher ranks for y. 
When the original data are not in the form of ranks, they are converted to 
ranks for purposes of this analysis. The Spearman coefficient of rank correla- 
tion has a parametric counterpart that was discussed in Chapter 2 and will be 
discussed further in the next chapter. The formula below is exact when there 
are no ties, and is a very close approximation when the number of ties is small 



relative to the number of pairs. 


Spearman coefficient of rank correlation: 




Test statistic: 




r = 1 2 where 


= the sum of the squared differences 


b w(« z — 1) 


between the ranks 


n 


= the number of pairs of observations 
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The Kolmogorov-Smirnov Test for Normality 

This test examines whether a set of data could have come from a normal population. 

EXCEL 

Excel does not provide the Kolmogorov-Smirnov test for normality. As an alternative, the chi-square test for normality 
(Chapter 13) can be used when n > 32. 



MINITAB 

Minitab Kolmogorov-Smirnov Test for Normality 



Probability Plot of X 

Normal 



99 



3S 
90 

80 
70 

i» 

u 50 
<U 40 

30 
20 

10 

S 




Mean 


278.8 


StDev 


141.8 


N 


5 


KS 


0.278 




>0.150 



1. The data in the leftmost column of Table 14.11 are in column C1 of file CX14XDAT.MTW. Click Stat. Select Basic 
Statistics. Click Normality Test. 

2. Enter C1 into the Variable box. Select Kolmogorov-Smirnov. Click OK. 

3. In the normal probability plot that is generated, data that are actually from a normal distribution are more likely to fall 
along the straight line that is fitted. The output also includes the D statistic (shown as "KS") and an approximate 
p-value (p > 0.1 5) for the test. Because (p > 0.1 5) is not less than the 0.10 level for our test, we conclude that the 
data could have come from a normal distribution. 



Significance test for p s = the population value of r s : 

For n < 30, the rejection region(s) for the tests of H Q : p s = 0, H Q : p s > 0, and 
H Q : p s < 0 are in the table of critical values for r s in Appendix A. 

For larger values of n, the nor mal dist ribution approximation can be used, 
and the test statistic will be z = r s yn — 1. 



example 
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Spearman Rank Correlation 

Two sportswriters have ranked 10 Olympic sports according to how interesting 
they are to watch in person, with the results shown here. Calculate and interpret 
the Spearman coefficient of rank correlation, and use the 0.10 level of significance 
in testing whether its true value could be something other than 0. 





Arnold's 


Barbara's 






Olympic Sport 


Ranking, x ( - 


Ranking, y ( 


d i = *i - y, 


c/ 2 

i 


Track & field 


1 


2 


-1 


1 


Basketball 


2 


4 


-2 


4 


Volleyball 


3 


5 


-2 


4 


Swimming 


4 


1 


3 


9 


Boxing 


5 


8 


-3 


9 


Ice skating 


6 


3 


3 


9 


Weight lifting 


7 


6 


1 


1 


Wrestling 


8 


7 


1 


1 


Judo 


9 


10 


-1 


1 


Equestrian 


10 


9 


1 


1 



5tf = 40 



For these data, the Spearman coefficient of rank correlation is r s = 0.758, and 
it is calculated as follows: 

6CZdf) 6(40) 240 

r — 1 ^ =1 ^ =1 = 0.758 

n{n 2 - 1) 10(10 2 - 1) 990 

The two sportswriters seem to be somewhat in agreement regarding the rela- 
tive attractiveness of the various sports, but does the calculated test statistic dif- 
fer significantly from 0 at the 0.10 level of significance? To find out, we refer to 
the table of critical values for r s in Appendix A to identify the critical values of 
the test statistic for a two-tail test at the 0.10 level of significance. The table is 
symmetrical and provides one-tail areas, so we use the 0.05 column of the n = 
10 row. The critical values are —0.564 and +0.564. The calculated test statistic, 
r s = 0.758, falls into the upper rejection region and we are able to reject the pos- 
sibility that the population value for the Spearman coefficient of rank correlation 
could be 0. You may wish to further use the table in verifying that the most ac- 
curate statement we can make about the p-value for this two-tail test is that it is 
somewhere between 0.01 and 0.02. 



Computer Solutions 14.9 shows the Excel and Minitab procedures for obtain- 
ing the Spearman coefficient of rank correlation. Although each printout has the 
same correlation value, note that Excel has used the normal approximation in its 
p-value and critical-value calculations. We've used the Excel printout here as an 
example, but this approach is best used when the number of sample pairs is at 
least 30. 
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Spearman Coefficient of Rank Correlation 
EXCEL 





A B 


C 


D 


1 


Spearman Rank Correlation 




2 










3 


Arnold and Barbara 






4 


Spearman Rank Correlation 


0.758 


5 


zStat 




2.273 


6 


P(Z<=z) one-tail 




0.012 


7 


z Critical one-tail 




1.282 


8 


P(Z<=z) two-tail 




0.023 


9 


z Critical two-tail 




1.645 



Using Excel to obtain Spearman coefficient of rank correlation 

1 . Open file CX14SPEAR.XLS. Click Tools. Click Data Analysis Plus. Select Correlation (Spearman) and click OK. 

2. Enter B1:B11 into the Variable 1 Range box and C1:C11 into the Variable 2 Range Box. Select Labels and enter 
0.10 into the Alpha box. 

Note: The Spearman coefficient of rank correlation has been calculated as described in the text, but the Excel p-values and 
critical values are based on the normal approximation. This approximation works best when there are at least 30 pairs of 
values, but it is shown here for demonstration purposes. 



MINITAB 

Correlations: Arnold, Barbara 

Pearson correlation of Arnold and Barbara = 0.758 
P-value = 0.011 

Using Minitab to obtain Spearman coefficient of rank correlation 

1 . Open workbook CX14SPEAR.MTW. Click Stat, select Basic Statistics, and click Correlation. 

2. Enter C2 and C3 into the Variables box, select Display p-values, and click OK. 





14.41 What role does the binomial distribution play in 
the sign test, and when is it permissible to use the normal 
approximation in carrying out the test? 

14.42 Whether testing a single sample or comparing 
paired samples, how should one go about deciding 
whether to use the sign test versus the Wilcoxon signed 
rank test? 



14.43 For n = 8 nonzero differences between paired 
observations, there are 5 ( + ) differences for d = x x — x 2 
Use the 0.05 level of significance in testing H Q : m d = 0 
versus Hy m d 0. 

14.44 For n = 6 nonzero differences in a one-sample 
test, there are 5 ( + ) differences for d = x — 20. Use the 
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0.10 level of significance in testing H 0 : m d S 20 versus 
Hy m d > 20. 

14.45 Using the data of Exercise 14.13 and the 0.10 level, 
apply the sign test to compare the two samples. How do 
the results compare to those of that exercise? If they differ, 
what does this imply about the respective tests? 

14.46 Using the data of Exercise 14.8 and the 0.05 level, 
apply the sign test to compare the data with the claimed 
median. How do the results compare to those of that ex- 
ercise? If they differ, what does this imply about the 
respective tests? 

14.47 In the runs test for randomness, what is a "run"? 

14.48 How many runs are there in each of the following 
series? 

a. 11000100101100001101101011 
10 0 1 

b. 0101101110001011001010010 
110 0 1 

c. 1110001110001100110010101 
0 0 10 1 

d. 1111111111111100000000000 
0 0 0 0 0 

14.49 From 1789 through 1870, the last names of asso- 
ciate justices appointed to the U.S. Supreme Court were 
as coded below. Last names beginning with A-M are 
shown as a "1," those beginning with N-Z as a "2." 
Compare the observed number of runs with the expected 
number, then interpret what this difference might mean in 
terms of the sequence of the appointees. At the 0.10 level 
of significance, test the randomness of this series, source: 
Time Almanac 2003, p. 91. 

221111212111212221 
121111221111211121 

14.50 Franchise headquarters claims it randomly selected 
the local franchises that received surprise visits during the 
past year. The sequence of visits is as shown below for 
male (M) versus female (F) managers. Using the 0.10 level 
of significance, evaluate the randomness of the sequence. 

FMMMFMFMFMFFM 
MFMFMMMFMMF 

14.51 A random sample of recent LSAT (Law School Ad- 
mission Test) scores submitted by applicants to a univer- 
sity's law program are as listed below. Using the 0.10 
level of significance, apply the Kolmogorov-Smirnov test 
for normality in examining whether the scores could have 
come from a normally distributed population. 

142 161 142 149 147 145 164 170 

14.52 Product testers for a radio manufacturer routinely 
select a sample of units from the production line and test 
circuit integrity by subjecting each unit to extreme 
vibrations that are approximately triple the most violent 
shaking they are ever likely to encounter in the field. For the 



most recent five radios tested, the number of minutes until 
failure was found to be 15, 79, 68, 10, and 17 minutes. 
Using the 0.05 level of significance, apply the Kolmogorov- 
Smirnov test for normality in examining whether the pop- 
ulation of failure times might be normally distributed. 

14.53 A local supermarket has contracted for floor clean- 
ing and polishing to be done each evening by a small 
work crew using mechanized equipment. After the crew 
has performed the service for several weeks and settled 
into a routine, the supermarket manager has timed the 
most recent jobs to get a better idea as to the nature of 
the distribution of times for performing the task. Over 
the past week, the cleaning and polishing times have been 
61, 44, 57, 49, 59, 43, and 42 minutes. Using the 0.10 
level of significance, apply the Kolmogorov-Smirnov test 
for normality in examining whether the population of 
cleaning times might be normally distributed. 

14.54 What is the Spearman coefficient of rank correla- 
tion, what range of values can it have, and how is its 
numerical value interpreted? 

14.55 Two travel editors have each ranked 10 popular 
vacation spots according to the value they represent for 
the money. The sum of the squared differences between 
their rankings is 28. Determine and interpret the 
Spearman coefficient of rank correlation. At the 0.05 
level, does r s differ significantly from 0? 

14.56 For the data of Exercise 14.16, convert the 
husbands' and wives' expenditures to ranks, then determine 
and interpret the Spearman coefficient of rank correlation. 
At the 0.05 level, does r s differ significantly from 0? 

14.57 A Republican and a Democrat candidate for politi- 
cal office have each rated the 12 most recent pieces of leg- 
islation in their state from 1 (most beneficial to the state) 
to 12 (least beneficial). Given the data shown here, deter- 
mine and interpret the Spearman coefficient of rank corre- 
lation. At the 0.02 level, does r s differ significantly from 0? 

Legislation ABCDEFGH I JK L 
Republican's ranking 123456 78 9 10 11 12 
Democrat's ranking 3469851071211 1 2 

/ data set/ Note: Exercises 14.58-14.63 require a 
computer and statistical software. 

14.58 Testing the effect of a diet and exercise regimen on 
subjects' cholesterol levels, a sports medicine clinic enlists 
40 volunteers whose cholesterol levels are measured 
before and after the 1 -month program. The before and 
after data are listed in data file XR14058. Using the sign 
test and the 0.025 level of significance, examine the effec- 
tiveness of the diet and exercise program. Identify and 
interpret the p-value for the test. 

14.59 According to the U.S. Department of Transporta- 
tion, the median age of automobiles and trucks on the 
road in the United States is about 8 years. The ages for a 
random sample of vehicles in the executive parking lot of 
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a metal fabrication company are listed in file XR14059. 
Using the sign test and the 0.05 level of significance, ex- 
amine whether the median age for cars driven to work 
by the company's executives might be less than the 
median reported by the Department of Transportation. 
Identify and interpret the p-value for the test. 
source: Keith Simmons, "Hanging on to the Old Buggy," USA Today, 
September 27, 2000, p. 1A. 

14.60 For the first 40 Super Bowl games, the winner is 
listed in file XR14060 according to "1" (American Con- 
ference) or "2" (National Conference). At the 0.05 level 
of significance, can this sequence be considered as other 
than random? Identify and interpret the p-value for the 
test, source: The New York Times Almanac 2006, pp. 934-938, and 
NFL.com, August 19, 2006. 

14.61 For the first 40 Super Bowl games, the winning 
margins are listed in file XR 14061 . At the 0.10 level of 



SUMMARY 



significance, can this sequence be considered as other than 
random? Identify and interpret the p-value for the test. 

source: The New York Times Almanac 2006, pp. 934-938, and NFL.com, 
August 19, 2006. 

14.62 For the set of winning margins in Exercise 14.61, 
use either the Kolmogorov-Smirnov test or the chi-square 
test for normality and the 0.05 level of significance in 
examining whether the data could have come from a 
normal distribution. 

14.63 Last year's rates of return for a random sample of 
mutual funds are listed in file XR14063. Using either the 
Kolmogorov-Smirnov test or the chi-square test for nor- 
mality and the 0.10 level of significance, examine whether 
the data could have come from a normal distribution. 



14.8 



• Nonparametric versus parametric testing 

A nonparametric test is one that makes no assumptions about the specific shape of the 
population from which a sample is drawn. In Chapter 13, the chi-square tests for 
goodness of fit, independence, and comparison of sample proportions were nonpara- 
metric, but chi-square tests of hypotheses regarding the population variance were 
parametric. (Recall that chi-square tests for o 2 required that the population be nor- 
mally distributed.) This chapter covers a variety of other nonparametric techniques, 
most of which have a parametric counterpart covered in other chapters of the text. 

• When to use a nonparametric test 

A nonparametric test is used instead of its parametric counterpart (1) when the 
data are of the nominal or ordinal scale of measurement or (2) when the data are 
of the interval or ratio scale of measurement, but one or more other assumptions, 
such as the normality of the underlying population distribution, are not met. Un- 
like parametric methods, nonparametric techniques can be applied even when the 
sample sizes are very small. 

• Nonparametric tests: types and applications 

The Wilcoxon signed rank test is applied to a single sample in testing whether the 
population median could be a hypothesized value. In this application, it is the 
nonparametric counterpart to the single-sample i-test. With paired samples, the 
Wilcoxon signed rank test is the nonparametric counterpart to the paired-samples 
i-test, and tests whether the population median for the difference between paired 
observations could be zero. The Wilcoxon rank sum test compares the medians of 
two independent samples and is the nonparametric counterpart to the two- 
sample, pooled-variances £-test. 

The Kruskal-Wallis test compares the medians of more than two independent 
samples and is the nonparametric counterpart to the one-way analysis of vari- 
ance. Like the randomized block ANOVA, the Friedman test compares the effec- 
tiveness of treatments, but it is concerned with their medians rather than their 
means. Also, the Friedman test is able to examine data that are ordinal (ranks) 
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and makes no assumptions regarding the populations from which data values 
have been drawn. 

The sign test is used for the same purposes as the Wilcoxon signed rank test, 
but data need only be of the ordinal scale of measurement. In this test, each ob- 
servation (one-sample test) or difference between paired observations (paired 
samples) is expressed only as a plus ( + ) or a minus ( — ) to indicate the direction 
of the difference. Because it assumes that data are ordinal instead of interval, it 
makes less efficient use of the data than does the Wilcoxon signed rank test. 

The runs test evaluates the randomness of a series of observations by analyzing 
the number of "runs" it contains. A run is the consecutive appearance of one or 
more observations that are similar in some way. If the number of runs differs sig- 
nificantly from the number expected, we conclude that some pattern exists in the 
sequence of data values. 

The Kolmogorov-Smirnov test for normality can be used for samples of any 
size, including those that are too small for the chi-square method presented in 
Chapter 13. The test is based on the comparison of two cumulative relative fre- 
quencies: one that is derived from the actual data, the other constructed under 
the assumption that the sample was actually drawn from a normal distribution. 

For paired observations where the two variables are at least ordinal, the 
Spearman coefficient of rank correlation can be applied to the ranks in determin- 
ing the strength and direction of the relationship between the variables. 



n 



equations f • * 'wSSi 



This is where you would ordinarily find a neat, concise listing of the equations introduced 
in the chapter. However, for this rather unique chapter, such a listing would be neither neat 
nor concise. The chapter's many equations tend to be inseparable from the detailed proce- 
dures used in obtaining values for the terms within them. For this reason, we are foregoing 
our usual summary in favor of recommending that you refer to the chapter sections for the 
techniques and their respective equation/procedure summaries. 



n - 

■ 










chi 


□pter exercises 


* 4 












-F ) 





14.64 Two different paste wax formulations are being 
compared to see if they hold up equally well in repeated 
washing at an automated car wash. On each of 10 test 
vehicles, formulas A and B are applied to opposite sides 
of the roof, and a coin flip is used in determining the 
side that receives formula A. The number of washings 
before water no longer "beads" on the surface is found 
to be as listed here. At the 0.05 level, can we conclude 
that the wax formulations are equally effective? Using 
the appropriate statistical table, what is the most 



accurate statement that can be made about the p-value 
for the test? 

Vehicle 



123456789 10 

Washings A 12 14 15 14 14 11 14 10 8 15 
Survived B 11 19 17 18 11 16 17 13 13 12 
by Wax 
Formulation 
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14.65 Comparing the lifetimes of two brands of penlight 
batteries, a researcher has randomly selected 12 batteries of 
brand A and 10 of brand B, then subjected each battery to 
the same usage test. Given the lifetimes listed below and 
using the 0.05 level of significance, can we conclude that 
brand A is superior to brand B? Using the appropriate sta- 
tistical table, what is the most accurate statement that can 
be made about the p-value for the test? 

Brand A 93.4 119.1 94.3 90.0 66.0 92.1 

121.4 92.5 111.6 87.6 98.0 79.2 minutes 

Brand B 59.2 65.5 81.5 92.5 71.7 

69.4 83.4 92.5 93.3 81.7 minutes 

14.66 It has been reported that the median price of a 
home purchased in Portland, Oregon, during 2001 was 
$172,300. For a random sample of homes purchased in a 
certain area of the city during that year, the prices were as 
listed below. Using the 0.025 level of significance, test 
whether the median purchase price of all homes sold in 
that area of the city during 2001 could have been less 
than $172,300. Using the appropriate statistical table, 
what is the most accurate statement that can be made 
about the p-value for the test? source: Bureau of the Census, 
Statistical Abstract of the United States 2002, p. 593. 

$177.4 148.5 147.4 172.2 130.0 
164.1 164.4 175.2 135.0 191.0 thousand 

14.67 The resting pulse rates of 6 randomly selected 
adults have been measured before and after a 
4-week aerobic fitness program, with the results listed 
below. At the 0.05 level, can we conclude that the 
program has had a significant effect? Using the appropri- 
ate statistical table, what is the most accurate statement 
that can be made about the p-value for the test? 

Resting Pulse Rate 
Before After 



1 86 85 

2 81 82 

3 77 73 
Person 4 g5 ?g 

5 77 67 

6 77 73 beats/minute 



14.68 In a fast-food restaurant, it's desirable to have seats 
that are comfortable but not too comfortable. Patrons sit- 
ting at a seat that is too comfortable may tend to linger 
and tax the capacity of the restaurant. In an experiment to 
compare seat designs by observing how long people with 
the same food order stay in the restaurant, each of 18 per- 
sons is given a hamburger, fries, and shake, then timed to 
see how much time they require to eat and leave. The 
times are shown here. At the 0.05 level, do the seat de- 
signs differ significantly in terms of the time that patrons 
stay? Using the appropriate statistical table, what is the 



most accurate statement we can make about the p-value 
for the test? 

Time, six patrons and seat design 1 : 
6 10 9 5 5 8 minutes 

Time, six patrons and seat design 2: 
13 10 8 13 9 12 

Time, six patrons and seat design 3: 
10 15 10 8 15 15 

14.69 A consumer magazine is evaluating five brands of 
trash compactors for their effectiveness in reducing the 
volume of typical household products that are 
discarded. In the experiment, each block consists of 
three bags containing identical mixtures of the kinds of 
trash described, and each measurement is the volume 
(cubic feet) to which a given compactor reduced the 
load it was assigned. At the 0.025 level, can the 
magazine conclude the compactors are equally effective 
in reducing the volume of household trash items? Using 
the appropriate statistical table, what is the most accu- 
rate statement that can be made about the p-value for 
the test? 

Compactor 

1 2 3 4 5 

Cans and Bottles 1.2 1.3 1.4 1.6 1.5 
Cardboard Boxes 1.5 1.5 1.8 1.8 1.7 
Newspapers 1.7 1.9 2.1 2.4 2.3 ft 3 

and Magazines 

14.70 The manager of a photography studio has carried 
out an experiment to compare the attractiveness of four 
different poster-size photographs for use in the display 
window. During 23 business days of the previous month, 
she randomly selected one of the four photographs for dis- 
play; the data below show the number of customers for 
each day that a given photograph was displayed in the 
window. At the 0.01 level of significance, examine 
whether the poster-size photos could be equally effective 
in attracting customers into the store. Using the appropri- 
ate statistical table, what is the most accurate statement 
that can be made about the p-value for the test? 



Photo 1 


Photo 2 


Photo 3 


Photo 4 


14 


39 


19 


28 


39 


19 


27 


21 


45 


25 


22 


32 


48 


37 


15 


29 


26 


18 


31 


42 


30 


45 




48 



14.71 Three different O-ring shapes are being considered 
for use in a deep-sea submersible vehicle that will be 
remote-operated from the surface vessel. Because the seal- 
ing effectiveness depends partially on the temperature of 
the surrounding sea water, a randomized block design has 
been set up in which each design will be pressure-tested at 
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five temperatures. The temperatures have been selected to 
represent the maximum of approximately 86° Fahrenheit 
at surface locations in the tropical climates to the near 32° 
temperatures found at the greatest depths. For each seal- 
temperature cell, the measurement below is the pressure 
(thousands of pounds per square inch) recorded just prior 
to seal failure. At the 0.025 level of significance, could the 
respective O-ring shapes be equally effective? Using the 
appropriate statistical table, what is the most accurate 
statement that can be made about the p-value for the test? 







O-Ring Shape 






1 


2 


3 




86 


18.5 


18.3 


17.4 




75 


14.5 


15.8 


10.6 


Temperature, 


65 


13.5 


14.9 


1 1.9 


Degrees 


55 


1 1.9 


11.5 


10.5 


Fahrenheit 


45 


10.7 


9.7 


7.6 




33 


8.7 


5.8 


6.2 



thousands 
of lb. per 
square 
inch 

14.72 Prior to taking an aerobics class, each of 12 health- 
club members is asked to rate the instructor on a scale in 
which 3 = "excellent," 2 = "good," and 1 = "poor." 
Following the class, each student is again asked to rate 
the instructor. Given the following before-and-after 
ratings, use the 0.05 level in determining whether there 
has been a significant improvement in the ratings received 
by the instructor. 

Student 



1 2 3 4 5 6 7 8 9 10 11 12 



Precourse 132222131 2 2 1 
Rating 

Postwurse 223332333 2 3 2 
Rating 

14.73 Selecting a position that ensures that you will not 
be conspicuous, stand or park near a local convenience 
store and observe 35 consecutive individuals entering the 
store. Jot down whether each is male versus female, then 
use the runs test and the 0.05 level of significance in 
determining whether this sequence of customers might be 
other than random. 

14.74 An interviewer has been told to randomly select re- 
spondents and to include x = whether or not they are a 
college graduate among the data recorded. The categories 
of the first 30 persons interviewed, with G = college 
graduate and N = nongraduate, are shown in the 
sequence below. At the 0.10 level, evaluate the random- 
ness of the sequence. 

GNGNGNNGNGGNNNG 
NGGNNGNNGGNGNNG 

14.75 During her interview with an urban firm, Betty 
was told by the personnel director that she would typically 



require "no more than 20 minutes" to drive to work from 
the apartment complex to which she has recently moved. 
During the first 2 weeks of work, her travel times have 
been as listed below. Use the 0.05 level of significance in 
evaluating the personnel director's claim. Using the appro- 
priate statistical table, what is the most accurate statement 
that can be made about the p-value for the test? 

24.7 19.5 22.6 21.4 18.0 
22.2 17.1 27.1 24.9 23.1 min 

14.76 For a random sample of drivers using an automated 
toll booth that requires exact change, the following data 
show the elapsed time (in seconds) from the moment the 
vehicle stopped to the moment it began to move away fol- 
lowing payment of the toll: 6, 5, 7, 4, and 17. Using the 
0.10 level of significance, apply the Kolmogorov-Smirnov 
test for normality in examining whether the times could 
have come from a normally distributed population. 

14.77 Two investment counselors have evaluated the 15 
major companies in an industry according to their poten- 
tial as a "safe investment." Both counselors agree that 
company C is the best firm in which to invest. These and 
the remaining ranks are shown in the table. Use the 
Spearman coefficient of rank correlation in examining the 
direction and the strength of the agreement between the 
two counselors. At the 0.05 level, does r s differ 
significantly from 0? 

Firm ABCDEFGH I J K LMNO 
Counselor 1 2 14 1 8 7 1 3 3 1 2 9 4 1 0 1 5 5 1 1 6 
Counselor 2 3 12 1 5 6 13 7 10 1 1 2 9 14 8 1 5 4 

/ data Set / Note: Exercises 14.78-14.86 require a 
computer and statistical software. 

14.78 To help ensure that packages contain at least the 
stated weight of 12 ounces, a cereal manufacturer wants 
the filling machine set to 12.5 ounces. For a random sam- 
ple of boxes, the weights (in ounces) are listed in file 
XR14078. At the 0.10 level of significance, does the filling 
machine seem to have "drifted" from the desired setting? 
Identify and interpret the p-value for the test. 

14.79 In the past, telephone solicitors for a home 
improvement firm have been paid an hourly wage. A 
small-scale experiment has been set up in which five solici- 
tors are given further incentive in the form of 1 % of the 
eventual sale. Data file XR14079 shows the number of 
sales originating from each solicitor for the day before and 
the day after the test policy was put into place. At the 
0.05 level, has the incentive policy increased performance? 
Identify and interpret the p-value for the test. 

14.80 Two applicants for a component assembly position 
with an electronics firm have been asked to complete each 
of six tasks that their jobs would typically involve. For 
each assigned task, file XR14080 shows the times 
(minutes) required by each of the applicants. At the 0.10 
level, could the applicants be equally qualified? Identify 
and interpret the p-value for the test. 
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14.81 A new taxi driver will be making regular trips 
from the airport to a downtown hotel. She has heard con- 
flicting opinions as to which of two routes takes less time. 
When she randomly selects which route to take for each 
of 10 trips, the five times (minutes) for each route are as 
listed in file XR14081. Using the 0.05 level, examine 
whether the two routes could be equally time efficient. 
Identify and interpret the p-value for the test. 

14.82 Comparing the attitudes of customers at three restau- 
rants, a researcher selects a random sample of customers 
from each and asks each person to rate the restaurant on a 
scale of 50 to 100, with 50 = "poor" and 100 = 
"excellent." Given the results in file XR14082, and using the 
0.05 level of significance, examine whether the population 
median customer ratings for the three restaurants could be 
the same. Identify and interpret the p-value for the test. 

14.83 Reacting to a skeptical reader who doubts that 
so-called wine connoisseurs can really tell the difference 
between cheap and expensive wines, the dining editor of 
a city newspaper sets up a test in which each of five con- 
noisseurs is asked to taste three unidentified wines, then 
rate each one on a scale of 1 ("poor") to 10 ("excellent"). 
Unknown to the raters, wine 1 is $1.50 per gallon Elder- 



integrated case 



berry, wine 2 is $8 per liter Rhine, and wine 3 is $35 
per liter Cabernet Sauvignon. Their ratings are in file 
XR14083. Considering the wine categories as 
"treatments" and the tasters as "blocks," and using the 
0.05 level of significance, advise the dining editor as to 
how to respond to this reader when he writes his next 
column dealing with wine. Identify and interpret the 
p-value for the test. 

14.84 Five computer operators have been asked to rate 
four different workstation arrangements in terms of com- 
fort with 1 = very uncomfortable to 10 = very comfort- 
able. Their ratings are in file XR14084. At the 0.05 level, 
could the workstation arrangements be equally comfort- 
able? Identify and interpret the p-value for the test. 

14.85 The weights (in ounces) of the first 34 units in a 
production run of cast-iron flywheels are in file XR14085. 
Using the 0.10 level of significance, test the randomness 
of this sequence of measurements. Identify and interpret 
the p-value for the test. 

14.86 Apply either the chi-square or the Kolmogorov- 
Smirnov test for normality to the data described in Exer- 
cise 14.85. At the 0.05 level of significance, could the 
data have come from a normal population? 




Thorndike Sports Equipment 

Ted and Luke Thorndike have just returned from lunch 
with Candice Ergonne, president of a chain of 550 health 
clubs located across the United States. The clubs have a 
quality image and are known for featuring equipment and 
facilities that are second to none. The Thorndikes are 
rightfully pleased that Ms. Ergonne has invited their pro- 
posal for supplying exercise bicycles to the 120 new clubs 
that are to be opened during the next year. 

After further negotiations, the deal is practically 
closed, but some fine-tuning remains with regard to the 
seat design for the "ThornBike 2000" model that has been 
selected. Ms. Ergonne recognizes that some club members 
require wider seats than others but doesn't want to call at- 
tention to this fact by having differing seat widths from 



one bike to the next. She would much prefer a "one size 
fits all" approach to the seating decision. 

In an experiment to evaluate four seat designs, Ms. 
Ergonne supplies seven persons who vary widely in size, 
while the Thorndikes provide the seats to be tested. Each of 
the individuals spends 5 minutes riding on each of the four 
seat designs, with the order determined randomly. The sub- 
ject then rates the comfort of the seat on a scale of 1 to 10, 
with 1 = "poor" and 10 = "excellent." The results are as 
shown below. The data are also in file THORN14. 

From these results, and using the 0.10 level of 
significance, can Ted Thorndike reject the null hypo- 
thesis that the four seating designs could be equally 
comfortable? 



Seat Design 



12 3 4 



13 9 5 7 

2 7 8 6 8 

3 8 10 9 7 
Rider 4 4 8 3 6 

5 6 8 5 10 

6 9 5 7 6 

7 8 10 7 9 
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Circuit Systems, Inc. (B) 

In Chapter 11, we visited Circuit Systems, Inc., a company 
that was concerned about the effectiveness of its new pro- 
gram for reducing the cost of absenteeism among hourly 
workers. Besides allowing employees to be paid for unused 
sick leave, the program also pays $200 toward member- 
ship in a local health club for those employees who wish to 
join. Thomas Nelson, who introduced the new program, 
has collected data describing the 233 employees who have 
been with the company for both the year preceding the 
new program and the year following its implementation. 
The information Mr. Nelson has collected is in the 
CIRCUIT data file on the text CD, and the variables and de- 
scription of his program are as described in Circuit Sys- 
tems, Inc. (A), in Chapter 11. 

1. Repeat Question 1 from the Circuit Systems (A) case, 
but use the Wilcoxon signed rank test for paired sam- 
ples instead of the dependent-samples t-test in compar- 
ing the two years in terms of days missed before and 
after the new program was implemented. 

2. Repeat Question 2 from the Circuit Systems (A) case, 
but use the Wilcoxon signed rank test for paired sam- 




ples. Be sure to keep in mind that the goal of the pro- 
gram is to reduce the cost of absenteeism, and that 
you will need to either create two new variables or 
utilize the ones you created in the earlier case. See the 
discussion and hints in the Circuit Systems (A) case 
questions regarding these variables. 
3. Repeat Question 3 from the Circuit Systems (A) case, 
but use an appropriate nonparametric statistical test 
from this chapter in evaluating the effectiveness of the 
exercise program. 



Winning For Dollars 

In sports, some teams are referred to as "the best that money can buy." If team 
payrolls and wins during the 2000 major league baseball season are any indication 
there may be some truth to that adage. Not only did the highest-paid 
team win the World Series, but there was a very strong 
relationship between the size of a team's , little steep er 

payroll and the number of wins it achieved 5 0 me sl °P 

during the regular season. 

This relationship can be described by a 
linear equation fitted to the data using a 
technique called simple linear regression. 
This popular method for data analysis involves 
both the scatterplot discussed in Chapter 2 and 
the coefficient of correlation that was briefly 
introduced in Chapter 3. Using linear regression 
and correlation, we can make interesting 
observations like these: (1) variation in team 
payrolls explains approximately 33% of the 
variation in team wins; (2) on average, each 
additional win cost an extra $0.25 million in team 
payroll; and (3) the Chicago White Sox, one of the 
lowest-paid teams in baseball, pleased their fans 
and owners by winning about 1 8 more games than 
their payroll level would have predicted. Later in the 
chapter, in Statistics in Action 1 5.2, we'll delve a little 
further into how regression and correlation analysis 
applies to the payrolls and win totals of the 2000 
major league baseball season. 
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learning 
objectives 

After reading this 
chapter, you should 
be able to: 



• Explain the individual terms in the simple linear regression model, and describe 
the assumptions that the model requires. 

• Determine the least-squares regression equation, and make point and interval 
estimates for the dependent variable. 

• Determine and interpret the value of the coefficient of correlation. 

• Describe the meaning of the coefficient of determination. 

• Construct confidence intervals and carry out hypothesis tests involving the 
slope of the regression line. 

• Test the significance of the correlation coefficient. 

• Use residual analysis in examining the appropriateness of the linear regression 
model and the extent to which underlying assumptions are met. 



15.1 



INTRODUCTION 



In Section 3.6 of Chapter 3, we briefly introduced the coefficient of correlation as 
a way of examining whether two interval- or ratio-scale variables could be related, 
and in Section 13.4 of Chapter 13, we used chi-square analysis to examine 
whether two nominal-scale variables could be related. Although these approaches 
can suggest that a relationship exists, they do not reveal in exactly what way the 
variables are related. The techniques of this chapter are applicable to two interval- 
or ratio-scale variables and describe both the nature and the strength of the rela- 
tionship between the variables. 

Regression analysis provides a "best-fit" mathematical equation for the values 
of the two variables. The equation may be linear (a straight line) or curvilinear, 
but we will be concentrating on the linear type. Correlation analysis measures the 
strength of the relationship between the variables. 

The topic of this chapter is known as simple linear regression and correlation 
because there are just two variables, y and x. These are called the dependent (y) 
and independent (x) variables, since a typical purpose for this type of analysis is 
to estimate or predict what y will be for a given value of x. In the next chapter 
we'll cover multiple linear regression and correlation, in which two or more inde- 
pendent variables are used in estimating a dependent variable. The following are 
examples of dependent and independent variables that might be the subject of 
simple regression and correlation: 



Dependent Variable (y) 

Sales 

Starting salary 
Product recalls 
Cost of a magazine ad 



Independent Variable (x) 

Advertising 

Grade point average 

Quality assurance expenditures 

Circulation of the magazine 



One way to examine whether two variables might be linearly related is to con- 
struct a scatter diagram, as we did in Section 2.5 of Chapter 2, then use the "eyeball 
method" to fit a straight line roughly approximating the points in the diagram. 
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In the next section, we will introduce the simple linear regression model and em- 
ploy a more quantitative approach that relies on what is known as the least- 
squares criterion. 



THE SIMPLE LINEAR REGRESSION MODEL 



15.2 



Model and Assumptions 

The simple linear regression model is a linear equation having a y-intercept and a 
slope, with estimates of these population parameters based on sample data and 
determined by standard formulas. The model is described in terms of the popula- 
tion parameters as follows: 



The simple linear regression model: 

y, = Po + $\ x i + e i where y, 

x , 

Pi 



a value of the dependent variable, y 
a value of the independent variable, x 
the y-intercept of the regression line 
the slope of the regression line 
random error, or residual 



1. For a given value of x, the expected value of y is given by the linear equation, 



l^yx = Po + Pi x r The term /jl can be stated as "the mean of y, given a 
specific value of x." 

2. The difference between the actual value of y and the expected value of y is the 
error, or residual, e- = y ( - — + P^x-). 

According to this model, the y-intercept for the population of (%-, y ■) pairs is j3 0 
and the slope is jSj. The e-term is the random error, or residual, for the ith observa- 
tion or measurement — this residual is the difference between the actual value (y ; ) 
and the expected value, fi x = /3 0 + j8 1 x ! , from the regression line. The y values 
may be scattered above and below the regression line, but the expected value of y 
for a given value of x will be given by the linear equation, p, y x = /3 Q + jSjXj. 

Three assumptions underlie the simple linear regression model: 

1. For any given value of x, the y values are normally distributed with a mean 
that is on the regression line, fi = /3 0 + jS-j*,-. 

2. Regardless of the value of x, the standard deviation of the distribution of 
y values about the regression line is the same. The assumption of equal stan- 
dard deviations about the regression line is called homoscedasticity. 

3. The y values are statistically independent of each other. For example, if a 
given y value happens to exceed fi yx = /3 Q + /S^, this does not affect the 
probability that the next y value observed will also exceed fi = /6 0 + fi\x t . 

Figure 15.1 (page 552) shows the variation of y values above and below a 
population regression line. There is a "family" of such distributions (one for each 
possible x value). Each distribution has fi x as its mean, and the standard deviations 
are the same (a y x = a). 

The three assumptions can also be expressed in terms of the error, or resid- 
ual, component (e-) in the simple linear regression model: (1) For any given value 



N 


O 


T E 


S 
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FIGURE 15.1 



For any given value of x, the 
y values are assumed to be 
normally distributed about 
the population regression 
line and to have the same 
standard deviation, <r. 
The regression line based 
on sample data is an 
estimate of this "true" line. 
Likewise, s y x is our sample 
estimate of a. 



of x, the population of e ; values will be normally distributed with a mean of zero 
and a standard deviation of a; (2) this standard deviation will be the same re- 
gardless of the value of x; and (3) the e- values are statistically independent of 
each other. 

Based on the sample data, the y-intercept and slope of the population regres- 
sion line can be estimated. The result is the sample regression line: 



Sample regression line: 




y = b 0 + byX where y 


= the estimated value of the dependent variable (y) 


for a given value of x 




= the y-intercept; this is the value of y where the line 


intersects the y-axis whenever x = 0. 




= the slope of the regression line 


X 


= a value for the independent variable 



The cap O over the y indicates that it is an estimate of the (unknown) "true" 
value of y. The equation is completely described by the y-intercept (b 0 ) and slope 
(£>j), which are sample estimates of their population counterparts, /3 0 and (3 V re- 
spectively. An infinite number of possible equations can be fitted to a given scat- 
ter diagram, and each equation will have a unique combination of values for b 0 
and b v However, only one equation will be the "best fit" as defined by the least- 
squares criterion we are going to use. 

The Least-Squares Criterion 

The least-squares criterion requires that the sum of the squared deviations 
between y values in the scatter diagram and y values predicted by the equation 
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be minimized. In symbolic terms: 

Least-squares criterion for determining the best-fit equation: 
The equation must be such that ~~ V;) 2 is minimized 

where y i = the observed value of y for the given value of x 

y i = the predicted value of y for that x value, as determined 
from the regression equation 

To show how the least-squares criterion works, consider parts (a) and (b) of 
Figure 15.2. In part (a) the sum of the squared deviations between observed and 
predicted y values is 100.0, while in part (b) the sum is only 67.0. According to 
the least-squares criterion, the line in part (b) is a better fit to the data than the line 
in part (a). 





0 1 2 3 4 5 

(b) The same scatter diagram and estimation line y = 1 + 3jc 
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Determining the Least-Squares Regression Line 

Equations have been developed for proceeding from a set of data to the least- 
squares regression line. They are based on the methods of calculus and provide 
values for b 0 and b x such that the least-squares criterion is met. The least-squares 
regression line may also be referred to as the least-squares regression equation or 
as simply the regression line: 

Least-squares regression line, y 

• Slope 

_ (S*,-y,-) - nxy 

• y-intercept 



= bf, + b,x: 



where n = number of data points 



b Q = y — b,x 



With the slope determined, we take advantage of the fact that the least- 
squares regression equation passes through the point (3c, y). The equation for find- 
ing the y-intercept (b 0 = y — b^x) is just a rearrangement of y = b 0 + b^x. 
(Note: If you are using a pocket calculator, you may wish to redefine the units so 
as to reduce the number of digits in the data before applying these and other for- 
mulas in the chapter. For example, by converting from dollars to millions of 
dollars, $30,500,000 may be expressed as $30.5 million. For some pocket calcu- 
lators, this can help avoid a "blow-up" in the number of digits when calculating 
summations of products or squares.) 



example 



Regression Equation 

A production manager has compared the dexterity test scores of five assembly-line 
employees with their hourly productivity. The data are in CX15DEX and shown here. 

x = Score on y = Units Produced 

Employee Dexterity Test in One Hour 



A 


12 


55 


B 


14 


63 


C 


17 


67 


D 


16 


70 


E 


11 


51 



SOLUTION 

The calculations necessary for determining the slope and y-intercept of the regres- 
sion equation are shown in Table 15.1. Once the slope (b 1 = 3.0) has been deter- 
mined, this value is substituted into the equation for the y-intercept, and b 0 is 
found to be 19.2. The least-squares regression equation, shown in the scatter di- 
agram of Figure 15.3, is y = 19.2 + 3.0x. 
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Data and Preliminary Calculations 
X: = Score on y ; 



Employee 


Dexterity Test 


in One Hour 




*, ? 


yf 


A 


12 


55 


660 


144 


3025 


B 


14 


63 


882 


196 


3969 


C 


17 


67 


1139 


289 


4489 


D 


16 


70 


1120 


256 


4900 


E 


11 


51 


561 


121 


2601 




70 


306 


4362 


1006 


18,984 




2*7 


27/ 


2*,y, 


2*, 2 


E// 2 



70/5 = 14.0 y = 306/5 = 61.2 



TABLE 15.1 



Data and calculations 
for determining the 
least-squares regression 
line for the example 
involving dexterity test 
score (x) and units 
produced per hour (y) 



t 

Calculations for Slope and y-lntercept of Least-Squares Regression Line 

_ (2*,y ; ) - nxy _ 4362 - 5(14.0)(61 .2) _ 78.0 
slope, o, ( 2 x, 2 )-nx 2 1006 - 5(14.0) 2 26.0 



y-intercept, b 0 = y - 6,* = 61 .2 - 3.0(14.0) = 61.2 - 42.0 = 19.2 



1 


r 


The least-squares regression line is y = 


19.2 + 3.0x 


where y = estimated u 


nits produced per hour 


x = score on manual dexterity test 



FIGURE 15.3 




Scatter diagram and 
least-squares regression 
line for the data of 
Table 15.1 



x = Score on dexterity test 
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The slope of the regression line is positive, suggesting a direct relationship 
between dexterity test score and productivity. The value of the slope (b 1 = 3.0) 
indicates that each one-point increase in the dexterity test score will increase the 
estimated productivity by 3.0 units per hour. 



Point Estimates Using the Regression Line 

Making point estimates based on the regression line is simply a matter of substi- 
tuting a known or assumed value of x into the equation, then calculating the esti- 
mated value of y. For example, if a job applicant were to score x = 15 on the 
manual dexterity test, we would predict this person would be capable of produc- 
ing 64.2 units per hour on the assembly line. This is calculated as shown below. 
You can also use Seeing Statistics Applet 18, at the end of the chapter, to demon- 
strate how different inputs for the dexterity test score will produce different point 
estimates for productivity. 



Estimated 
productivity, 
units per hour 



The applicant's 
score on the 
dexterity test 



y = 19.2 + 3.0(15) = 64.2 units/hour, estimated productivity 

Computer Solutions 15.1 describes the procedures and shows the results 
when Excel and Minitab are used in performing a simple linear regression analy- 
sis on the productivity data in our example. We will be referring back to these 
printouts in later sections of the chapter. In the Excel printout, the intercept (19.2) 
and slope (3.0) are shown in cells F17:F18. Minitab shows the intercept and slope 
in the context of the least-squares equation. 




Simple Linear Regression 
EXCEL 





E 


F 


G 


H 


I 


J 


K 


1 


SUMMARY OUTPUT 






















75 - 
0J 70 - 
















2 




















3 


Regression Statistics 










♦ 


♦ 






4 


Multiple R 


0.955 




£ 65 - 
















5 


R Square 


0.911 




O 60 - 
















6 


Adjusted R Square 


0.882 




Q_ 55 - 
















7 


Standard Error 


2.757 




50 - 




















— 1 


1 — 


1 








8 


Observations 


5 




10 


12 


14 


16 


1 


8 




9 














Dexscore 








10 


ANOVA 










































11 




df 


SS 


MS 


F 


Significance F 




12 


Regression 


1 


234 


234.0 


30.789 


0.012 




13 


Residual 


3 


22.8 


7.6 








14 


Total 


4 


256.8 










15 
















16 




Coefficients 


Standard Error 


rSfaf 


P-value 


Lower 95% 


Upper 95% 


17 


Intercept 


19.2 


7.669 


2.504 


0.087 


-5.206 


43.606 


18 


Dexscore 


3.0 


0.541 


5.549 


0.012 


1.279 


4.721 
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Excel Simple Linear Regression 

1. The data for the primary example in this chapter are in file CX15DEX.XLS. The dexterity test scores (Dexscore) are 
in A2:A6, the production rates (Prodrate) are in B2:B6, and the labels are in A1 and B1. Click Tools. Click Data 
Analysis. Select Regression. Click OK. 

2. Enter B1:B6 into the Input Y Range box. Enter A1:A6 into the Input X Range box. Select Labels. Enter E1 into 
the Output Range box. Check the Line Fit Plots box. Click OK. After slight adjustments to the column widths, the 
number of decimal places, and the line plot, the results are as shown above. 



MINITAB 

Minitab Simple Linear Regression 

Regression Analysis: Prodrate versus Dexscore 



The regression equation is Prodrate = 19. 


.2 


+ 3. 


00 


Dexscore 


Predictor Coef 


SE Coef 


T 




P 






Constant 19.200 


7.669 


2.50 


0 


.087 






Dexscore 3.0000 


0. 5407 


5.55 


0 


.012 






S = 2.75681 R-Sq = 


91.1% R-Sq (ad j) = I 


58. 


2% 






Analysis of Variance 














Source DF 


SS 


MS 






F 


P 


Regression 1 


234.00 


234.00 




30. 


79 


0.012 


Residual Error 3 


22.80 


7.60 










Total 4 


256.80 













1. Using file CX15DEX.MTW, with dexterity test scores in C1 and production rates in C2: Click Stat. Select Regression. 
Click Regression. Enter C2 (Prodrate) into the Response box. Enter C1 (Dexscore) into the Predictors box. Click OK. 

2. Alternatively, we can generate a scatter diagram and line plot (not shown): Click Stat. Select Regression. Click Fitted 
Line Plot. Enter C2 (Prodrate) into the Response box. Enter C1 (Dexscore) into the Predictor box. Select Linear. 
Click OK. 

3. Another alternative is to use the equation to estimate y for a given value of x: In step 1, click Options. Enter the 
desired x value (e.g., 15) into the Prediction intervals for new observations box. Click OK. The estimated value of y 
(not shown here) will be 64.2. 



exercises 




15.1 Explain each term in the linear regression model, 
Vi = Po + Pi x i + e i- 

15.2 What assumptions are required in using the linear 
regression model? 

15.3 In the linear regression equation, y = b Q + b^x, 
why is the term at the left given as y instead of simply y? 

15.4 What is the least-squares criterion, and what does it 
have to do with obtaining a regression line for a given set 
of data? 



15.5 A scatter diagram includes the data points (x = 2, 
y = 10), (x = 3,y= 12), (x = 4,y = 20), and 

(x = 5,y = 16). Two regression lines are proposed: 
(1) y = 10 + x, and (2) y = 8 + 2x. Using the least- 
squares criterion, which of these regression lines is the 
better fit to the data? Why? 

15.6 Determine the least-squares regression line for the 
data in Exercise 15.5, then calculate the sum of the 
squared deviations between y and y. 
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15.7 A scatter diagram includes the data points (x = 3, 

y = 8), (x = 5, y = 18), (x = 7, y = 30), and 
(x = 9, y = 32). Two regression lines are proposed: 
(1) y = 5 + 3x, and (2) y = —2 + 4x. Using the least- 
squares criterion, which of these regression lines is the 
better fit to the data? Why? 

15.8 Determine the least-squares regression line for the 
data in Exercise 15.7, then calculate the sum of the 
squared deviations between y and y. 

15.9 For a sample of 8 employees, a personnel director has 
collected the following data on ownership of company 
stock versus years with the firm. 



Years 


y - Shares 


6 


300 


12 


408 


14 


560 


6 


252 


9 


288 


13 


650 


15 


630 


9 


522 



a. Determine the least-squares regression line and 
interpret its slope. 

b. For an employee who has been with the firm 10 years, 
what is the predicted number of shares of stock owned? 

15.10 The following data represent x = boat sales and 
y = boat trailer sales from 1995 through 2000. source: 
Bureau of the Census, Statistical Abstract of the United States 1999, p. 269 
and Statistical Abstract 2002, p. 755. 

Boat Sales Boat Trailer Sales 

Year (Thousands) (Thousands) 



1995 649 207 

1996 619 194 

1997 596 181 

1998 576 174 

1999 585 168 

2000 574 1 59 



a. Determine the least-squares regression line and inter- 
pret its slope. 

b. Estimate, for a year during which 500,000 boats are 
sold, the number of boat trailers that would be sold. 

c. What reasons might explain why the number of boat 
trailers sold per year is less than the number of 
boats sold per year? 

15.11 For the five top-grossing animated films of all time, 
the data in the table show gross ticket sales during the 



initial 2 weeks compared to total gross sales. Given these 
data, determine the least-squares equation for predicting 
total gross ticket sales and interpret its slope. What 
would be the estimated total gross for an animated film 
that had $100 million in ticket sales during the first 
2 weeks of its run? source: "Toy Story 2' on Big Box Office Track," 
USA Today, December 6, 1999, p. ID. 



Movie 


After 
2 Weeks 
($ millions) 


Total Gross 
Ticket Sales 
During Run 
(S millions) 


Lion King 


104.5 


312.9 


Aladdin 


40.2 


217.4 


Toy Story 


64.7 


191.8 


Tarzan 


77.6 


170.7 


A Bug's Life 


68.7 


162.8 



/ data SEt / Note: Exercises 15.12-15.14 require a 
computer and statistical software. 

15.12 For the highest-ranking law firms in terms of 
profit per partner, data file XR15012 lists the number 
of partners in each firm, along with the profit/partner 
for the reporting year. Given these data, determine the 
least-squares equation for predicting profit/partner on 
the basis of the number of partners, then interpret its 
slope. What would be the estimated profit/partner for 

a law firm with 120 partners? source: The Wall Street Journal 
Almanac 1999, p. 167. 

15.13 The Florida ecosystem is heavily reliant on wet 
and dry seasons, and is particularly dependent on the 
winter and spring rainfall that helps reduce damage due 
to wildfires. Data file XR15013 lists the number of acres 
burned and the number of inches of rainfall during 
January-May for 1988-2000. Given these data, 
determine the least-squares equation for predicting 
acreage burned during the January-May period as a 
function of rainfall during that period, then interpret its 
slope. What would be the estimated number of acres 
burned during January-May if there were 18 inches of 
rainfall during the same period? source: "Rainfall, Fires Linked 
In Florida," USA Today, June 9, 2000, p. 4A. 

15.14 For the leading health maintenance organization 
(HMO) firms, data file XR15014 lists the number of 
persons enrolled in the HMO along with the number of 
plans the firm administers. Given these data, determine 
the least-squares equation for predicting the total enroll- 
ment as a function of the number of plans administered. 
What would be the estimated total enrollment for an 
HMO firm that administered 20 plans? source: The Wall 
Street Journal Almanac 1999, p. 565. 
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INTERVAL ESTIMATION USING THE 
SAMPLE REGRESSION LINE 


15.3 



In the preceding example, we found that an applicant scoring x = 15 on the dex- 
terity test would be predicted to produce 64.2 units per hour. This is just a point 
estimate, however, and there will be some uncertainty as to how high or low the 
individual's productivity might actually be. Using the techniques of this section, 
we can draw some conclusions regarding the upper and lower limits within which 
this individual's actual productivity is likely to fall. 

The Standard Error of Estimate 

To develop interval estimates for the dependent variable, we must first determine 
the standard error of estimate, s y x . This is a standard deviation describing the 
dispersion of data points above and below the regression line. The formula for the 
standard error of estimate is shown below and is very similar to that for determin- 
ing a sample standard deviation, s. 

Standard error of estimate 



each observed value of y in 
the data 

the value of y that would 
have been estimated from 
the regression line 
the number of data points 

The number of degrees of freedom associated with s y x is n — 2, since both the 
y-intercept b Q and the slope b 1 are estimated from sample data. For the dexterity 
test example of Table 15.1 and Figure 15.3, the standard error of estimate can be 
calculated as shown below. In each row, the predicted value (y) is estimated by the 
regression line, y = 19.2 + 3. Ox, and the deviation between the observed and 
predicted values is the residual. Each residual is the vertical distance from the re- 
gression line to an observed data point. 







Number of Units/Hour 


Error, or 


Error, or 
Residual, 




Xj - Test 


Actual 


Predicted 


Residual, 


Squared, 


Employee 


Score 


w 


w 


(y,-W 


(y,-y,) 2 


A 


12 


55 


55.2 


-0.2 


0.04 


B 


14 


63 


61.2 


1.8 


3.24 


C 


17 


67 


70.2 


-3.2 


10.24 


D 


16 


70 


67.2 


2.8 


7.84 


E 


11 


51 


52.2 


-1.2 


1.44 



2(y ; - y,) 2 = 22.80 



s y. x = V n _ 2 where y, = 
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Having calculated the sum of the squared deviations (i.e., squared residuals) 
about the regression line as 2(y ; ~~ y,) 2 = 22.80, we may now calculate the 
standard error of estimate as 

.„ = = Vf^ = = 2.757 units/hour 

y- x V n-2 V 5 - 2 

This is the same value provided in the Excel and Minitab portions of Computer 
Solutions 15.1, shown previously. 

As with any standard deviation, larger values of the standard error of estimate 
reflect a greater amount of scatter in the data. If every data point in the scatter di- 
agram were to fall exactly on the regression line, the standard error of estimate 
would be zero. In this event, there would be no variability at all above and below 
the regression line. 



Confidence Interval for the Mean of y Given 
a Specific x Value 

Given a specific value of x, we can make two kinds of interval estimates regard- 
ing y: (1) a confidence interval for the (unknown) true mean of y, and (2) a pre- 
diction interval for an individual y observation. These applications are examined 
in this section and the next. 

For a given value of x, the best point estimate we can make about the mean 
of y is obtained from the regression line, or /ji yx = y = b 0 + b^x. However, the 
regression line itself is based on sample data, and b 0 is an estimate of the true y- 
intercept (jS Q ) and b 1 is an estimate of the true slope (jSj). As such, both b 0 and 
3c b 1 are subject to error. Estimates improve when they are based on an x value that 

is closer to x. [Remember that the regression line for a set of data always includes 
the point (x, y).] The following formula also considers the number of data 
points, since regression lines based on fewer data points (smaller n) tend to result 
in greater errors of estimation: 



Confidence interval for the mean of y, given a specific value of x: 



t(s y x )\j— + t }* ,^ UC ,„ , where y = the estimated value 



n (2*, ) _ (2*,) of y for the given value 

of x 

n = the number of data points 
s y x = the standard error of 
estimate 
t = t value for confidence 
level desired and 
df = n — 2 
x value = the given value 
of x 
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Confidence Interval 

For persons scoring x = 15 on the dexterity test, what is the 95% confidence in- 
terval for their mean productivity? 

SOLUTION 

The x value given is x = 15, and the standard error of estimate has been calcu- 
lated as s = 2.7 '57 units/hour. The correction factor (the square root symbol 
and the terms within) will end up being slightly larger than if we were making an 
interval estimate based on an x value closer to the mean of x. 

For x = 15, y = 19.2 + 3.0(15), or 64.2 units per hour. This will be the mid- 
point of the interval. The mean of x is x = 14.0, there are n = 5 data points, and 
the "^xf and 2*, terms have been calculated in Table 15.1 as 1006 and 70, re- 
spectively. For the 95% level of confidence and df = n — 2 = 3, t = 3.182 and 
the 95% confidence interval can now be calculated as 



t(s 



y.x 



1 (x value — x) 2 
n (5X 2 ) 



(2*,-) 2 /» 



Jl 15.0 

64.2 ± 3.182 2.757 V- + 

u 5 1006 



14.0)2 



- 64.2 ± 8.773V0.2385 

(70) 2 /5 

= 64.2 ± 8.773(0.488) = 64.2 ± 4.281, or between 59.919 and 68.481 

Based on these calculations, we have 95% confidence that the mean productivity 
for persons scoring x = 15 on the dexterity test will be between 59.919 and 
68.481 units per hour. 



Prediction Interval for an Individual y Observation 

For a given value of x, the estimation interval for an individual y observation is called 
the prediction interval. For the dexterity-test regression line, we'll demonstrate this 
concept by constructing a prediction interval for the productivity of an individual job 
applicant who has achieved a given score on the dexterity test. The formula is simi- 
lar to that just presented, except that a 1 is included beneath the square root symbol: 

Prediction interval for an individual y, given a specific value of x: 



estimated value of y 
for the given value of x 
number of data points 
the standard error of 
estimate 
t value for the 
confidence level 
desired and df = n — 2 
the given value of x 



- , / u L 1 (x value - x) 2 , 

y ± t(s vx )\ 1 + - + -— - where y = 

y - x V n (2>2) - (2>.) 2 /h 

n = 
V = 
t = 



x value = 
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Prediction Interval 

A prospective employee has scored x = 15 on the dexterity test. What is the 95% 
prediction interval for his productivity? 

SOLUTION 

For x = 15, y is estimated as y = 19.2 + 3.0(15), or y = 64.2. As in the preced- 
ing example, this will be the midpoint of the interval. Applying the prediction 
interval formula, the interval is described by 



1 (x value - x) 2 

tfOV 1 + ~ + 



1 15.0 - 14.0 2 

64.2 ± 3.182(2.757 A/1 + - + =r- 

5 1006 - (70) 2 /5 



= 64.2 ± 8.773VL2385 
= 64.2 ± 8.773(1.113) 

= 64.2 ± 9.764, or between 54.436 and 73.964 

For this applicant, we have 95% confidence that his productivity as an 
employee would be between 54.436 and 73.964 units per hour. The width of this 
interval is 19.528 [i.e., 2(9.764)]. 

For purposes of illustration, we have used the same dexterity test score (x = 15 ) 
in determining both the 95% confidence interval for the mean and the 95% pre- 
diction interval for the productivity of an individual. Given that x = 15, notice 
that the 95% confidence interval (64.2 ± 4.281 units per hour) is much narrower 
than the 95% prediction interval (64.2 ± 9.764 units per hour). This reflects that 
we will have less uncertainty about the mean productivity of all persons who 
score x = 15 on the dexterity test than about the productivity of one individual 
who has scored x = 15 on the dexterity test. 

Had the applicant scored x = 14 on the dexterity test, the prediction interval 
would be slightly narrower. This is because x = 14.0, and there is less error in 
making interval estimates based on x values that are closer to the mean. For ad- 
ditional practice, you may wish to repeat the preceding calculations for x = 14. 
If so, you will find the interval to be 61.2 ± 9.610, for a total interval width of 
2(9.610), or 19.220 units per hour. Figure 15.4 shows how the 95% prediction 
interval becomes wider whenever the specified value of x gets farther away from 
x in either direction. 



Computer Solutions 15.2 shows the procedures and results when we use Excel 
and Minitab to obtain confidence and prediction intervals for productivity when 
the score on the dexterity test is x = 15. In each case, we also get a point estimate 
for productivity. 
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Interval Estimation in Simple Linear Regression 



EXCEL 





A B 


C 


D 


1 


Prediction Interval 






2 










3 






Prod rate 




4 


Predicted value 


64.2 




5 










6 


Prediction Interval 






7 


Lower limit 


54.44 




8 


Upper limit 


73.96 




9 










10 


Interval Estimate of Expected Value 


11 


Lower limit 


59.92 




12 


Upper limit 


68.48 





Excel Interval Estimation in Simple Linear Regression 

1 . Using file CX15DEX.XLS, with dexterity test scores in A2:A6, production rates in B2:B6, and the labels in A1 and B1, we 
will obtain interval estimates associated with x = 1 5: Select a convenient cell (e.g., C3) and enter 15 into C3. 

2. Click Tools. Click Data Analysis Plus. Select Prediction Interval. Click OK. 



{continued) 
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3. Enter B1:B6 into the Input Y Range box. Enter A1:A6 into the Input X Range box. Click Labels. Enter C3 into the 
Given X Range box. Enter the desired confidence level as a decimal fraction (e.g., 0.95) into the Confidence Level 

(1 - Alpha) box. Click OK. When x = 15, the point estimate fory is 64.2, and the 95% prediction and confidence 
intervals will be as shown above. 



MINITAB 

Minitab Interval Estimation in Simple Linear Regression 

Predicted Values for New Observations 
New 

Obs Fit SE Fit 95% CI 95% PI 

1 64.20 1.35 (59.92, 68.48) (54.44, 73.96) 

Values of Predictors for New Observations 
New 

Obs Dexscore 
1 15.0 

Using file CX15DEX.MTW, follow the procedure in step 1 of Computer Solutions 15.1, but select Options. Enter 15 into 
the Prediction intervals for new observations box. Enter the desired confidence level as a percentage (e.g., 95) into 
the Confidence level box. Click OK. Click OK. In addition to the printout in Computer Solutions 1 5.1, Minitab will gen- 
erate the items shown above. When x = 1 5, the point estimate fory is 64.20, and the confidence and prediction intervals 
will be as shown here. 



exercises 




15.15 What is the standard error of estimate, and what 
role does it play in simple linear regression and 
correlation analysis? 

15.16 Is it possible for the standard error of estimate to be 
equal to zero? If so, under what circumstances? 

15.17 Differentiate between a confidence interval and a 
prediction interval. 

15.18 For a given value of x, which will be wider, a 95% 
confidence interval or a 95% prediction interval? Explain. 

15.19 What happens to the width of a prediction interval 
for y as the x value on which the interval estimate is 
based gets farther away from the mean of x? Why? 

15.20 For a set of 8 data points, the sum of the squared 
differences between observed and estimated values of y is 
34.72. Given this information, what is the standard error 



c. Construct the 95% confidence interval for the mean 
of y when x = 7.0. 

d. Construct the 95% confidence interval for the mean 
of y when x = 9.0. 

e. Compare the width of the confidence interval 
obtained in part (c) with that obtained in part (d). 
Which is wider and why? 

15.22 For the summary data provided in Exercise 15.21, 
construct a 95% prediction interval for an individual y 
value whenever 

a. x = 2. b. x = 3. c. x = 4. 

15.23 For the National Football League, ratings for the all- 
time leading passers were as shown below. Also shown for 
each quarterback is the percentage of passes that were in- 
terceptions, along with the percentage of passes that were 
touchdowns, source: World Almanac and Book of Facts 2006, p. 930. 



of estimate? 


Player 


Rating 


TD% 


lnter% 


15.21 For n = 6 data points, the following quantities 
have been calculated: 


Steve Young 
Kurt Warner 


96.8 
95.7 


5.6 
5.5 


2.6 
3.5 


2x = 40 2y = 76 ^xy = 400 

£x 2 = 346 2/ = 1160 2(y ~y) 2 = 52.334 


Daunte Culpepper 
Peyton Manning 
Joe Montana 


93.2 
92.3 
92.3 


5.4 
5.6 
5.1 


3.1 
3.1 

2.6 


a. Determine the least-squares regression line. 


Trent Green 


87.9 


4.7 


2.9 


b. Determine the standard error of estimate. 


Tom Brady 


87.5 


4.8 


2.6 
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a. Determine the least-squares regression line for 
estimating the passer rating based on the percentage 
of passes that were touchdowns. 

b. A quarterback says 5.0% of his passes will be 
touchdowns next year. Assuming this to be true, 
estimate his rating for the coming season. 

c. Determine the standard error of estimate. 

d. For the quarterback in part (b), what is the 95% pre- 
diction interval for his passer rating next year? 

e. For quarterbacks similar to the one in part (b), what 
is the 95% confidence interval for their mean rating 
next year? 

15.24 Repeat parts (a)-(e) of Exercise 15.23, but using 
the interception percentage as the independent variable, 
and for a quarterback who claims his interception 
percentage will be just 3.0% next year. 

15.25 For the data in Exercise 15.9, find the standard 
error of estimate, then construct the 95% prediction 
interval for the amount of stock owned by an individual 
employee who has been with the firm for 5 years. 

15.26 For the data in Exercise 15.10, find the standard 
error of estimate, then construct the 95% prediction inter- 
val for boat trailer sales during a year in which the number 
of boats sold is 500,000. 

/ data set/ Note: Exercises 15.27-15.30 require a 
computer and statistical software. 



15.27 Information describing 98 of the top automobile 
dealership groups in the United States is provided in com- 
puter database file DEALERS. For calendar year 1999, the 
data include the number of dealers, the average number of 
retail units sold per dealer, and the total revenue for the 
dealership group. Determine the regression equation 
estimating total revenue as a function of the number of 
dealers. For a dealership group consisting of 100 dealers, 
determine and interpret the 95% confidence and 
prediction intervals associated with total revenue. 

source: Automotive News, May 1 , 2000, pp. 54-60. 

15.28 Repeat Exercise 15.27, but with average retail 
units sold per dealer as the independent variable and with 
the 95% interval estimates assuming a dealer group for 
which the average number of retail units sold per dealer 
is 1000. 

15.29 Given the information in Exercise 15.13, 
determine and interpret the 90% confidence and predic- 
tion intervals associated with burned acreage in Florida 
when the January-May rainfall in the state is 20 inches. 

15.30 Given the information in Exercise 15.12, 
determine and interpret the 90% confidence and predic- 
tion intervals associated with profit per partner when 
there are 150 law partners. 



CORRELATION ANALYSIS 1 5.4 



Regression analysis determines the nature of the linear relationship between two 
interval- or ratio-scale variables, while correlation analysis measures the strength 
of the linear relationship between them. In particular, correlation analysis pro- 
vides us with two important measures of this strength: (1) the coefficient of corre- 
lation and (2) the coefficient of determination. 



The Coefficient of Correlation 

Briefly introduced in Chapter 3, the coefficient of correlation (r, with — 1 <rs + 1 ) 
is a number that indicates both the direction and the strength of the linear 
relationship between the dependent variable (y) and the independent variable (x): 

• Direction of the relationship If r is positive, y and x are directly related — 
i.e., when x increases, y will tend to increase. If r is negative, y and x are in- 
versely related — i.e., when x increases, y will tend to decrease. 

• Strength of the relationship The larger the absolute value of r, the stronger the 
linear relationship between y and x. If r = — 1 or r = +1, the regression line will 
actually include all of the data points and the line will be a perfect fit. 
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Calculating the coefficient of correlation for a set of data involves combining 
the same terms that appear in Table 15.1. The formula for r can be expressed as 
follows: 

The coefficient of correlation: 



where r = coefficient of 



V "(2*?) - (2*,.)2 • Vn(2y2) - ( 2y .)2 correlation, 

-1 < r < +1 
« = number of 
data points 

Referring to the dexterity test example, the coefficient of correlation between 
productivity (y) and dexterity test score (x) can be computed as 

«(S*,-y<) - (2*,-)(2yi) 



5(4362) - (70)(306) 390 

= , = = o 9546 

V5(1006) - (70)2 • V5(18,984) - (306)2 11.402 • 35.833 

The coefficient of correlation (r = 0.9546) is positive, reflecting that produc- 
tivity (y) is directly related to dexterity test score (x). In other words, persons scor- 
ing higher on the dexterity test tend to record higher levels of productivity. This is 
also reflected in the positive slope of the regression line, y = 19.2 + 3. Ox. Using 
Seeing Statistics Applet 19, at the end of the chapter, you can insert data points 
onto a blank grid and instantly see how the correlation coefficient and best-fit 
regression line change in response. 



The Coefficient of Determination 

Another measure of the strength of the relationship is the coefficient of determi- 
nation, r 2 . Its numerical value is the proportion of the variation in y that is ex- 
plained by the regression line, y = b 0 + b t x. For the dexterity test example, r = 
0.9546, r 2 = 0.911, and 91.1% of the variation in productivity is explained by 
the dexterity test scores. 

The coefficient of determination can be described in terms of the total varia- 
tion in y versus the unexplained variation in y. In turn, each type of variation can 
be described as a sum of squared deviations. In the denominator of the fraction in 
the following expression, each y value is compared to the mean of y in arriving at 
the total sum of squares (SST). In the numerator, each y value is compared to the 
value of y predicted by the regression line in obtaining the unexplained, or error, 
sum of squares (SSE): 

The coefficient of determination, r 2 : 

2 _ Error variation, which is not explained by the regression line 
Total variation in the y values 

S(y,- - y,) 2 _ _sse 
2(y ; - y) 2 SST 
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In considering the variation of y, a natural question is, "Variation from 
what?" The logic involved in the calculations of SST and SSE warrants further 
explanation: 

1. Without any knowledge of x, the best estimate of y is the mean of y. This is 
why the variation of y from y is used in calculating SST = total sum of 
squares, or total variation. 

2. When knowledge about x is available, the best estimate of y is the conditional 
mean of y (i.e., y from the regression line). When an actual value of y differs 
from the value predicted by the regression line, this constitutes unexplained 
variation and it contributes to SSE = error sum of squares. 

The error variation (SSE) is the sum of the squared residuals calculated earlier 
during our determination of the standard error of estimate. These calculations 
are repeated below, along with those for determining the total sum of squares 
(SST) for the y values. In the following calculations, recall that the mean of y is 
y = 61.2 units per hour: 

Number of Units/Hour 



x ( - = Test 
Score 


Actual, 

y,- 


Predicted, 

y,- 


y- - 9; 


(y,-y,) 2 


y,-y 


(y, - y) 2 


12 


55 


55.2 


-0.2 


0.04 


-6.2 


38.44 


14 


63 


61.2 


1.8 


3.24 


1.8 


3.24 


17 


67 


70.2 


-3.2 


10.24 


5.8 


33.64 


16 


70 


67.2 


2.8 


7.84 


8.8 


77.44 


1 1 


51 


52.2 


-1.2 


1.44 


-10.2 


104.04 










22.80 




256.80 



s(y,- - y,) 2 . error . 

or residual, sum of 
squares (SSE) 



2(y, - y) 2 , total 
sum of squares (SST) 



and the coefficient of determination can be expressed as 

, „ SSE 22.80 

r 1 = 1 =1 

SST 256.80 



0.911 



Total variation in y (SST) is the sum of the variation explained by the regres- 
sion line (SSR) plus the variation not explained by the regression line (SSE), or 



Total variation 
in y values 
(SST) 



Variation explained 
by regression line 
(SSR) 



Variation not explained 
by regression line 
(SSE) 



We have calculated the total sum of squares as SST = 256.80 and the error 
sum of squares as SSE = 22.80. The variation explained by the regression line 
(SSR) is the difference between these two quantities, and SSR = SST — SSE, or 
256.80 — 22.80, or SSR = 234.0. All three of these quantities are typically in- 
cluded in the output whenever regression and correlation analyses are carried out 
with the assistance of a computer statistical package. You can use Seeing Statistics 
Applet 20, at the end of the chapter, to instantly see how explained and unex- 
plained variation will change as you "try" different lines to fit the data in this 
chapter's primary example. 
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Some statistical software packages — Excel and Minitab included — provide an 
r 1 that has been "adjusted" for degrees of freedom. The "adjusted" r 1 approaches 
the size of the "unadjusted" value as the number of data points becomes greater. 
For our productivity example, the printouts in Computer Solutions 15.1 show 
the adjusted r 1 as 0.882. The formula for obtaining the adjusted r 1 is given in the 
footnote. 1 

Computer Solutions 15.3 shows the procedures and results when we use Excel 
and Minitab to obtain the coefficient of correlation between dexterity test score (x) 
and productivity (y). The procedures are comparable to their counterparts pre- 
sented earlier in the text, in Computer Solutions 3.5. Minitab provides a p-value 
for the coefficient of correlation; if the true coefficient of correlation were 0, there 
would be a 0.012 probability of getting a sample r this far away from 0 just by 
chance. Testing whether the true coefficient of correlation could be 0 is discussed 
in Section 15.5. 



computer solutions 1 5.3 



Coefficient of Correlation 
EXCEL 





A 


B 


C 


D 


E 


1 


Dexscore 


Prodrate 




Dexscore 


Prodrate 


2 


12 


55 


Dexscore 


1 




3 


14 


63 


Prodrate 


0.955 


1 


4 


17 


67 








5 


16 


70 








6 


11 


51 









1 . Open the data file CX15DEX.XLS. The labels and data are already entered, as shown in columns A and B in the print- 
out above. Click Tools. Click Data Analysis. 

2. In the Analysis Tools box, click Correlation. Click OK. Enter A1:B6 into the Input Range box. In the Grouped By 
section, click next to Columns. Select Labels in First Row. Enter C1 into the Output Range box. Click OK. 



MINITAB 

Correlations: Dexscore, Prodrate 

Pearson correlation of Dexscore and Prodrate = 0.955 
P-Value = 0.012 

1 . Open the data file CX15DEX.MTW. The dexterity test scores are in C1 and the production rates are in C2. 

2. Click Stat. Select Basic Statistics. Click Correlation. Enter C1 and C2 into the Variables box. Select Display P-values. 
Click OK. 



Expressed as a proportion, the adjusted r 1 typically included in computer printouts of regression 

, - „ SSEI(n - 2) 

analyses is calculated as adjusted r z = 1 . 

SST/(n — 1) 
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15.31 Differentiate between the coefficients of correlation 
and determination. What information does each one offer 
that the other does not? 

15.32 The values of y and x are inversely related, and 
64% of the variation in y is explained by the regression 
equation. What is the coefficient of correlation? 

15.33 The coefficient of correlation between variables y 
and x is —0.90. Calculate and interpret the value of the 
coefficient of determination. 

15.34 For the data in Exercise 15.23, and with y = 
touchdown percentage and x = interception percentage, 
determine and interpret the coefficients of correlation and 
determination. 

15.35 In regression and correlation analysis, what 
are the two components of the total amount 

of variation in the y values? Define the coefficient of 
determination in terms of these components. 

15.36 For a set of data, the total variation or sum of 
squares for y is SST = 143.0, and the error sum of 
squares is SSE = 24.0. What proportion of the variation 
in y is explained by the regression equation? 

15.37 The ratings below are based on collision claim 
experience and theft frequency for 12 makes of small, 
two-door cars. Higher numbers reflect higher claims 
and more frequent thefts, respectively, source: "Insurance 
Ratings for Popular Cars," Consumers Digest, January/February 1990, 
p. 69. 



Year 



x = U.S. Production 
of Motor Vehicles 



y = Tons of 
Domestic Steel 



Collision 


Theft 


Collision 


Theft 


103 


103 


106 


97 


97 


113 


139 


425 


105 


81 


110 


82 


115 


68 


96 


81 


127 


90 


84 


59 


104 


79 


105 


167 



a. Determine the least-squares regression line for 
predicting the rate of collision claims on the basis 
of theft frequency rating. 

b. Calculate and interpret the values of r and r 1 . 

c. If a new model were to have a theft rating of 110, 
what would be the predicted rating for collision 
claims? 

15.38 The following data show U.S. production of motor 
vehicles versus tons of domestic steel shipped for motor 
vehicle manufacture, source: World Almanac and Book of Facts 
2006, p. 1 1 3; Bureau of the Census, Statistical Abstract of the United 
States 2006, p. 661. 



2000 
2001 
2002 
2003 
2004 



12.83 million 
1 1.52 
12.33 
12.15 
12.02 



1 6.06 million 

14.06 

14.00 

15.88 

13.86 



a. Determine the least-squares regression line and 
calculate r. 

b. What proportion of the variability in steel shipments for 
motor vehicles is explained by the regression equation? 

c. During a year in which U.S. production of motor 
vehicles is 12.0 million, what would be the prediction 
for the number of tons of domestic steel used for 
vehicle production? 

15.39 The accompanying data show the U.S. population 
versus shipments of socks, source: National Association of Hosiery 
Manufacturers, 1 998 Annual Hosiery Statistics, p. 27. 



x = U.S. 



y - Shipments 



Year 


Population 


of Socks 


1994 


260.7 million 


2927.2 million pairs 


1995 


266.1 


3002.4 


1996 


268.7 


3217.3 


1997 


271.3 


3410.5 


1998 


272.9 


3419.2 



a. Determine the least-squares regression line and 
calculate r. 

b. What proportion of the variability in sock shipments 
is explained by the regression equation? 

c. During a year when the U.S. population is 280 
million, estimate shipments for socks. 

/ data Set / Note: Exercises 15.40-15.43 require a 
computer and statistical software. 

15.40 Given the information in Exercise 15.13, 
determine and interpret the coefficients of correlation 
and determination for January-May burned acreage in 
Florida versus January-May rainfall in the state. 

15.41 Given the information in Exercise 15.12, 
determine and interpret the coefficients of correlation and 
determination for profit per partner versus the number of 
law partners. 

15.42 For each of 10 popular prescription drugs, file 
XR15042 lists the retail price (in U.S. dollars) for the drug 
in several different countries, including the United States, 
Canada, Great Britain, and Australia. Determine and 
interpret the coefficients of correlation and 
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determination for U.S. prices versus Canadian prices. 
source: "Prescription Drugs Cheaper in Other Nations," USA Today, 
November 10, 1999, p. 2A. 

15.43 For each of the 20 top-grossing U.S. films made 
during the 1990s, file XR15043 lists the domestic gross 
ticket sales versus foreign gross ticket sales (both in mil- 



lions of U.S. dollars). Determine the linear regression 
equation relating the foreign gross as a function of the 
domestic gross. Identify and interpret the coefficients of 
correlation and determination, source: "Overseas Grosses 
Outdoing North America," USA Today, December 13, 2000. 



1 5.5 ESTIMATION AND TESTS REGARDING 

THE SAMPLE REGRESSION LINE 



The coefficient of correlation (r) is based on sample data and estimates the population 
coefficient of correlation, p. A high absolute value for r may be due merely to chance, 
especially if the number of data points is small. Likewise, the slope of the sample 
regression line is an estimate of the population slope (/Sj). If /3j = 0, the population 
regression line is horizontal and there is no linear relationship between the variables. 

Testing H 0 : p = 0 is equivalent to testing H Q : /3 1 = 0. If one null hypothesis is 
true, there is no linear relationship between the variables and the other null 
hypothesis will also be true. The tests have the same p-value. 

Testing the Coefficient of Correlation 

In this test for the significance of the linear relationship, the null and alternative 
hypotheses pertain to the population coefficient of correlation, p: 

t-test for the population coefficient of correlation, p: 

• Null hypothesis 

H 0 : p = 0 There is no linear relationship. 

• Alternative hypothesis 

Hj: p ¥= 0 A linear relationship exists. 

• Test statistic 

sample coefficient 
of correlation 
number of data points 
t value for the level 
of significance of the 
test and df = n — 2 



where r 

n 
t 



example 



Testing the Coefficient of Correlation 

For the employee dexterity test data, the coefficient of correlation was r = 
0.9546. At the 0.05 level of significance, test the null hypothesis that the popula- 
tion coefficient of correlation (p) is really zero. 
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SOLUTION 

For n = 5 persons in the dexterity test example, with r = 0.9546, the test 



statistic is 



°-' 54<5 



(0.9546)^ 



For a two-tail test at the 0.05 level and df = 5 — 2, or 3, the critical values are 
t = -3.182 and t = +3.182. The calculated test statistic (t = 5.55) falls outside 
these critical values, and the null hypothesis is rejected. At the 0.05 level, the sam- 
ple coefficient of correlation differs significantly from zero, and we conclude that 
a linear relationship could exist between the dexterity score and the production 
rate. If the population coefficient of correlation were really zero, there is less than 
a 5% chance that a sample coefficient of correlation would be this large. 

As shown in Computer Solutions 15.3, Minitab provides the coefficient of 
correlation along with the p-value for the test described here. The exact p-value is 
0.012, considerably less than the 0.05 level of significance used in the test we've just 
carried out. 



Testing and Estimation for the Slope 

An equivalent method of testing the significance of the linear relationship is to ex- 
amine whether the slope (y8j ) of the population regression line could be zero. 

£-test for the slope (jSJ of the population regression line: 

• Null hypothesis 

Hq. f3 1 = /3 10 The population slope is /3 10 . 

• Alternative hypothesis 

H^. /3 1 ¥= j8 10 The population slope is not /3 1Q . 

• Test statistic 

b 1 — /3 10 

t = where b 1 = slope of the sample regression line 

s b 1 j8 10 = a value that has been hypothesized 

for the slope of the population 
regression line 

with 

s 

s b = s b = the estimated standard deviation of 

1 V 2x 2 - „(*)2 i the slope 

s yx = the standard error of estimate 
n = number of data points 
t = t value for the level of significance 
of the test, and df = n — 2 



The Beta Coefficient and Investment Decisions 



Beta, the second letter in the Greek alphabet, ascends to 
the number one position in the alphabet of financial deci- 
sions. In finance, million dollar investments are made with 
the assistance of the Capital Asset Pricing Model (CAPM), a 
mathematical model in which beta plays a promi- 
nent role. 

The CAPM is a lot easier to understand than its name 
might imply. This is because it is basically a simple linear re- 
gression model. For a given stock, and for / = 1 through n 
time periods, the model is 

y, = p Q + p,x, + €; 

where y ( . = rate of return for the stock during 
the /th time period 
Xj = rate of return for the overall stock 
market during the /th time period 

For the stock being evaluated, the slope of the regres- 
sion line (i.e., /3 V the beta coefficient) is a measure of how 
risky the stock is in terms of having volatile swings in return 



versus the market as a whole. A stock with j8 1 = 1 is con- 
sidered to be an average risk, while f3 } < 1 and p } > 1 re- 
flect more conservative and more risky stocks, respectively. 

Finance managers first set guidelines on how much risk 
they are willing to take in making their investments. The 
CAPM is then used in selecting individual investments that 
represent a mixture of risk levels, as represented by their 
beta coefficients, but for which the total amount of risk 
falls within the guidelines originally specified. 

For a given stock, the slope of the sample regression 
equation {bj is used as an estimate of its population coun- 
terpart (fij, and estimation and testing for jS 1 can be done 
on the basis of the / = 1 through n time periods for which 
sample data have been collected. This is similar to the ap- 
proach discussed within the chapter. 

Source: For more information on the beta coefficient and the Capital Asset 
Pricing Model, interested readers may wish to refer to finance textbooks 
such as J. Fred Weston and Thomas E. Copeland, Managerial Finance, 
9th ed. (Hinsdale, III.: The Dryden Press, 1992). 



Most typically, we are interested in testing whether [3 1 could be equal to zero, 
and the numerator of the test statistic will simply be (b 1 — 0). In this case, the 
hypothesized value will be /3 10 = 0. However, the method described above can 
also be used to test whether the slope of the population regression line might be 
any specific, nonzero value. (For example, in the Capital Asset Pricing Model dis- 
cussed in Statistics in Action 15.1, an investor may wish to test whether the slope 
differs significantly from 1.0.) 

An important component of the preceding test is the estimated standard devi- 
ation of the sample slope b v This is because s b can also be used in the construc- 
tion of a confidence interval for f3 v 

Confidence interval for the slope (jS^ of the population regression line: 
The interval is 

b 1 ± ts h where b 1 = slope of the sample regression line 

t = t value for the confidence level desired, 
and df = n — 2 
s, = the estimated standard deviation 

s 

of the slope, s h = , y 

1 V^x2 - w(*)2 
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Testing and Estimation for the Slope 

For the dexterity test data, the slope of the sample regression line was b 1 = 3.0. 

1. Using the 0.05 level of significance, examine whether the slope of the popula- 
tion regression line could be zero. 

2. Construct the 95% confidence interval for the slope of the population regres- 
sion line. 

SOLUTION 

Testing H 0 : j8 1 = 0 versus H,: (i, + 0 

For the n = 5 persons in the dexterity test example, we have already calculated 
s = 2.7 '57, 2*, 2 = 1006, and x = 14.0. The standard deviation of the slope 
will be 



y.x 



2.757 



1 V^%2 - n ( x )2 Vl006 - 5(14.0)2 
and the observed value of the test statistic is 



0.5407 



Observed t 



b 1 -Q _ 3.0 - 0 
s. ~ 0.5407 



5.55 



For the 0.05 level of significance and df = 5 — 2 = 3, the critical values of t 
are —3.182 and +3.182, just as they were for the test of the coefficient of corre- 
lation. Notice that the calculated test statistic (t = 5.55) is also equal to the t 
statistic calculated during that test. As pointed out earlier, the tests are equivalent 
and yield the same conclusions and p-values. At the 0.05 level, the test statistic is 
outside the critical values, and we are able to reject the null hypothesis that the 
slope of the population regression line could be zero. In Computer Solutions 15.1, 
the exact p-value is 0.012 — see the "Dexscore" row of either printout. 

95% Confidence Interval for the Slope of the 
Population Regression Line 

The 95% confidence interval is centered on the slope of the sample regression line, 
b 1 = 3.0, and s h has already been calculated as 0.5407. For n = 5 data points, 
the number of degrees of freedom for the t distribution will be df = n — 2, or 
df = 3. Referring to the t distribution table, t = 3.182 will correspond to the 
95% confidence interval: 

b x ± ts bi = 3.0 ± 3.182(0.5407), or from 1.279 to 4.721 

We have 95% confidence that the slope (/3j) of the population regression line 
is in the interval bounded by 1.279 and 4.721. This interval is also provided by 
Excel in Computer Solutions 15.1 — see the "Dexscore" row of the printout. 
Since 0 is not within the 95% confidence interval, we can conclude, at the 
0.05 level, that a linear relationship might exist between productivity and the dex- 
terity test score. 
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The Analysis of Variance Perspective 

Finally, we can carry out an analysis of variance test based on the total amount of 
variation in y (SST) and the amount explained by the regression line (SSR). This 
test is equivalent to those for the coefficient of correlation and the slope, con- 
ducted earlier, and its results are usually given when a computer statistical pack- 
age is used in carrying out the analysis. 

The test is based on the sum-of-squares values that were discussed in the con- 
text of the coefficient of determination, or 

Total Variation Variation explained Variation not explained 
in y values = by regression line + by regression line 
{SST) (SSR) (SSE) 

- y) 1 SSR = SST - SSE 2(y. - yf 



ANOVA test for the significance of a simple linear regression: 

• Null and alternative hypotheses 

Hq. There is no linear relationship between x and y. 
FL: A linear relationship exists between x and y. 

• Test Statistic 

SSR/ 1 

Calculated F = — where SSR = sum of squares explained 

' * ' by the regression line 

SSE = error, or residual, sum of 
squares 
n = number of data points 

• Decision Rule Reject H 0 if calculated F exceeds the critical F for 
the level of significance of the test, with df (numerator) = 1 and df 
(denominator) = n — 2. 



In the numerator of the calculated F-ratio, SSR is divided by 1, the number of 
degrees of freedom associated with the regression equation. In the denominator, 
SSE is divided by (n — 2), the number of degrees of freedom for the error sum of 
squares. For the dexterity test example, our calculations in Section 15.4 showed 
that SST = 256.80, SSR = 234.0, and SSE = 22.80. We now substitute the SSR 
and SSE values into the expression for the calculated F-ratio: 

r i w at, SSR/1 234.0/1 
CakuIated = SSE/(n - 2) = 22.80/(5 - 2) = ^ 

For df (numerator) = 1 and df (denominator) = (n — 2) = 3, the critical 
value of F for the 0.025 level is 17.44, while the critical F for the 0.01 level is 
34.12. The calculated value (F = 30.79) exceeds the critical value for the 0.025 
level but does not exceed that for the 0.01 level. Using the F-distribution tables, 
the most accurate statement we can make about the p-value for the test is that it 
is between 0.025 and 0.01. From Computer Solutions 15.1, the exact p-value is 
0.012 — see the ANOVA portion of either printout. 
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15.44 Testing the null hypothesis that the slope of the true 
regression line equals zero is equivalent to testing whether 
the true coefficient of correlation could be zero. Why? 

15.45 For n = 15 data points, r 2 = 0.81. At the 0.05 
level of significance, can we conclude that the true coeffi- 
cient of correlation could be zero? 

15.46 For n = 37 data points, r 2 = 0.29. At the 0.02 
level of significance, can we conclude that the true coeffi- 
cient of correlation could be zero? 

15.47 Based on sample data, the 90% confidence interval 
for the slope of the population regression line is found to 
be from —2.5 to 1.4. Based on this information, what is 
the most accurate statement that can be made about the 
p-value in testing H Q : p 1 = 0 versus H v - fi 1 + 0? Explain. 

15.48 Based on sample data, the 95% confidence interval 
for the slope of the population regression line is found to 
be from 4.2 to 6.7. Based on this information, what is 
the most accurate statement that can be made about the 
p-value in testing H Q : p 1 = 0 versus H y - f) 1 + 0? Explain. 

15.49 For the regression line developed in Exercise 15.37, 

a. Use the 0.05 level in testing whether the population 
coefficient of correlation could be zero. 

b. Use the 0.05 level in testing whether the population 
regression equation could have a slope of zero. 

c. Construct the 95% confidence interval for the slope of 
the population regression equation. 

15.50 For the regression line developed in Exercise 15.38, 

a. Use the 0.05 level in testing whether the population 
coefficient of correlation could be zero. 

b. Use the 0.05 level in testing whether the population 
regression equation could have a slope of zero. 

c. Construct the 95% confidence interval for the slope of 
the population regression equation. 

15.51 A computer analysis of 30 pairs of observations 
results in the least-squares regression equation 

y = 14.0 + 5. Ox, and the standard deviation of the slope 
is listed as s h ^ = 2.25. 

a. At the 0.05 level of significance, can we conclude that 
no linear relationship exists within the population 

of x and y values? 

b. Given the results in part (a), what conclusion would 
be reached if the same level of significance were used 
in testing whether the population coefficient of 
correlation could be zero? 

c. Construct the 95% confidence interval for the 
population slope, f) v 

15.52 How is analysis of variance related to regression 
analysis? 



15.53 In a regression analysis, the sum of the squared devi- 
ations between y and y is SST = 120.0. If the coefficient of 
correlation is r = 0.7, what are the values of SSE and SSR? 

15.54 In a regression analysis, the sum of the squared 
deviations between y and y is SST = 200.0. If the sum of 
the squared deviations about the regression line is SSE = 
40.0, what is the coefficient of determination? 

15.55 Given the information below, calculate SST, SSR, 
and SSE, determine the coefficient of determination, then 
use ANOVA and the 0.05 level in testing whether r 2 is 
significantly different from zero. 

x: 5 4 10 9 10 Least-squares equation: 
y: 34 44 65 47 66 y = 20.66 + 4.02x 

15.56 Given the information below, calculate SST, SSR, 
and SSE, determine the coefficient of determination, then 
use ANOVA and the 0.05 level in testing whether r 2 is 
significantly different from zero. 



x: 

y: 



381 



1 4 6 Least-squares equation: 
403 394 385 y = 404.94 - 3.78x 



/ data set/ Note: Exercises 15.57-15.59 require a 
computer and statistical software. 

15.57 The General Aviation Manufacturers Association 
has reported annual flying hours and fuel consumption 
for airplanes with a single, piston-driven engine as listed 
in file XR15057. Data are in millions of flying hours and 
millions of gallons of fuel, respectively. Determine the lin- 
ear regression equation describing fuel consumption as a 
function of flying hours, then identify and interpret the 
slope, the coefficient of correlation, and the coefficient of 
determination. At the 0.05 level of significance, could the 
population slope and the population coefficient of corre- 
lation be zero? Determine the 95% confidence interval 
for the population slope, source: General Aviation Manufacturers 
Association, 1 999 Statistical Databook, p. 25. 

15.58 Given the information in Exercise 15.13, 
determine the linear regression equation describing 
January-May burned acreage in Florida as a function of 
January-May rainfall in the state, then identify and inter- 
pret the slope, the coefficient of correlation, and the coef- 
ficient of determination. At the 0.10 level of significance, 
could the population slope and the population coefficient 
of correlation be zero? Determine the 90% confidence 
interval for the population slope. 

15.59 Computer database GROWCO describes the charac- 
teristics of 100 companies identified by Fortune as the 
fastest growing. Two of the variables listed are revenue 
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and net income (millions of dollars) for the most recent 
four quarters. Determine the linear regression equation 
describing net income as a function of revenue, then iden- 
tify and interpret the slope, the coefficient of correlation, 
and the coefficient of determination. At the 0.05 level of 



significance, could the population slope and the popula- 
tion coefficient of correlation be zero? Determine the 
95% confidence interval for the population slope, source: 
"Fortune's 100 Fastest-Growing Companies," Fortune, September 4, 2000, 
pp. 142-158. 



1 5.6 ADDITIONAL TOPICS IN REGRESSION 

AND CORRELATION ANALYSIS 



Residual Analysis 

If you wish, you can have your computer statistical package or spreadsheet list the 
residuals (observed minus estimated values of y) along with the regression print- 
out. For example, Table 15.2 is Minitab's listing of the residuals for the dexterity 
data we've been examining throughout the chapter. It was generated using the 
Minitab procedure in Computer Solutions 15.4, which we will present shortly. 

For each of the 5 data points in the dexterity example, Table 15.2 provides ob- 
served and estimated ("fitted") values of y, along with the difference between them 
(the residual). In Table 15.2, the residuals have also been "standardized," indicat- 
ing how many standard deviation multiples each represents. The standard devia- 
tions for the estimated y values are listed in the "SE Fit" column. Note that the 
value differs from one row to the next — this is because estimates become more un- 
certain when they are based on x values that are farther away from the mean of x. 

Besides simply listing the residuals, computer packages can readily provide 
an analysis of them. For example, some data points may be flagged as "unusual 
observations," or "outliers." These points are relatively distant from the regres- 
sion line, and their distance may be a reflection of unique characteristics possessed 
by the sample members involved. For example, a person whose productivity is 
extremely high may have done poorly on the dexterity test because he had a mi- 
graine headache on the test day. If the discrepancy between his observed and esti- 
mated productivity scores were great enough, he would be identified as an outlier. 

Perhaps most important, residual analysis can also help us determine whether 
the assumptions of the regression model have been met. These assumptions 



TABLE 15.2 



Minitab summary of the 
residuals and standardized 
residuals for the dexterity 
regression analysis using the 
procedure in Computer 
Solutions 1 5.4 



The standard error of fitted values for this particular 
value of Dexscore. This gets larger whenever the 
value of Dexscore is farther away from its mean. 



Obs 
1 
2 
3 
4 
5 



Observed 
Productivity 



Dexscore 
12.0 
14.0 
17.0 
16.0 
11.0 



t 

Prodrate 
55.00 
63.00 
67.00 
70.00 
51.00 



Estimated 
Productivity 



t 

Fit 

55.20 
61.20 
70.20 
67.20 
52.20 



Y 

SE Fit 
1.64 
1.23 
2.04 
1.64 
2.04 



Standardized Residuals 



Observed 
Minus 
Estimated 
Productivity 



Y 

Residual 
-0.20 

1.80 
-3.20 

2.80 
-1.20 



Y 

St Resid 
-0.09 

0.73 
-1.72 

1.26 
-0.65 
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(in terms of the errors or residuals) and the methods by which they can be exam- 
ined are as follows: 



a. 



Assumption 1: The population of e values is normally distributed with a mean of 
zero. 

Construct a histogram of the y — y values, or residuals. The histogram 
should be nearly symmetric, centered near or at zero, and have a shape that is 
approximately normal. 

b. Use the computer to test for normality of the residuals. Whether the number of 
data points is small or large, we can use the Kolmogorov-Smirnov test for nor- 
mality, introduced in Chapter 14. If the number of data points is sufficiently large 
(n > 32), Excel and the chi-square goodness-of-fit test for normality of Chapter 
13 can be applied. 

Excel and Minitab can also generate a normal probability plot for the residuals, 
another method introduced in Chapter 14. To the extent that the residuals are 
from a normal distribution, the points in the normal probability plot should fall 
approximately along a straight line. 

c. Plot the residuals versus the x values. The points should be distributed about the 
horizontal line at zero, as shown in part (a) of Figure 15.5. 

Assumption 2: The standard deviation of the e values is the same regardless of the 
given value of x. 

(continued on page 580) 



• • • • 



FIGURE 15.5 



Selected residual plots; the 
interpretations are 
discussed in the text. 



(a) 



(b) 



• • • 



• • • • 



• • • 



• • • 
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(c) 



(d) 
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computer solutions 15.4 




Residual Analysis 
EXCEL 



_9_ 
JIO 

v\_ 

13 

11 

11 
16 

rz 

18 



li 

20 

21 

22 
23 



24 
25 
26 
27 
28 



29 
30 
31_ 
32 
33 



34 



35 
36 
37 
38 



RESIDUAL OUTPUT 



Observation Predicted Prodrate 



1 



55.2 



61.2 



70.2 



67.2 



52.2 



Q 



Residuals 



-0.2 



1.8 



-3.2 



2.8 



-1.2 



Standard Residuals 



-0.084 



0.754 



-1.340 



1.173 



-0.503 



Dexscore Residual Plot 



re 



5 -i 



■O 0 

to 

0) 



10 



12 



14 



16 



13 



Dexscore 



Dexscore Line Fit Plot 




♦ Prodrate 

■ Predicted 
Prodrate 

^—Linear 

(Prodrate) 



Normal Probability Plot 



£ 80 

■o 60 
o 

a. 40 



20 40 60 80 100 

Sample Percentile 



In step 2 of the procedure in Computer Solutions 15.1, select all four items in the Residuals section and select Normal 
Probability Plot. 
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3- 
2- 
1- 
§ 0- 

"D 

'5> 

-2- 
-3- 
-4 • 



Residuals Versus the Fitted Values 

(response is Prodrate) 



50 55 60 65 70 

Fitted Value 




In step 2 of the procedure in Computer Solutions 15.1, click Graphs. In the Residuals for plots menu, select Regular. 
In the Residual Plots menu, select Normal plot of residuals and Residuals versus fits. A histogram of residuals and a 
plot of residuals versus order are also available, but not shown here. To obtain the printout of residuals shown in Table 
1 5.2, in step 1 of Computer Solutions 15.1 click Results, then select In addition, the full table of fits and residuals. 
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Plot the residuals versus the x values. The amount of scatter should be approxi- 
mately the same for all values of x, as shown in part (a) of Figure 15.5. In part (b) 
of Figure 15.5, the assumption of equal standard deviations has been violated, as 
the residuals tend to have a greater standard deviation for larger values of x. In 
part (c) of Figure 15.5, the scatter diagram of residuals is in a band that is not 
horizontal and centered on zero for all values of x, reflecting that the relationship 
between y and x may not be linear. 

Assumption 3: The residuals are independent of each other. 

Plot {y j — y t ) versus the value of i. If successive observations have errors, or residu- 
als, that are correlated, a phenomenon known as autocorrelation exists. For exam- 
ple, if the scatter diagram in this plot tends to have a negative slope or a positive 
slope, as shown in part (d) of Figure 15.5, this would suggest the presence of auto- 
correlation. The topic of autocorrelation is especially relevant to regression models 
fitted to a time series, and is discussed in Chapter 18. 

Computer Solutions 15.4 describes the procedures and shows the results when 
Excel and Minitab are used in analyzing the residuals associated with the dexter- 
ity regression analysis. Referring to the first and third plots in the Excel portion of 
Computer Solutions 15.4, we make the following observations: (1) Except for the 
residual associated with the highest dexterity test score, the residuals increase as 
the dexterity test score increases; (2) The points in the normal probability plot are 
approximately in a straight line, which supports the possibility that the residuals 
could have come from a normal distribution. The number of data points is rather 
small and, despite the tendency noted in item (1), we are unable to conclude that 
the assumptions of the simple linear regression model have not been met. 

Cautionary Notes 

In applying regression and correlation analysis, it is not enough just to plug in y 
and x values, then carry out the calculations or assign them to the computer. Two 
major pitfalls can undermine our good intentions. 

Causation Confusion 

When analysis results in a high value for the coefficient of determination, the in- 
dication is that changes in x go a long way toward explaining changes in y. How- 
ever, this does not mean that x causes y. In fact, y may be causing x, or both y and 
x may be caused by one or more other variables that have not been included in the 
analysis. 

If we were to analyze 1990-1999 net sales (y) versus advertising and promo- 
tion (x) for Bristol-Myers Squibb Company, we would find the coefficient of cor- 
relation to be r = 0.982 and the coefficient of determination to be r 1 = 0.965 (see 
Figure 15.6). Statistically, changes in advertising and promotion "explain" 96.5% 
of the changes in net sales levels. This supports the widespread notion that more 
advertising tends to result in greater sales. However, higher sales allow for a larger 
advertising budget, and many companies actually budget their advertising as a 
fraction of their actual or anticipated sales. In such a case, y and x are actually 
causing each other! 

Sometimes variables that are highly correlated have almost nothing at all to 
do with each other. For example, an analysis of BMW sales versus the average 
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Bristol-Myers Squibb, Sales vs. Advertising 

Sales = 1841 + 7.118 Advertising 
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FIGURE 15.6 



Minitab scatter diagram 
and fitted line for annual 
sales (y) versus annual ex- 
penditures on advertising 
and promotion (x) for 
Bristol-Myers Squibb 
Company for the period 
1990-1999. While the 
coefficient of correlation is 
very high (r = +0.982), the 
question remains as to 
whether (1) higher levels of 
advertising cause greater 
sales, or (2) higher levels 
of sales lead to a larger 
advertising budget. Source 
of underlying data: Bristol- 
Myers Squibb Company, 
1999 Annual Report, 
pp. 50, 51. 



salary of secondary school teachers results in a very high correlation: 2 
For the years 1991-1995: 

y = Sales of BMW automobiles in the United States 
x = Average salary, secondary school teachers 

The regression equation is y = —305,873 + 10,644x 

r = +0.996 and r 1 = 0.992 

Without further consideration, it might seem as though all these BMWs were 
being purchased by secondary school teachers, since the teachers' salary levels "ex- 
plain" 99.2% of the variation in BMW sales. However, it is more likely that both y 
and x have simply increased largely as a function of time. If two variables are linearly 
related to time, they will also be linearly related to each other. Whenever two vari- 
ables are highly correlated, but there is little evidence to suggest that either causes the 
other (e.g., annual flounder catch in the North Atlantic and annual bachelor's 
degrees granted in physics), this is known as a spurious, or nonsense, correlation. 

Extrapolating beyond the Range of the Data 

When regression and correlation are based on a given set of data, it can be dan- 
gerous to make estimates for y based on x values outside the range of these data. 
Figure 15.7 (page 582) shows the scatter diagram and regression line for EPA city 
miles per gallon (y) versus curb weight (x) for a sampling of automobiles from the 
2001 model year. As expected, heavier cars tend to get fewer miles per gallon. 
However, if we go beyond the range of x values from which the regression line 
was derived, we find that a car weighing 6460 pounds would get zero miles per 
gallon on the EPA city test, a physical impossibility. Statistics in Action 15.2 dis- 
cusses both causation and extrapolation beyond the data for salaries versus team 
performance in professional baseball. 



2 Source: Automotive News 1996 Market Data Book, p. 147; and Bureau of the Census, Statistical 
Abstract of the United States 1996, p. 167. 




Major League Baseball: Does Money Really Win Games? 



In the chapter opening vignette, we discussed the possible 
relationship between how well a team is paid and how well 
they play. Using Excel, we've performed a simple regression 
analysis of the performance of major league baseball teams 
versus the size of their team payrolls during the 2000 sea- 
son. Applying Excel and the methods in Computer Solu- 
tions 15.1 and 15.4, we obtain the regression equation 

Wins = 67.635 + 0.249 Payroll 

(payroll in millions of dollars) 

r = +0.5732 with r 2 = 0.3286 

Not unexpectedly, teams with higher payrolls tended 
to win more games. Actually, the slope of the equation 
(+0.249) indicates that, on average, it costs a team about 
four million dollars for each additional win for the season. 
Table 1 5.3 shows the payroll and number of wins for each 
team, the number of wins predicted by the regression equa- 
tion, and the residual (actual wins minus predicted wins). 
Especially interesting in Table 15.3 is a comparison of two 
teams that were not among the highest-paid in the league: 
the Chicago White Sox and the Chicago Cubs. The White 
Sox got a lot of wins for the money, 18.16 more than the 



equation predicted, while the Cubs were 15.16 wins below 
the performance predicted for a team with their salary level. 

The upper portion of Figure 1 5.8 shows the scatter dia- 
gram and fitted regression line, while the lower portion is 
the plot of residuals (vertical axis) versus payrolls (horizontal 
axis). At approximately the $37 million and $50 million pay- 
roll levels, respectively, the overachieving White Sox and the 
underachieving Cubs can be seen to be exceptionally far 
from the regression line. With their league-high $112,013 
million payroll, one would expect the New York Yankees to 
have had a successful season — and they did. Not only did 
the Yankees have the most regular-season wins, they also 
beat the New York Mets in the World Series, 4 games to 1 . 
A final observation: If there were a team whose members 
received zero salary, the equation would nevertheless pre- 
dict them as winning nearly 68 games. This is misleading, 
however, because such an estimate would be based on a 
payroll that lies far below those in the underlying data 
($1 4.692-$ 1 12.013 million). 



Data sources: Payrolls and wins, USA Today, October 3, 2000, 
World Series, World Almanac and Book of Facts 2001, p. 974. 



p. 5C; 



FIGURE 15.7 



Making estimates of y 
based on x values beyond 
the data is a dangerous 
proposition. This Excel scat- 
ter diagram represents EPA 
city gas mileage (y) versus 
vehicle weight (x) for a sam- 
ple consisting of fourteen 
2001 auto models. Accord- 
ing to the regression line, a 
car weighing 6460 pounds 
would get zero miles per 
gallon in the test. Source 
of underlying data: 
Consumer Reports 
New Car Preview 200 1 . 
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A 


B 


C 


D 


E 


1 


Team 


Payroll 


Wins 


Predicted Wins 


Residual 


2 


Anaheim Angels 


53.049 


82 


80.84 


1.16 


3 


Arizona Diamondbacks 


73.866 


85 


86.03 


-1.03 


4 


Atlanta Braves 


94.398 


95 


91.14 


3.86 


5 


Baltimore Orioles 


59.265 


74 


82.39 


-8.39 


6 


Boston Red Sox 


90.535 


85 


90.18 


-5.18 


7 


Chicago Cubs 


50.311 


65 


80.16 


-15.16 


8 


Chicago White Sox 


36.980 


95 


76.84 


18.16 


9 


Cincinnati Reds 


36.252 


85 


76.66 


8.34 


10 


Cleveland Indians 


76.518 


90 


86.69 


3.31 


11 


Colorado Rockies 


56.050 


82 


81.59 


0.41 


12 


Detroit Tigers 


58.545 


79 


82.21 


-3.21 


13 


Florida Marlins 


25.107 


79 


73.89 


5.11 


14 


Houston Astros 


50.525 


72 


80.22 


-8.22 


15 


Kansas City Royals 


23.655 


77 


73.53 


3.47 


16 


LA Dodgers 


90.383 


86 


90.14 


-4.14 


17 


Milwaukee Brewers 


26.183 


73 


74.15 


-1.15 


18 


Minnesota Twins 


14.692 


69 


71.29 


-2.29 


19 


Montreal Expos 


27.617 


67 


74.51 


-7.51 


20 


New York Mets 


81.835 


94 


88.01 


5.99 


21 


New York Yankees 


112.013 


98 


95.53 


2.47 


22 


Oakland Athletics 


32.169 


91 


75.65 


15.35 


23 


Philadelphia Phillies 


35.934 


65 


76.58 


-11.58 


24 


Pittsburgh Pirates 


29.336 


69 


74.94 


-5.94 


25 


St. Louis Cardinals 


72.688 


95 


85.73 


9.27 


26 


San Diego Padres 


55.326 


76 


81.41 


-5.41 


27 


SF Giants 


54.036 


97 


81.09 


15.91 


28 


Seattle Mariners 


61.715 


91 


83.00 


8.00 


29 


Texas Rangers 


61.831 


71 


83.03 


-12.03 


30 


Toronto Blue Jays 


54.080 


83 


81.10 


1.90 


31 


Tampa Bay Devil Rays 


51.450 


69 


80.45 


-11.45 



TABLE 15.3 



For the 2000 major league 
baseball season discussed in 
Statistics in Action 15.2, 
team payrolls, wins, wins 
predicted by the regression 
equation, and residuals 



15.60 In regression analysis, what is a "residual"? 

15.61 What is residual analysis, and what information 
can it provide? 

15.62 What is spurious correlation? Provide a real or 
hypothetical example where two variables might exhibit 
such a relationship. 

15.63 A firm finds the coefficient of correlation between 
y = annual sales and x = annual expenditure on research 
and development to be r = +0.90. Comment on the 
likely direction of causation, if any, between these 
variables. 

15.64 "The Ford Edsel was sold only in the 1957-1959 
model years, but the company made a big mistake in 
dropping it. If they had just fitted a regression line to the 
sales for these 3 years and extended it to 2006, they could 
have seen that it would have been one of the ten sales 
leaders of that year. " Comment on the appropriateness of 
this statement. 



/ data set/ Note: Exercises 15.65-15.68 require a 
computer and statistical software. 

15.65 For the y and x values listed in file XR15065, 
obtain the simple linear regression equation, then analyze 
the residuals by (a) constructing a histogram, (b) using a 
normal probability plot, (c) plotting the residuals versus 
the x values, and (d) plotting the residuals versus the 
order in which they were observed. Do any of the 
assumptions of the simple linear regression model appear 
to have been violated? 

15.66 For the y and x values listed in file XR15066, 
obtain the simple linear regression equation, then analyze 
the residuals by (a) constructing a histogram, (b) using a 
normal probability plot, (c) plotting the residuals versus 
the x values, and (d) plotting the residuals versus the 
order in which they were observed. Do any of the 
assumptions of the simple linear regression model appear 
to have been violated? 
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FIGURE 15.8 



Excel scatter diagram, 
fitted regression equation, 
and residuals plot for the 
regression analysis of the 
payroll and wins data in 
Table 15.3 
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15.67 For the regression equation obtained in 
Exercise 15.57, analyze the residuals by (a) constructing 
a histogram, (b) using a normal probability plot, 

(c) plotting the residuals versus the x values, and 

(d) plotting the residuals versus the order in which they 
were observed. Do any of the assumptions of the simple 
linear regression model appear to have been violated? 
Comment on whether any of the years were associated 
with especially high or low fuel consumption for the 
number of flying hours during the year. 



15.68 For the regression equation obtained in Exercise 
15.59, analyze the residuals by (a) constructing a 
histogram, (b) using a normal probability plot, (c) plotting 
the residuals versus the x values, and (d) plotting the 
residuals versus the order in which they were observed. 
Do any of the assumptions of the simple linear regression 
model appear to have been violated? Comment on 
whether any of these high-growth companies had an 
especially high or low income for a firm with their level 
of revenue. 



1 5.7 SUMMARY 



• Linear regression model and its assumptions 

In simple linear regression and correlation, regression analysis provides a "best- 
fit" linear equation for the data, and correlation analysis measures the strength of 
the linear relationship. In the linear model, y i = /3 0 + /3 l x l + e-, y and x are the 
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dependent and independent variables, respectively. The simple linear regression 
model assumes the following: (1) for any given value of x, the y values are nor- 
mally distributed with a mean that is on the regression line, or /j, y x = /6 0 + /i^x-; 
(2) regardless of the value of x, the standard deviation of the y values about the 
regression line is the same; and (3) the y values are statistically independent of 
each other. In multiple linear regression and correlation (Chapter 16), there is one 
dependent variable, but two or more independent variables. 

Variables are directly related when they increase or decrease together and in- 
versely related when an increase in one is accompanied by a decrease in the other. 
The relationship may also be linear or curvilinear, depending on whether a 
straight line or a curve is best for describing their relationship. The emphasis of 
the chapter is on linear relationships. 

• Determining the least-squares linear regression equation 

The scatter diagram visually summarizes the data points and consists of vertical 
(y) and horizontal (x) axes. A ruler and the "eyeball" method can be used in fit- 
ting an approximate straight line to the data. A superior approach is to calculate 
the regression line, y = b 0 + b^x, that satisfies the least-squares criterion. This 
criterion requires that the y-intercept (b Q ) and slope be such that the sum of 
the squared vertical distances between data points and the line is minimized. 

Formulas are provided for the calculation of b Q and b 1 for the least-squares re- 
gression line, y = b 0 + b x x. Because the regression line is based on sample data, 
the y-intercept (b Q ) and slope (b x ) are really estimates of their respective (and un- 
known) population values, /3 0 and j3 1 . For a given value of x, the point estimate 
of y will be y, calculated from the regression line. Interval estimates for y can also 
be developed, and the first step is to calculate the standard error of estimate, s y x . 
The standard error of estimate is the extent to which y values are scattered or dis- 
persed above and below the regression line. For a given x value, the difference be- 
tween the observed and estimated values of y is the error, or residual. 

• Point and interval estimates 

Given a specific value of x, two kinds of interval estimates can be made regarding 
y: (1) a confidence interval for the (unknown) true mean of y, and (2) a prediction 
interval for an individual observation of y. Each approach relies on a correction 
formula that takes into account the standard error of estimate, the number of 
data points in the sample, and the distance between x and the specific value of x 
on which the interval is based. 

• Coefficients of correlation and determination 

Two important measures of the strength of the linear relationship between y and 
x are the coefficient of correlation (r) and the coefficient of determination (r 1 ). 
The coefficient of correlation will be no less than r = — 1 and no greater than 
r = +1, and the larger the absolute value of r, the stronger the linear relationship 
between the variables. Whenever r = 0, there is no linear relationship at all, and 
the slope of the regression line will be b 1 = 0. The coefficient of determination (r 2 ) 
is the proportion of the variation in y that is explained by the regression line, 
y = b 0 + b x x, and r 2 will always be ^ 1. 

• Testing the linear relationship 

The significance of the linear relationship can be tested by examining whether the 
coefficient of correlation differs significantly from zero, or by testing whether the 
slope differs significantly from zero. The tests are equivalent to each other as well 
as to an analysis of variance test that examines the amount of variation in y that 
is explained by the equation. 
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• Computer-assisted regression analysis 

Computer statistical packages provide detailed regression and correlation analysis, 
and their output typically includes the descriptors, tests, and interval estimates dis- 
cussed in the chapter. Both Excel and Minitab Computer Solutions procedures are 
provided that correspond to the principal example used throughout the chapter. 

With statistical software, it is easy to analyze the residuals and examine the as- 
sumptions of the model. Computer statistical packages also identify data points 
that are relatively distant from the regression line. Such "unusual" observations 
may reflect the possession of unusual characteristics by the sample members that 
these points represent. Residual analysis is especially useful in examining whether 
various assumptions of the regression model may have been violated. 

• Potential problems with simple linear regression 

In applying regression and correlation analysis, it is important to avoid confusing 
a strong linear relationship between y and x with the conclusion that y is actually 
caused by x. In reality, y could be causing x, or both y and x might be heavily in- 
fluenced by another variable that has not been included in the analysis. Another 
precaution is to avoid extrapolating beyond the range of the data. It can be ex- 
tremely misleading to make point or interval estimates of y based on an x value 
that lies beyond the range of data points from which the regression line has been 
derived. 




N 


O 
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The Simple Linear Regression Model 

y i = /3 0 + (i 1 x i + e- where y i = a value of the dependent variable, y 

Xj = a value of the independent variable, x 
;6 0 = the y-intercept of the regression line 
/Sj = the slope of the regression line 
e. = random error, or residual 

1. For a given value of x, the expected value of y is given by the linear equation, 
fly x = f) 0 + PiXj. The term fi y x can be stated as "the mean of y, given a specific value 
of x." 

2. The difference between the actual value of y and the expected value of y is the error, or 
residual, e ( - = y, - (j8 0 + /S^,). 

The Sample Regression Line 

y = b 0 + b x x where y = the estimated value of the dependent variable (y) 

for a given value of x 
b 0 = the y-intercept; this is the value of y where the 

line intersects the y-axis whenever x = 0 
b-, = the slope of the regression line 
x = a value for the independent variable 

Least-Squares Criterion for the Best-Fit Regression Line 

The equation must be such that 2(y ( — y,) 2 is minimized 

where y ; = the observed value of y for the given value of x 

y i = the predicted value of y for the x value, as determined from the 
regression equation 
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Least-Squares Regression Line,/ 

• Slope: 



2* 



b 0 + Z^x 



where n = number of data points 



y-intercept: 



b 0 = V 



Standard Error of Estimate 



where y- = each observed value of y in 
the data 
3> ; = the value of y that would 
have been estimated from 
the regression line 
n = the number of data points 



Confidence Interval for the Mean of y. Given a Specific Value of x 



±f(s ft/ 1 ■ Rvalue-*) 2 

v » Cs*, 2 ) - Cs*,.) 2 /« 



where y = the estimated value of y 
for the given value of x 
n = the number of data points 
s,, „ = the standard error of estimate 

y. x 

t = t value for confidence 

level desired and df = n — 2 
x value = the given value of x 



Prediction Interval for an Individual y. Given a Specific Value of x 

V(^c Vciluc — ^c)^ 
1 H h ^ r- ; ry— where y = the estimated value of y 
» (2*r) " (2*ir/» for the given value of x 

n = number of data points 
s y x = the standard error of 
estimate 
t = t value for the confidence 
level desired and 
df = n — 2 
x value = the given value of x 

The Coefficient of Correlation 

where r = coerncient or 



V »(E^ 2 ) " (2*,-) 2 • V «(2y 2 ) " (Sy,) 2 correlation, -1 < r < +1 

n = number of data points 

The Coefficient of Determination, r 2 

r 2 = 1 



Error variation, which is not explained by the regression line 



Total variation in the y values 



m - y,r _ _sse 

2(y,--y) 2 SST 
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t-Test for the Population Coefficient of Correlation, p 

• Null hypothesis 

H Q : p = 0 There is no linear relationship. 

• Alternative hypothesis 

Hjt p ~f~ 0 A linear relationship exists. 

• Test statistic 

where r = sample coefficient 

~ of correlation 

— 2 n = number of data points 

t = t value for the level 
of significance of the 
test and df = n — 2 

(Testing H Q : p = 0 is equivalent to testing H 0 : /3j = 0, described below.) 

t-Test for the Slope (fij of the Population Regression Line 

• Null hypothesis 

H Q : j8j = j3 10 The population slope is /3 1Q . 

• Alternative hypothesis 

/3j ¥= f) 10 The population slope is not /3 10 . 



Test statistic 

b 



i f-'io 



with 



bl V^xf - n(x) 2 



where b t = slope of the sample regression line 
;6 10 = a value that has been hypothesized 
for the slope of the population 
regression line 



where s b = the estimated standard deviation 



of the slope 
s„ „ = the standard error of estimate 

y. x 

n = the number of data points 
t = t value for the level of significance 
of the test, and df = n — 2 

Confidence Interval for the Slope (pj of the Population Regression Line 

The interval is 

£>j ± ts h where b t = slope of the sample regression line 
t = t value for the confidence level 
desired, and df = n — 2 
s fe] = the estimated standard deviation 

of the slope, s 



V2*2 - w(3c)2 



Components of the Total Variation in the Dependent Variable, y 

Total variation Variation explained Variation not explained 
in y values = by regression line + by regression line 
(SST) (SSR) (SSE) 

- y) 2 SSR = SST - SSE 2(y ; - y ; ) 2 
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ANOVA Test for the Significance of a Simple Linear Regression 

• Null and alternative hypotheses 

H 0 : There is no linear relationship between x and y. 
Hj: A linear relationship exists between x and y. 

• Test statistic 

Calculated F = ., — where SSR = sum of squares explained 

SSE/(n -2 , ■ ,. 

by the regression line 

SSE = error, or residual, sum of squares 

n = number of data points 

• Decision rule Reject H 0 if calculated F exceeds the critical F for the level of significance 
of the test, with df (numerator) = 1 and df (denominator) = n — 2. 




15.69 For households in a community, many different 
variables can be observed or measured. If y = monthly 
mortgage payment, x = annual income would probably 
be directly related to y. For y = monthly mortgage pay- 
ment, provide an example of an x variable likely to be 
(a) directly related to y, (b) inversely related to y, and 
(c) unrelated to y. 

15.70 For high school seniors entering college, and 
for y = grade point average for the freshman year, 

x = Scholastic Aptitude Test score would tend to be 
directly related to y. Provide an example of another per- 
sonal variable that might be directly related to y along 
with an example of a personal variable that you think 
would tend to be inversely related to y. 

15.71 McDonald's Corporation has reported the following 
values for total revenues and net income during the 1998 to 
2005 period. All data are in billions of dollars: 

source: McDonald's Corporation, 2005 Annual Report. 

1998 1999 2000 2001 

Net 

Income 1.55 1.95 1.98 1.64 
Total 

Revenues 12.42 13.26 14.24 14.87 

2002 2003 2004 2005 

Net 

Income 0.89 1.47 2.28 2.60 
Total 

Revenues 15.41 17.14 19.07 20.46 

Determine the least-squares regression equation line for 
estimating net income and interpret its slope. For a year in 



which total revenues are $18.0 billion, estimate the net 
income for that year. 

15.72 Prior to being hired, the five salespersons for a 
computer store were given a standard sales aptitude test. 
For each individual, the score achieved on the aptitude 
test and the number of computer systems sold during the 
first 3 months of their employment are shown here. 

John Cora Sam Cindy Henry 

x = Score on 
aptitude 

test 80 70 45 90 20 

y = Units 
sold in 

3 months 25 15 10 40 5 

Determine the least-squares regression line and inter- 
pret its slope. Estimate, for a new employee who 
scores 60 on the sales aptitude test, the number of 
units the new employee will sell in her first 3 months 
with the company. 

15.73 The following data represent y = the number of 
passenger cars retired from service and x = the number 
of new passenger cars registered during the years 
1991-1995. source: Bureau of the Census, Statistical Abstract of the 
United States 1999, p. 639. 

x = New-Car y = Cars Retired 

Registrations from Service 



Year (Millions) (Millions) 

1991 8.175 8.565 

1992 8.213 11.194 

1993 8.518 7.366 

1994 8.990 7.824 

1995 8.635 7.414 
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a. Determine the least-squares regression line and 
interpret its slope. 

b. Estimate, for a year in which x = 7.0 million new 
passenger cars are registered, the number of cars that 
will be retired from service. 

c. Estimate, for a year in which 9.0 million new 
passenger cars are registered, the number of cars 
that will be retired from service. 

15.74 Peoples Energy, Inc. reports the following data for 
operating revenue and net income for 2001-2005. source: 
Peoples Energy, 2005 Annual Report, pp. 83-84. 

x = Operating 



Revenue y = Net Income 

Year (Millions) (Millions) 

2001 2270.2 96.9 

2002 1482.5 89.1 

2003 2138.4 103.9 

2004 2260.2 81.6 

2005 2600.0 78.1 



Determine the least-squares regression line and interpret 
its slope. Estimate the net income if the operating revenue 
figure is $2500 million. 

15.75 For 1998 and 1999, quarterly revenue and net in- 
come (millions of dollars) for Pacer Technology were as 
shown here, source: Pacer Technology, 1 999 Annual Report, p. 30. 

1998 1999 
I II III IV I II III IV 

x = 

Revenue $13.66 11.84 9.99 10.56 7.38 6.27 8.32 9.97 
y = Net 

income 0.96 0.42 -0.39 0.30 0.45 0.28 0.43 0.39 

Determine the least-squares regression line and interpret 
its slope. Estimate net income if the revenue figure is $10 
million. 

15.76 The data in the table describe rug and carpeting 
shipments ($ billions) and the number of housing starts 
(millions) over a 6-year time period, source: Bureau of the 
Census, Statistical Abstract of the United States 1999, p. 723. 

Year 

1992 1993 1994 1995 1996 1997 

Rugs and Carpeting 

Shipped (SBillions) 8.75 9.28 9.53 9.77 10.15 10.26 
Housing Starts 

(Millions) 1.20 1.29 1.46 1.35 1.48 1.47 

a. Determine the least-squares regression line for 

predicting rugs and carpeting shipments on the basis 
of housing starts. 



b. Determine and interpret the coefficients of correlation 
and determination. 

c. If there were 1.4 million housing starts in a year, 
what would be the prediction for rugs and carpeting 
shipments during that year? 

15.77 The following data describe fuel consumption 
and flying hours for turboprop general aviation aircraft 
from 1992 through 1997. Fuel consumption is in 
millions of gallons; flying time is in millions of hours. 

source: General Aviation Manufacturers Association, 1999 Statistical 
Databook, p. 21 . 

Year 

1992 1993 1994 1995 1996 1997 

Fuel Consumed 131.8 95.9 92.7 124.4 145.0 135.7 
Flying Time 1.6 1.2 1.1 1.5 1.8 1.7 

a. Determine the least-squares regression line for 
predicting fuel consumption on the basis of flying 
time. 

b. Determine and interpret the coefficients of correlation 
and determination. 

c. If there were 2.0 million flying hours during a given 
year, what would be the prediction for the amount of 
fuel consumed? 

15.78 For the regression analysis of the data in 
Exercise 15.77: 

a. Use the 0.10 level in testing whether the population 
coefficient of correlation could be zero. 

b. Use the 0.10 level in testing whether the population re- 
gression equation could have a slope of zero. 

c. Construct the 90% confidence interval for the slope of 
the population regression equation. Discuss the inter- 
val in terms of the results of part (b). 

15.79 The following data describe U.S. passenger car 
travel and fuel consumption from 1995 through 1999. 
The data represent billions of gallons consumed and 
billions of miles traveled, respectively, source: Time Almanac 

2003, p. 591. 

1995 1996 1997 1998 1999 

Fuel 

Consumed 68.07 69.22 69.87 71.70 73.16 
Miles 

Traveled 1438.29 1469.85 1501.82 1549.58 1569.27 

a. Determine the least-squares equation line for 
predicting fuel consumption on the basis of driving 
miles. 

b. Determine and interpret the coefficients of correlation 
and determination. 

c. If passenger car travel during a year were 1580 billion 
miles, what level of fuel consumption would be 
predicted? 
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Vehicle 


Front Crash into Flat Barrier 


Rear Crash into Pole 


2002 Buick Rendezvous 


$21 3 


<t" DCOl 


2002 Chevrolet Trailblazer 


81 1 


/DD 


zuuz hord explorer 


1 1 27 


1 699 


2002 Isuzu Axiom 


948 


1 583 


")nm loon 1 il-ior+i/ 

zuuz jeep LiDeriy 


ono 


1 a on 
1 4yu 


2001 Acura MDX 


128 


755 


2001 BMW X5 


928 


196 


2001 Mitsubishi Montero 


1257 


2897 


2001 PontiacAztek 


358 


2302 


2001 Suzuki Grand Vitara 


681 


2095 


2001 Toyota Highlander 


650 


542 



15.80 For the regression analysis of the data in 
Exercise 15.79: 

a. Use the 0.05 level in testing whether the population 
coefficient of correlation could be zero. 

b. Use the 0.05 level in testing whether the population 
regression equation could have a slope of zero. 

c. Construct the 95% confidence interval for 

the slope of the population regression equation. 
Discuss the interval in terms of the results of 
part (b). 

15.81 As part of its ongoing testing and reporting 
program, The Insurance Institute for Highway Safety 
crashes vehicles into fixed objects at low speeds, then 
records the cost of repairs necessary to fix whatever 
damage has occurred. In crash-testing a sample of 
sport utility vehicles (SUVs) at 5 mph, the repair costs 
were as shown in the table at the top of the page. 
source: The Insurance Information Institute Fact Book 2002, 

p. 107. 

a. Determine the least-squares equation line for 
predicting rear-crash repair costs on the basis of 
front-barrier repair costs. 

b. Determine and interpret the coefficients of correlation 
and determination. 

c. If a sport utility vehicle were to incur $1000 in dam- 
ages in the front-crash test, what level of repairs 
would be predicted for the rear-crash test? 

15.82 For the regression analysis of the data in Exer- 
cise 15.81: 

a. Use the 0.10 level in testing whether the population 
coefficient of correlation could be zero. 

b. Use the 0.10 level in testing whether the population 
regression equation could have a slope of zero. 

c. Construct the 90% confidence interval for the slope 
of the population regression equation. Discuss the 
interval in terms of the results of part (b). 

15.83 The following Minitab output describes the results 
of a production experiment in which n = 10 different 
steel bars were subjected to tempering times ranging from 



6 to 10 seconds, then measured for tensile strength 
(thousands of pounds): 

Regression Analysis 

The regression equation is 
STRENGTH = 60.0 + 10.5 TEMPTIME 



Predi ctor 

Constant 

TEMPTIME 



Coef 
60.02 
10.507 



StDev 
29.36 
3.496 



T 

04 
01 



P 

0.075 
0.017 



S = 14.03 R-Sq = 53. 

Analysis of Variance 



R-Sq(adj) = 47.2% 



Source 
Regression 
Error 
Total 



DF 
1 



SS 

1777.4 
1574.6 
3352.0 



MS 

1777.4 
196.8 



F 

9.03 



P 

0.017 



a. To the greatest number of decimal places in the 
printout, what is the least-squares regression line? 

b. What proportion of the variation in strength is 
explained by the regression line? 

c. At what level of significance does the slope of the line 
differ from zero? What type of test did Minitab use in 
reaching this conclusion? 

d. At what level of significance does the coefficient of cor- 
relation differ from zero? Compare this with the level 
found in part (c) and explain either why they are dif- 
ferent or why they are the same. 

e. Construct the 95% confidence interval for the slope 
of the population regression line. 

15.84 For the ANOVA portion of the printout shown in 
Exercise 15.83, explain the exact meaning of each num- 
ber in the "SS" and "F" columns. 

15.85 A tire company has carried out tests in which 
rolling resistance (pounds) and inflation pressure (pounds 
per square inch, or psi) have been measured for psi values 
ranging from 20 to 45. The regression analysis is summa- 
rized in the following Minitab printout: 

Regression Analysis 



The regression equation is 
ROLRESIS = 9.45 - 0.0811 PSI 



[continued) 
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Predi ctor 


Coef 


StDev 




T 


P 


Constant 


9.450 


1.228 




7.69 


0.000 


PSI 


-0.08113 


0.03416 




-2.38 


0.029 


S = 0.8808 


R-Sq = 


23.9% 


R- 


Sq(adj) = 


19.6% 


Analysis of 


Variance 










Source 


DF 


SS 




MS 


F P 


Regression 


1 4 


3766 


4. 


3766 


5.64 0.029 


Error 


18 13 


9657 


0. 


7759 




Total 


19 18 


3422 









a. To the greatest number of decimal places in the print- 
out, what is the least-squares regression line? 

b. What proportion of the variation in rolling resistance 
is explained by the regression line? 

c. At what level of significance does the slope of the line dif- 
fer from zero? What type of test did Minitab use in 
reaching this conclusion? 

d. At what level of significance does the coefficient of cor- 
relation differ from zero? Compare this with the level 
found in part (c) and explain either why they are dif- 
ferent or why they are the same. 

e. Construct the 95% confidence interval for the slope of 
the population regression line. 

15.86 For the ANOVA portion of the printout shown in 
Exercise 15.85, explain the exact meaning of each num- 
ber in the "SS" and "F" columns. 

15.87 Given the data and regression analysis for 
Exercise 15.72, construct and interpret the 95% 
confidence and prediction intervals associated with a 
score of 60 on the sales aptitude test. 

15.88 Given the data and regression analysis for 
Exercise 15.73, construct and interpret the 90% 
confidence and prediction intervals associated with 9.0 
million new-car registrations. 

1 5.89 Given the data and regression analysis for Exercise 
15.76, construct and interpret the 95% confidence and pre- 
diction intervals associated with 1.4 million housing starts. 

15.90 A physical-fitness researcher tests 25 professional 
football players, finding that the time required to run 

40 yards (y, in seconds) is related to weight (x, in pounds) 
by the least-squares equation y = —2.0 + 0.038x. Accord- 
ing to this equation, a person weighing 200 pounds would 
run the 40-yard dash in an estimated time of 5.6 seconds. 
However, a child weighing 55 pounds would have an 
estimated time of just 0.09 seconds to run 40 yards. Briefly 
explain how this unlikely result could have been obtained. 

/ data set/ Note: Exercises 15.91-15.96 require a 
computer and statistical software. 

15.91 A university administrator has collected data on 
total Scholastic Aptitude Test (SAT) score versus 
freshman grade point average (GPA) for a sample of 
students who recently finished their first year at the 



university. The data are listed in file XR15091. Given 
these data: 

a. Determine the least-squares regression equation for 
estimating the freshman grade point average based on 
total SAT score, then estimate the freshman grade 
point average for an applicant who has scored a total 
of 1100 on his SAT examination. 

b. What percentage of the variation in freshman GPA is 
explained by SAT total? 

c. Given an SAT total of 1100, construct and interpret 
the 99% confidence and prediction intervals 
associated with freshman GPA. 

15.92 For the regression equation obtained in Exercise 
15.91, analyze the residuals by (a) constructing a histo- 
gram, (b) using a normal probability plot, (c) plotting the 
residuals versus the x values, and (d) plotting the residu- 
als versus the order in which they were observed. Do any 
of the assumptions of the simple linear regression model 
appear to have been violated? 

15.93 The National Association of Realtors has reported 
data on housing affordability for 1990-1999, and values 
for two of the variables are listed in file XR15093. Repre- 
senting affordability is y = average monthly mortgage 
payment as a percentage of median household income. 
Representing the cost of a mortgage is x = the average 
mortgage rate, as a percentage, source: World Almanac and 
Book of Facts 2001, p. 743. 

a. Determine the least-squares regression equation for y 
as a function of x, then estimate y if the average 
mortgage rate is x = 8.0%. 

b. What percentage of the variation in affordability (y) is 
explained by the average mortgage rate (at)? At the 
0.05 level, does the slope of the equation differ signifi- 
cantly from 0 ? 

c. Given x = 8.0%, construct and interpret the 95% 
confidence and prediction intervals associated with y. 

15.94 For the regression equation obtained in Exercise 
15.93, analyze the residuals by (a) constructing a 
histogram, (b) using a normal probability plot, (c) plotting 
the residuals versus the x values, and (d) plotting the resid- 
uals versus the order in which they were observed. Do any 
of the assumptions of the simple linear regression model 
appear to have been violated? 

15.95 Computer database GROWCO describes the charac- 
teristics of 100 companies identified by Fortune as among 
the fastest growing. Two of the variables listed are ( 1 ) esti- 
mated stock price/earnings ratio next year and (2) annual 
revenue growth (as a percentage), source: "Fortune's 100 
Fastest-Growing Companies," Fortune, September 4, 2000, pp. 142-158. 
a. Determine the least-squares regression equation for 

y = next year's price/earning ratio estimate as a func- 
tion of x = annual revenue growth. What would be 
the predicted value of y for a company whose annual 
revenue growth has been 150%? 
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b. What percentage of the variation in the magazine's 
price/earnings estimates (y) is explained by the an- 
nual revenue growth percentage (x)? At the 0.05 
level, does the slope of the equation differ signifi- 
cantly from 0? 

c. Given x = 150%, construct and interpret the 95% 
confidence and prediction intervals associated with y. 



15.96 For the regression equation obtained in Exercise 
15.95, analyze the residuals by (a) constructing a 
histogram, (b) using a normal probability plot, (c) 
plotting the residuals versus the x values, and (d) plotting 
the residuals versus the order in which they were 
observed. Do any of the assumptions of the simple linear 
regression model appear to have been violated? 



integrated 





Thorndike Sports Equipment 

Ted Thorndike's friend Mary Stuart teaches a statistics 
course at the local university, and she has asked Ted to 
stop in and talk to the class about statistics as it relates to 
sports. Among the topics that Ted thinks might be interest- 
ing is a discussion of how Olympic swimming perfor- 
mances for both men and women have improved over the 
years. 

According to Ted, swimmers and other athletes have 
been getting bigger, faster, and stronger. In addition, train- 
ing technologies and equipment have improved by leaps 
and bounds. Among the data that Ted has collected are the 
winning times for men and women in the 400-meter 



freestyle swimming event for Olympics held from 1924 
through 2004. The times are shown here and are also listed 
in data file THORN15. source: Time Almanac 2006, pp. 924, 926. 

1. Using the women's winning time as the dependent vari- 
able and year as the independent variable, determine 
and interpret both the regression equation and the coef- 
ficient of correlation. For the year 2012, determine the 
point estimate and the 95% prediction interval for the 
winning time in the women's 400-meter freestyle. 

2. Repeat question 1, using the men's winning time as the de- 
pendent variable. 



400-Meter Freestyle 
Swimming, Winning 
Time for 



Year 


Women 


Men 


1924 


6.037 minutes 


5.070 minutes 


1928 


5.713 


5.027 


1932 


5.475 


4.807 


1936 


5.440 


4.742 


1948 


5.297 


4.683 


1952 


5.202 


4.512 


1956 


4.910 


4.455 


1960 


4.843 


4.305 


1964 


4.722 


4.203 


1968 


4.530 


4.150 


1972 


4.324 


4.005 


1976 


4.165 


3.866 


1980 


4.146 


3.855 


1984 


4.1 18 


3.854 


1988 


4.064 


3.783 


1992 


4.120 


3.750 


1996 


4.121 


3.800 


2000 


4.097 


3.676 


2004 


4.089 


3.718 
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3. Using the women's winning time as the dependent vari- 
able and the men's winning time as the independent 
variable, determine and interpret both the regression 
equation and the coefficient of correlation. If the men's 
winning time were 4.000 minutes in a given Olympics, 



determine the point estimate and the 95% prediction 
interval for the winning time in the women's event. 
4. For the regression equation obtained in question 3, use 
a test of your choice in examining the significance of 
the linear relationship between the variables. 



Springdale Shopping Survey 

This exercise involves the following variables from data 
file SHOPPING, the Springdale Shopping Survey: 

• Variables 7-9 Overall attitude toward the shopping 
area. The highest possible rating is 5. In each anal- 
ysis, one of these will be the dependent variable. 

• Variables 18-25 The importance (highest rating = 
7) placed on each of 8 attributes that a shopping 
area might possess. In each analysis, one of these 
will be the independent variable. 

Information like that gained from this database exer- 
cise could provide management with useful insights into 
how respondents form their overall attitude toward a 
shopping area. For example, management can find out 
whether people who place more importance on "a lot of 
bargain sales" tend to have a higher perception or a lower 
perception of a shopping area. 

1. With variable 7 (attitude toward Springdale Mall) as 
the dependent variable, perform two separate regres- 



■ 




sion analyses — one for each of the independent vari- 
ables listed below. In each case, determine and inter- 
pret the regression equation and the coefficients of 
correlation and determination, then use the 0.05 level 
of significance in reaching a conclusion on the signifi- 
cance of the linear relationship. If possible on your 
computer statistical package, retain the residuals for 
analysis in question 2. 

a. Variable 21 (good variety of sizes/styles). 

b. Variable 25 (a lot of bargain sales). 

2. For each of the regression analyses in question 1, exam- 
ine the residuals by using a histogram, a normal proba- 
bility plot, and a plot of the residuals against the values 
of the independent variable. In each case, comment on 
whether an assumption of the linear regression model 
may have been violated. 

3. Repeat questions 1 and 2, using variable 9 (attitude to- 
ward West Mall) as the dependent variable. 




Pronto Pizza (B) 

In the Pronto Pizza case, visited previously in Chapter 
10, the company was facing stiff competition in its pizza 
delivery service, and the owners were particularly con- 
cerned about being able to provide call-in customers 
with a guarantee as to how quickly their pizza would ar- 
rive. Manager Tony Scapelli had collected a month's 
worth of data that included such variables as prepara- 
tion time, wait time, travel time, and delivery distance. In 
this chapter, we will use simple linear regression and cor- 
relation to help Tony examine the number of travel min- 
utes required for a delivery versus the number of miles 
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involved in making the delivery. Using the PRONTO data 
file on the text CD and the variables TRAVEL_TIME and 
DISTANCE: 

1. Determine the coefficient of correlation between 
TRAVEL_TIME and DISTANCE. Is the sign of the 
coefficient of correlation positive or negative? Is this the 
sign you would expect to be associated with this coeffi- 
cient? What percentage of the variation in the driving 
time to deliver a pizza is explained by the number of 
miles for the trip? 

2. Determine the best-fit linear regression equation for esti- 
mating TRAVELJTIME on the basis of DISTANCE. 
Identify and interpret the slope of the equation in the 
context of this situation. 



3. What would be the estimated time of travel for a pizza 
delivery that involved a 5-mile trip? 

4. Determine and interpret the 95% confidence and pre- 
diction intervals associated with a pizza delivery that 
involved a 5-mile trip. 

5. Were there any deliveries for which the time for travel 
was "flagged" by your computer statistical package as 
being unusually long or short compared to the time 
that would have been estimated by the equation? What 
managerial implications could this have, especially 
considering the possibility of litigation if a delivery per- 
son were to be involved in an accident while trying to 
meet a delivery guarantee set by the company? 



Regression: Point Estimate fory 



When this applet loads, we see the best-fit linear regression 
equation for the dexterity test example featured through- 
out this chapter. Moving the slider at the bottom allows us 
to select a dexterity test score and immediately see the es- 
timated productivity for someone with that score. 
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Applet Exercises 

18.1 Move the slider so that the dexterity test score is 10. 
What is the point estimate of the productivity for someone 
with this score on the dexterity test? 

18.2 Move the slider so that the dexterity test score is 1 1 . 
What is the point estimate of the productivity for someone 
with this score on the dexterity test? 

18.3 In Applet Exercise 18.2, we increased the dexterity test 
score by 1 . By how much did the point estimate for produc- 
tivity increase? Compare this increase with the slope of the 
regression equation. 

18.4 By how much should the point estimate for productiv- 
ity increase if we were to increase the dexterity test score by 
5 points? Verify your answer by comparing the point esti- 
mates for dexterity test scores of 1 0 and 1 5. 



seeing statistics: applet 19 - 




Point Insertion Diagram and Correlation 



This applet loads with a blank grid. By clicking the mouse, 
we can insert points anywhere within the grid. For the 
points we have chosen, the applet automatically updates 
the coefficient of correlation and the best-fit regression 
line. To start over again with a blank grid, just click on the 
"Reset" button. 



100.0 



r = 0.85 




40.0 



20.0 



0 0 0.0 



20.0 40.0 60.0 80.0 100.0 
Reset | Clear | Remove Last 



Applet Exercises 

19.1 Insert any three points so that the best-fit line will 
slope upward. What is the sign of r? 

19.2 Reset to a clear screen and insert a pattern so that r 
will be between +0.7 and +0.9. Does the regression line 
seem to be a very good fit to the points? Next, insert some 
additional points that will make r < 0.7. 

19.3 Reset to a clear screen and insert two points that form 
an upward-sloping diagonal line. What is the value of rl 

19.4 Reset to a clear screen and insert a pattern of at least 
1 0 points so that r will be very close to - 1 . 

19.5 Reset to a clear screen and insert a pattern of at least 
1 0 points so that r will be very close to 0. 



Regression Error Components 



This applet loads with a scatter diagram containing the 
points in the dexterity test example used in this chapter. By 
clicking on the line, we can make it better or worse as an 
estimator of productivity. For each data point, the error is 



Productivity* = 19.2 ♦ 3.0 * Dexterity 

i sq « Productivity 
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the vertical distance between that point and the line we 
have selected. In the display on the left, the sum of the 
squared errors is shown at the bottom and the value of r 2 is 
shown at the top. They are represented by the green and 
red rectangles, respectively. By clicking the "Find Best 
Model" button, we obtain the best-fit regression equation 
for the data. 

Applet Exercises 

20.1 Click on the regression line and move it so that it is as 
close as possible to horizontal. Identify the value of r 2 and 
interpret its meaning. 

20.2 Click on the regression line and move it so that the 
slope of the equation is approximately 1 .0. What effect has 
this had on r 2 ? 

20.3 Click on the regression line and gradually move it so 
that the slope of the equation increases from approximately 
1.0 to approximately 5.0. What happens to r 2 as you are 
shifting the line? 

20.4 Click on the "Find the Best Model" button, then iden- 
tify and interpret both the slope of the equation and the 
resulting r 2 value in the context of the dexterity test exam- 
ple in the text. 



Accidents Don't Take Holidays OFF 

Statistics often represent positive events and pleasant experiences. For example, 40.7 
million people visited Walt Disney World in 2004, and Americans bought 7.5 million 
new cars that same year. 

On the other hand, it's often necessary to use statistics to keep track of things 
that are not so positive. A few examples: the 114 million visits to hospital emergency 
rooms in 2003 (leading reason: stomach and abdominal pain), the 16,200 who died 
as the result of falls in 2003, and the 44,800 who perished in automobile accidents 
during 2003. 

A disproportionate share of the auto accident fatalities tend to occur during major 
holiday periods, such as Thanksgiving, Christmas, and New Year's. Using multiple re- 
gression, an important technique covered in this chapter, we find a strong relationship 
exists between (1) the number of fatalities that occur during New Year's and (2) the 
numbers that occurred during the preceding Thanksgiving and Christmas periods. 

As we'll examine more closely in Statistics in Action 1 6. 1 , a multiple regression 
analysis of holiday traffic fatalities for a 1 0-year period revealed that nearly 85% of 
the variation in New Year's fatalities was explained by a linear 



equation linking New Year's fatalities (y) with 
the numbers occurring during the preceding 
Thanksgiving and Christmas (x 1 and x 2 , 
respectively). Although driving during all holiday 
periods tends to be a little more dangerous than 
everyday driving, holidays during some years would 
appear to be worse than holidays in others. If you're 
reading this either before or during a major holiday, 
be careful out there. 



Source: World Almanac and Book of Facts 2006, pp. 115, 184-185 
190, 749. 
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learning 
objectives 

After reading this 
chapter, you should 
be able to: 



Explain how the scatter diagram and least-squares concepts apply to multiple 
regression. 

Obtain and interpret the multiple regression equation; then make point and in- 
terval estimates regarding the dependent variable. 

Interpret the value of the coefficient of multiple determination and carry out a 
hypothesis test for its significance. 

Construct confidence intervals and carry out hypothesis tests involving the par- 
tial regression coefficients. 

Explain both the meaning and the applicability of a dummy variable. 

Use residual analysis in examining the appropriateness of the multiple regres- 
sion model and the extent to which underlying assumptions are met. 



16.1 



INTRODUCTION 



In Chapter 15, we examined the linear relationship between a dependent variable 
(y) and just one independent variable (x). More realistically, the value of y will be 
related to two or more independent variables (x v x 2 , and so on). The inclusion of 
more than one independent variable is the reason for the multiple descriptor in 
the chapter title. Multiple regression and correlation analysis could prove useful 
with variables like these: 



Dependent Variable (y) 



Independent Variables (x's) 



Annual unit sales 



Selling price of a home 



Advertising expenditure 

Size of sales force 

Price of the company's product 

Number of rooms 
Square feet of living space 
Size of the surrounding lot 



The purpose of this chapter is to determine, interpret, and examine the 
strength of linear relationships between a single y and two or more x's. As in 
Chapter 15, regression analysis is used in determining and interpreting the linear 
relationship, while correlation analysis measures its strength. 

In previous chapters, the basics of a technique were presented through small- 
scale problems for which relevant calculations could easily be performed with the 
pocket calculator. The calculations in multiple regression and correlation analysis 
are especially intensive. Computer statistical packages are readily available for re- 
ducing the number-crunching drudgery, and the computer will be our partner 
through much of the chapter. 

We'll begin by discussing the multiple regression model and examining the 
sample regression equation for an example that will continue throughout the 
chapter. The complete Excel and Minitab printouts will be presented and inter- 
preted; then selected segments of these printouts will be featured as they become 
relevant to the topics under discussion. Later in the chapter, we'll revisit the full 
printouts for an overview of their format and interpretation. Once again, for mul- 
tiple regression and correlation, the term "computer assisted" is a distinct under- 
statement. It's like saying a jet pilot is "aircraft assisted." 
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As in the preceding chapter, point and interval estimates for y are discussed, 
as are tests of significance involving the regression and correlation analysis itself. 
The basic principles are similar to their counterparts from Chapter 15. However, 
specific topics and methods of analysis and interpretation will differ slightly in 
order to accommodate the greater number of independent variables. 




exercises 




16.1 What is the difference between simple linear regres- 
sion and multiple regression? Under what circumstances 
would a multiple regression analysis be preferable? 

16.2 In multiple regression and correlation analysis, what 
is the purpose of the regression component? Of the corre- 
lation component? 

16.3 For y = annual household expenditure for auto 
maintenance and repair, what independent variables 
could help explain the amount of money a household 
spends per year for this purpose? 



16.4 A personnel director is using multiple regression 
analysis to examine employee absence records. He wants 
to find an estimation equation that will improve his 
understanding of variables that might explain the extent 
of such absences. For y = the number of days an 
employee called in sick last year, what x variables might 
be both useful to this analysis and likely to be available in 
each employee's personnel file? 



THE MULTIPLE REGRESSION MODEL 



16.2 



Model and Assumptions 

The multiple regression model is an extension of the simple linear regression 
model of Chapter 15. However, there are two or more independent variables in- 
stead of just one. As in Chapter 15, estimates of the population parameters in the 
model are made on the basis of sample data. Described in terms of its population 
parameters, the multiple regression model is as follows: 



The multiple regression model: 
where 



y, 



a value of the dependent variable, y 
a constant 



x u> x n> •••■> x ki = vames °f tne independent variables, jc«, x 2 , x k 
f3p /3 2 , Pk = partial regression coefficients for the independent 
variables, x v x 2 , ...,3tt 
e = random error, or residual 



For a given set of x values, the expected value of y is given by the regression 
equation, E(y) = j3 0 + P l x ll + P 2 x 2i + ••• + (3 k x kj . For each x term in the equa- 
tion, the corresponding /? term is referred to as the partial regression coefficient. 
Each p i (i = 1, 2, k) is a slope relating changes in E(y) to changes in one x vari- 
able whenever all of the other x's are held constant. 
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As in Chapter 15, the difference between an observed value of y and the value 
that was expected is the residual, e-. In terms of the residual component of the 
model, the following assumptions underlie multiple regression: 

1. For any given set of values for the independent variables, the population of e- 
values will be normally distributed with a mean of zero and a standard devia- 
tion of cr. 

2. The standard deviation of the e- values is the same regardless of the combina- 
tion of values taken on by the independent variables. 

3. The e- values are statistically independent of each other. 

Determining the Sample Regression Equation 

The sample regression equation is based on observed values for the dependent 
and independent variables. An extension of its counterpart from Chapter 15, it 
has the form y = b Q + b 1 x l + b 2 x 2 + ••• + b k x k . The constants in the sample re- 
gression equation (b Q , b v b 2 , b/J are estimates of their population counter- 
parts (P 0 ,p 1 ,p 2 , ...,P k ). 

Determination of the "best-fit" multiple regression equation is according to 
the least-squares criterion, in which the sum of the squared deviations between 
observed and estimated values of y is minimized. The following information, and 
analyses related to it, will be used as our medium for discussing multiple regres- 
sion and correlation in the sections to follow. 



example 



Multiple Regression Equation 

The president of a large chain of fast-food restaurants has randomly selected 
10 franchises and recorded for each franchise the following information on last 
year's net profit and sales activity. The data are also in file CX16REST. 



Franchise Net Counter Drive-Through 

Number Profit, y Sales, x 1 Sales, x 2 



1 $1.5 million $8.4 million $7.7 million 

2 0.8 3.3 4.5 

3 1.2 5.8 8.4 

4 1.4 10.0 7.8 

5 0.2 4.7 2.4 

6 0.8 7.7 4.8 

7 0.6 4.5 2.5 

8 1.3 8.6 3.4 

9 0.4 5.9 2.0 
10 0.6 6.3 4.1 



For these data, there will be one dependent variable (y = net profit) and two in- 
dependent variables (x 1 = counter sales; x 2 = drive-through sales). The form of 
the sample regression equation will be y = b 0 + b 1 x 1 + b 2 x 2 . 
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E 


F 


16 




Coefficients 


17 


Intercept 


-0.2159 


18 


$Counter 


0.0855 


19 


$Driveup 


0.1132 



The regression equation is 

SProfit = - 0.216 + 0.0855 SCounter + 0.113 SDriveup 
SOLUTION 

Table 16.1 shows Excel and Minitab printout segments describing the multiple re- 
gression equation for the restaurant data. The full printouts and the steps to obtain 
them are given in Computer Solutions 1 6. 1 , at the end of this section. In later sections 
of the chapter, we will examine other portions of the printouts. At this stage, our pri- 
mary concern is the sample regression equation itself. To four decimal places, it is 



Estimated 
net profit 



y-mtercept 



Partial regression 
coefficients 



-0.2159 + 0.0855*! + 0.1132x 2 



Counter 




Drive-through 


sales 




sales 



This least-squares regression equation is actually a two-dimensional plane, and 
for each possible value of x 1 and x 2 , the estimated value of y lies on the surface of 
the plane. Imagine the data points in Figure 16.1 (page 604) as helium-filled bal- 
loons in a room, with each balloon fastened to the floor at its particular (x v x 2 ) 
location. The length of the string holding each balloon represents the observed 
y value for that individual data point. The height where each string would intersect 
the regression plane is the estimated y value for that combination of x 1 and x 2 
values. Some of the balloons (data points) will be above the plane, others below it. 
As in Chapter 15, the distances between observed and predicted positions are 
residuals, or errors of estimation. Because the least-squares criterion is also used in 
multiple regression, the sum of the squared residuals will be minimized. 

When there are only two variables (y and x), the least-squares equation is a 
straight line. With (y, x v and x 2 ), it is a plane fitted to a three-dimensional scat- 
ter diagram like Figure 16.1. However, when we have more than three variables, 
there is no counterpart to the scatter diagram. With four or more variables, the re- 
gression equation becomes a mathematical entity called a hyperplane. Scatter di- 
agrams in three dimensions are difficult to draw; with four or more dimensions, 
such visual summarization becomes impossible. 

Interpreting the Regression Equation 

The y-intercept (b 0 = —0.2159) in the fast-food regression equation is the esti- 
mated value of y (net profit) whenever x x (counter sales) and x 2 (drive-through 
sales) are both zero. In other words, a franchise with no customers would be esti- 
mated as losing $215,900 for the year. {Caution: None of the 10 franchises in the 
study even approaches having zero sales in both departments, and we must be 



TABLE 16.1 



These portions of the Excel 
and Minitab printouts for 
the fast-food franchise anal- 
ysis show the y-intercept 
and the partial regression 
coefficients for the least- 
squares multiple regression 
equation. The full printouts 
and the procedures for ob- 
taining them are shown at 
the end of this section, in 
Computer Solutions 16.1. 
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FIGURE 16.1 



The scatter diagram for the 
fast-food data can be visu- 
alized as a room where 
each of 10 helium-filled bal- 
loons is held to the floor by 
a string. The length of each 
string is the observed value 
of y for that data point. The 
least-squares regression 
equation (not shown) 
passes through the data 
and takes the form of a 
two-dimensional surface, 
or plane. 



The height of this 
"balloon" is y = $0.8 
million, the net profit 
for franchise number 2. 



o 



o 



o 



o 



o 



o 



Franchise number 2 had 
counter sales of 
JCj = $3.3 million, 
drive-through sales 
of %2 = $4.5 million. 



O 0 



o 



x 2 = Drive- 
through 
sales 



o 



x 1 = Counter 
sales 



skeptical of estimates based on x values that lie beyond the range of the underly- 
ing data.) 

The partial regression coefficient for x x (b 1 = 0.0855) indicates that, for a 
given level of drive-through sales, the estimated net profit will increase by 
$0.0855 for each additional $1.00 of counter sales. 

The partial regression coefficient for x 2 (b 2 = 0.1132) shows that, for a given 
level of counter sales, the estimated net profit will increase by $0. 1 132 for each addi- 
tional $1.00 of drive-through sales. Both partial regression coefficients are positive, 
reflecting that estimated net profit will increase regardless of whether an additional 
$1.00 of sales happens to occur over the counter or at the drive-through window. 

Point Estimates Using the Regression Equation 

Point estimates are made by substituting a set of x values into the regression equa- 
tion and calculating the estimated value of y. For example, if a franchise had sold 
x 1 = $5.0 million over the counter and x 2 = $7.4 million at the drive-through, 
we would estimate its net profit as $1.05 million: 



The estimated 
net profit for 
the franchise 



$5.0 million 
in counter 
sales 



$7.4 million in 
drive-through sales 



y = -0.2159 + 0.0855(5.0) + 0.1132(7.4) = $1.05 million 
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Computer Solutions 16.1 shows the procedures and results when we use Excel 
and Minitab to perform a multiple regression analysis of the restaurant data. Both 
printouts contain information we will be examining in other sections of the chap- 
ter. For now, just focus on the segments that were shown in Table 16.1. 



CO 



uter solutions 1 G.1 




Multiple Regression 
EXCEL 





E I 


G 


H 


I 


J 


K 


1 


SUMMARY OUTPUT 












2 














3 


Regression Statistics 












4 


Multiple R 


0.8786 












5 


R Square 


0.7719 












6 


Adjusted R Square 


0.7067 












7 


Standard Error 


0.2419 












8 


Observations 


10 












9 
















10 


ANOVA 














11 




df 


SS 


MS 


F 


Significance F 




12 


Regression 


2 


1 .3864 


0.6932 


1 1 .8448 


0.0057 




13 


Residual 


7 


0.4096 


0.0585 








14 


Total 


9 


1 .7960 










15 
















16 




Coefficients 


Standard Error 


f Stat 


P-value 


Lower 95% 


Upper 95% 


17 


Intercept 


-0.2159 


0.2643 


-0.8169 


0.4409 


-0.8409 


0.4091 


18 


$Counter 


0.0855 


0.0438 


1.9516 


0.0920 


-0.0181 


0.1890 


19 


SDriveup 


0.1132 


0.0385 


2.9368 


0.0218 


0.0220 


0.2043 



Excel Multiple Regression 



1. 



2. 



The data for the primary example in this chapter are in file CX16REST.XLS. The profits ($Prof it) are in A2:A1 1, the 
counter sales (SCounter) are in B2:B1 1, the drive-through sales (SDriveup) are in C2:C1 1, and the labels are in the first 
rows. Click Tools. Click Data Analysis. Select Regression. Click OK. 

Enter A1:A11 into the Input Y Range box. Enter B1:C11 into the Input X Range box. Select Labels. Enter E1 into 
the Output Range box. Click OK. After slight adjustments to the column widths and the number of decimal places, 
the results are as shown above. 



MINITAB 

Regression Analysis: $Profit versus SCounter, SDriveup 

The regression equation is 

SProfit = - 0.216 + 0.0855 SCounter + 0.113 SDriveup 

Predictor Coef SE Coef T P 

Constant -0.2159 0.2643 -0.82 0.441 

SCounter 0.08547 0.04380 1.95 0.092 

SDriveup 0.11315 0.03853 2.94 0.022 



(continued) 
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S = 0.241912 R-Sq = 77.2% R-Sq(adj) = 70.7% 
Analysis of Variance 



Source 
Regressi on 
Residual Error 
Total 



DF 
2 
7 
9 



SS 

1.38635 
0.40965 
1.79600 



MS 

0.69318 
0.05852 



F 

11.84 



Source 
SCounter 
$Dri veup 



DF 
1 
1 



Seq SS 
0.88160 
0.50475 



Unusual Observations 
Obs SCounter SProfit Fit 
8 8.6 1.3000 0.9039 



SE Fit 
0.1468 



Resi dual 
0.3961 



P 

0.006 



St Resid 
2.06R 



R denotes an observation with a large standardized residual. 

Using file CX16REST.MTW, with $Profit in C1, SCounter in C2, and $Driveup in C3: Click 
Stat. Select Regression. Click Regression. Enter C1 (SProfit) into the Response box. Enter 
C2-C3 (SCounter and SDriveup) into the Predictors box. Click OK. 





16.5 Explain each of the terms in the multiple regression 
model. 

16.6 What assumptions are required in using the multiple 
regression model? 

16.7 In simple linear regression, the regression equation 
is a straight line. In multiple regression, what geometric 
form is taken by the regression equation when there are 
two independent variables? When there are three or more 
independent variables? 

16.8 For the multiple regression equation y = 100 + 
20xj — 3x 2 + 120*3: 

a. Identify the y-intercept and partial regression 
coefficients. 

b. If x 1 = 12, x 2 = 5, and x 3 = 10, what is the 
estimated value of y? 

c. If x 3 were to increase by 4, what change would be 
necessary in x 2 in order for the estimated value of y 
to remain unchanged? 

16.9 Last year's utility bill (in dollars) for a sample of 
homes is estimated by the multiple regression equation 

y = 300 + 7x 1 + 13x 2 , where x 1 is the number of occu- 
pants and x 2 is the number of rooms in the home. 

a. Identify the y-intercept and partial regression 
coefficients. 

b. If 3 people live in a 6-room home, what is the 
estimated bill? 



1 6. 1 0 A study describes operating costs for comparable 
manufacturing plants operating in 1 5 different U.S. 
metropolitan areas. From the costs described in the article, 
the following multiple regression equation was developed: 
y= —0.1 + l.lxj + 2.8x 2 , withy= total operating cost, x^ = 
labor cost, and x 2 = electric power cost. All costs are in mil- 
lions of dollars and on an annual basis, source: Mary Pembleton, 
"Cleveland Is No. 1— In Factory Costs," USAToday, February 22, 1988, p. 6B. 

a. Interpret the y-intercept and the partial regression 
coefficients. 

b. If labor costs $6 million and electric power costs 
SO. 3 million, what is the estimated annual cost to 
operate the plant? 

/ data set/ Note: Exercises 16.11-16.14 require a 
computer and statistical software. 

16.1 1 The owner of a large chain of health spas has selected 
eight of her smaller clubs for a test in which she varies the 
size of the newspaper ad and the amount of the initiation fee 
discount to see how this might affect the number of prospec- 
tive members who visit each club during the following week. 
The results are shown in the table on the next page. 

a. Determine the least-squares multiple regression equation. 

b. Interpret the y-intercept and partial regression 
coefficients. 

c. What is the estimated number of new visitors to a 
club if the size of the ad is 5 column-inches and a $75 
discount is offered? 
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New Ad Column- Discount 

Club Visitors, y Inches, x. Amount, x 2 



1 23 4 $100 

2 30 7 20 

3 20 3 40 

4 26 6 25 

5 20 2 50 

6 18 5 30 

7 17 4 25 

8 31 8 80 



16.12 In testing 9 sedans, Car and Driver magazine rated 
each on 13 different characteristics, including ride, han- 
dling, and driver comfort. Each vehicle also received an 
overall rating. Scores for each vehicle were as follows. 
source: Frank Markus, "Middle-Management Motivators," Car and Driver, 
February 1997, p. 99. 

y= * 3 = 

Overall x- = x, = Driver 



Car Rating Ride Handling Comfort 



1 83 8 7 7 

2 86 8 8 8 

3 83 6 8 7 

4 83 8 7 9 

5 95 9 9 9 

6 84 8 8 9 

7 88 9 6 9 

8 82 7 8 7 

9 92 8 9 8 



a. Determine the least-squares multiple regression 
equation. 

b. Interpret the y-intercept and partial regression 
coefficients. 

c. What is the estimated overall rating for a vehicle that 
scores 6 on ride, 9 on handling, and 7 on driver comfort? 

16.13 The makers of a pie crust mix have baked test pies 
and collected the following data on y = amount of crisp- 
ness (a rating of 1 to 100), Xj = minutes in the oven, and 
x 2 — oven temperature (degrees Fahrenheit). 



y 


x 1 


X 2 


68 


6.0 


460 


76 


8.9 


430 


49 


8.8 


360 


99 


7.8 


460 


90 


7.3 


390 


32 


5.3 


360 


96 


8.8 


420 


77 


9.0 


350 


94 


8.0 


450 


82 


8.2 


400 


97 


6.4 


450 



a. Determine the least-squares multiple regression 
equation. 
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b. Interpret the y-intercept and partial regression 
coefficients. 

c. What is the estimated crispness rating for a pie that is 
baked for 5 minutes at 200 degrees? 

16.14 The following data have been reported for a sam- 
ple of 10 major U.S. zoological parks: source: World Almanac 
and Book of Facts 1997, p. 233. 

x 3 = 
Number 





y = 


x 1 = 


X 2 = 


of 




Budget 


Attendance 


Acres 


Specie 


Albuquerque 


$14.5 million 


0.6 million 


210 


271 


Chicago 


35.0 


2.0 


216 


400 


Dallas 


6.9 


0.4 


70 


377 


Detroit 


9.0 


1.0 


125 


277 


Houston 


6.6 


1.5 


55 


721 


Los Angeles 


17.2 


1.3 


80 


400 


Philadelphia 


15.5 


1.3 


42 


437 


St. Louis 


21.0 


2.5 


91 


759 


San Francisco 


12.0 


0.9 


125 


270 


Seattle 


9.6 


1.1 


92 


260 



a. Determine the least-squares multiple regression 
equation. 

b. Interpret the y-intercept and partial regression 
coefficients. 

c. What is the estimated budget for a zoological park 
that draws an annual attendance of 2.0 million, 
occupies 150 acres, and has 600 species? 

16.15 During its manufacture, a product is subjected to 
four different tests, each giving a score of 200 to 800. An 
efficiency expert claims the fourth test to be unnecessary 
since its results can be predicted based on the first three. 
The following Minitab printout is an analysis of scores re- 
ceived by a sample of 12 units subjected to all four tests. 

Regression Analysis 

The regression equation is 



TEST04 = 12 


.0 + 0.274 


TEST01 + 0. 


376 TEST02 


+ 0.326 TEST03 


Predictor 


Coef 


StDev 


T 


P 


Constant 


11.98 


80.50 


0.15 


0.885 


TEST01 


0.2745 


0 . 1111 


2.47 


0.039 


TEST02 


0.37619 


0.09858 


3.82 


0.005 


TEST03 


0.32648 


0.08084 


4.04 


0.004 


S = 52.72 


R-Sq 


= 87.2% 


R-SqCadj) = 


= 82.4% 


Analysis of Variance 








Source 


DF 


SS 


MS 


F P 


Regression 


3 


151417 


50472 


18.16 0.001 


Error 


8 


22231 


2779 




Total 


11 


173648 







a. Interpret the multiple regression equation. 

b. An individual unit from the assembly area receives 
ratings of 350, 400, and 600 on the first three tests. 
Using the estimation equation in the printout, what is 
the unit's estimated score on test 4? 
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1 6.3 INTERVAL ESTIMATION IN MULTIPLE REGRESSION 



Given a specific combination of x values, the multiple regression equation pro- 
vides our single best estimate for both the mean of y and an individual y value for 
that set of x values. However, as in simple linear regression, the estimate is subject 
to uncertainty. The first step in quantifying this uncertainty is to determine the 
multiple standard error of estimate. 

The Multiple Standard Error of Estimate 

The multiple standard error of estimate (s e ) is analogous to the standard error of 
estimate in Chapter 15, and it will be part of the output when the regression anal- 
ysis is computer assisted. The numerical value of s e reflects the amount of scatter, 
or dispersion, of data points about the plane (or hyperplane) represented by the 
multiple regression equation. For two independent variables, it describes the 
amount of scatter above and below the two-dimensional surface passing through 
data points like those shown in Figure 16.1. Regardless of the number of indepen- 
dent variables, the multiple standard error of estimate can be calculated as 

The multiple standard error of estimate: 



s e = y — i-J — where y i = each observed value of y in the data 

n ~ k — 1 y- . = th e value of y that would have been 

estimated from the regression equation 
n = the number of data points 
k = the number of independent (x) variables 

The numerator of the term beneath the square root symbol is the sum of the 
squared residuals, also known as the error sum of squares, or SSE. For the fast- 
food example and its multiple regression equation, the calculation of the error 
sum of squares (SSE) is demonstrated in Table 16.2. Each row of the table repre- 
sents one of the franchises in the sample and compares the actual net profit (y) 
with the predicted value (y). When the resulting residuals (errors) are squared and 
totaled, the error sum of squares is SSE = 0.4096. 

For our example, there are k = 2 independent variables and n = 10 data 
points. Having calculated 2(y ; — y,) 2 , the error sum of squares, as SSE = 0.4096, 
the multiple standard error of estimate can be calculated as 

•. - vis - v^?f^ - « - »»» — 

Just as b Q , b v and b 2 are sample estimates of their counterparts in the "true" 
population regression equation, E(y) = /3 Q + (3 1 x l + /3 2 x 2 , s e is our sample esti- 
mate of the actual multiple standard error of estimate, cr e . 

Confidence Interval for the Conditional Mean of y 

In Chapter 15, the least-squares regression line passed through the point (x, y), 
and estimation intervals were shown to become wider when based on x values 
more distant from x. The estimation formulas in Chapter 15 took into account 
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Counter 
Sales, 



Drive- 
Through 
Sales, 



Net Profit 



Actual, Predicted, 



Error, or 
Residual, 



Error, or 
Residual, 



Actual y versus 
the Mean of y 



x 1 


X 2 


y 


y 


(y-y)' 


(y-y) 2 ' 


(y -y) (y - y) 2 


$8.4 


$7 


.7 


$1 .5 


$1.3739 


$0.1261 




0.0159 


$0.62 


0.3844 


3.3 


4 


.5 


0.8 


0.5757 


0.2243 




0.0503 


-0.08 


0.0064 


5.8 


g 


4 


1.2 


1.2309 


-0.0309 




0 001 0 


0 32 


0 1 024 


10.0 


7 


.8 


1.4 


1.5221 


-0.1221 




0.0149 


0.52 


0.2704 


4.7 


2 


.4 


0.2 


0.4576 


— 0.2576 




0.0664 


-0.68 


0.4624 


7.7 


4 


.8 


0.8 


0.9858 


-0.1858 




0.0345 


-0.08 


0.0064 


4.5 


2 


.5 


0.6 


0.4519 


0.1481 




0.0219 


-0.28 


0.0784 


8.6 


3 


.4 


1.3 


0.9043 


0.3957 




0.1566 


0.42 


0.1764 


5.9 


2 


.0 


0.4 


0.5150 


-0.1150 




0.0132 


-0.48 


0.2304 


6.3 


4 


.1 


0.6 

t 


0.7869 


-0.1869 




0.0349 


-0.28 


0.0784 






y = 


$0.88 








0.4096 




1.7960 




I 



TABLE 16.2 



For the sample of 1 0 fast- 
food franchises, the 
summary results from 
these calculations are 
important in determining 
both the multiple standard 
error of estimate and the 
coefficients of multiple 
correlation and determina- 
tion. (Data are in millions 
of dollars.) 



2(y - J/) 2 ; this is the residual 2(y _ y)2. this is the tota | sum 

(error) sum of squares, or SSE. It is of squa res, or SST. It is the total 

the amount of variation in y that is amount of variation in y, both 

not explained by the multiple re- explained and unexplained, 
gression equation. 



the difference between x and x, and increased the interval widths accordingly as 
x values got farther away from the mean of x. 

As in simple regression, the least-squares multiple regression equation passes 
through the point containing the means of all the variables. (To verify this, substi- 
tute the means [y = $0.88, x 1 = $6.52, and x 2 = $4.76] into the fast-food 
multiple regression equation.) Likewise, interval estimates become wider as the 
specified x values get farther away from their respective means. 

Most computer statistical packages increase interval widths according to how 
far away the various x values are from their means. However, if your package 
does not have this capability, such calculations are nearly impossible to carry out 
by hand. Also, you may have only summary information (e.g., n, s e , and the re- 
gression equation itself) from an analysis performed by someone else. In either 
case, all is not lost. You can still construct an approximate estimation interval 
using the methods in this section and the next. 

Approximate confidence interval for the conditional mean of y: 

the estimated value of y based on the set of x values 
provided 

t value from the t distribution table for the desired 
confidence level and df = n — k — 1 
(n = number of data points, k = number of 
x variables) 

the multiple standard error of estimate 



S e 

y — t—/= where y 
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example 



Confidence Interval 

For franchises having counter sales of $6.0 million and drive-through sales of 
$4.0 million, what is the approximate 95% confidence interval for their average 
net profit? 

SOLUTION 

For «j = $6.0 and x 2 = $4.0, y = -0.2159 + 0.0855(6.0) + 0.1132(4.0), or 
$0.7499 million. This is the midpoint of the approximate confidence interval for 
the mean of y for these x values. As in the previous example, the number of de- 
grees of freedom will ben — k — 1, or df = 7. The multiple standard error of es- 
timate is s e = $0.2419 million, and the t value for 95% confidence and df =7 will 
be 2.365. The approximate confidence interval for the mean of y is 

y ± t^f= = 0.7499 ± 2.365 = 0.7499 ± 0.1809, 

or from $0.5690 to $0.9308 million 

We have 95% confidence that the average net profit for stores having $6.0 
million in counter sales and $4.0 million in drive-through sales will be within the 
approximate confidence interval bounded by $0.5690 and $0.9308 million. The 
width of this interval is 0.9308 — 0.5690, or $0.3618 million. Because the interval 
is based on x values that differ from their respective means, the exact 95% confi- 
dence interval will be wider than the $0.3618 million calculated here. 



Prediction Interval for an Individual y Observation 

As with the confidence interval, we can construct an approximate prediction in- 
terval for an individual y observation. Such intervals can be approximated by 
using the t distribution, the multiple standard error of estimate, and y as the point 
estimate for y. 

Approximate prediction interval for an individual y observation: 

y ± ts e where y = the estimated value of y based on the set of x values 
provided 

t = t value from the t distribution table for the desired 
confidence level and df = n — k — 1 
(n = number of data points, k = number of x variables) 
s e = the multiple standard error of estimate 



Example 



Prediction Interval 

The president of the fast-food chain believes that a new franchise being planned 
for construction in a suburban neighborhood would have experienced $6.0 mil- 
lion in counter sales and $4.0 million in drive-through sales had it been in 
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operation during the preceding year. For these sales figures, what would be the ap- 
proximate 95% prediction interval for the store's net profit? 

SOLUTION 

The midpoint of the approximate prediction interval will be the point estimate of 
net profit for the counter sales (x 1 = $6.0 million) and drive-through sales 
(% 2 = $4.0 million) specified. The point estimate is 

y = -0.2159 + 0.0855(6.0) + 0.1132(4.0) = $0.7499 million 

For n = 10 data points and k = 2 independent variables, the number of de- 
grees of freedom associated with t will be df = n — k — 1, or df = 7. Referring to 
the t distribution table, t = ±2.365 includes 95% of the area beneath the curve. 
The approximate 95% prediction interval is 

y ± ts e = 0.7499 ± 2.365(0.2149) = 0.7499 ± 0.5721, 
or from $0.1778 to $1.3220 million 

We have 95% confidence that this store's net profit would have been within 
the approximate prediction interval bounded by $0.1778 and $1.3220 million, an 
interval width of 1.3220 - 0.1778, or $1.1442 million. Since x 1 = $6.52 million 
and x 2 = $4.76 million (as noted earlier), this approximate prediction interval is 
based on x values that differ from their respective means. As such, the exact pre- 
diction interval will be slightly wider than the $1.1442 million calculated here. 



Computer Solutions 16.2 shows the procedures and results when we use Excel 
and Minitab to generate confidence and prediction intervals like those obtained 
here. Because the values for x 1 and x 2 on which the intervals are based are not the 
sample means for the two independent variables, the exact intervals produced by 
Excel and Minitab are slightly wider than their more approximate counterparts 
that were determined using the methods described in this section. 



16.16 Given the printout in Exercise 16.15, determine 
the approximate 

a. 90% confidence interval for the mean rating on test 4 
for units that have been rated at 300, 500, and 400 
on the first three tests. 

b. 90% prediction interval for the rating a unit will get 
on test 4 if its ratings are 300, 500, and 400 on the 
first three tests. 

16.17 The regression equation y = 5.0 + l.Oxi + 2.5x 2 
has been fitted to 20 data points. The means of x 1 and x 2 
are 25 and 40, respectively. The sum of the squared 
differences between observed and predicted values of 

y has been calculated as SSE = 173.5, and the sum of 
squared differences between y values and the mean 
of y is SST = 520.8. Determine the following: 



a. The mean of the y values in the data. 

b. The multiple standard error of estimate. 

c. The approximate 95% confidence interval for the 
mean of y whenever x 1 = 20 and x 2 = 30. 

d. The approximate 95% prediction interval for an 
individual y value whenever x t = 20 and x 2 = 30. 

/ data Set / Note: Exercises 16.18-16.22 require a 
computer and statistical software. If possible with your 
statistical software, determine the exact, rather than the 
approximate, intervals. 

16.18 In testing eight value notebook computers, PC 
World reported various physical and performance 
measures of each. Data included overall rating, street 
price, PC WorldBench 4 performance score, and battery 
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computer solutions 16.2 






» 
I 





Interval Estimation in Multiple Regression 



EXCEL 





A B 


C 


D 


1 


Prediction Interval 






2 










3 






$Profit 




4 


Predicted value 


0.750 




5 










6 


Prediction Interval 






7 


Lower limit 


0.146 




8 


Upper limit 


1.353 




9 










10 


Interval Estimate of Expected Value 


11 


Lower limit 


0.558 




12 


Upper limit 


0.941 





Excel Interval Estimation in Multiple Regression 

1 . Using file CX16REST.XLS, with data in the fields specified in Computer Solutions 16.1, select two convenient cells (e.g., 
A15:B15) and enter the specified values for the two independent variables — i.e., enter 6 into A15 and 4 into B15. 

2. Click Tools. Click Data Analysis Plus. Select Prediction Interval. Click OK. 

3. Enter A1:A11 into the Input Y Range box. Enter B1:C11 into the Input X Range box. Select Labels. Enter A15:B15 
into the Given X Range box. Enter the desired confidence level as a decimal fraction (e.g., 0.95) into the Confidence 
Level (1 - Alpha) box. Click OK. 

MINI TAB 

Minitab Interval Estimation in Multiple Regression 

Predicted Values for New Observations 
New 

Obs Fit SE Fit 95% CI 95% PI 

1 0.7496 0.0811 (0.5578, 0.9413) (0.1462, 1.3529) 

Values of Predictors for New Observations 

New 

Obs SCounter $Driveup 
1 6.00 4.00 

Using file CX16REST.MTW, follow the procedure in Computer Solutions 16.1, but select Options. Enter 6 4 (include the 
space between them) into the Prediction intervals for new observations box. Enter the desired confidence level as a 
percentage (e.g., 95) into the Confidence level box. Click OK. Click OK. In addition to the printout in Computer 
Solutions 16.1, Minitab will generate the items shown here. 



Chapter 16: Multiple Regression and Correlation 



613 



lite, with the results 


shown 


here, source: 


"Top 1 5 Notebook 


PCs," PC World, July 2003 


, p. 135. 








Computer: 


1 


2 


3 


4 


Overall Rating: 


80 


80 


76 


75 


Street Price: 


1024 


949 


1499 


1478 


Performance Score: 


102 


96 


1 1 1 


104 


Battery Life (hours): 


3.03 


2.97 


2.00 


3.05 


Computer: 


5 


6 


7 


8 


Overall Rating: 


75 


74 


72 


72 


Street Price: 


1857 


1865 


1699 


1298 


Performance Score: 


100 


98 


104 


91 


Battery Life (hours): 


1.88 


3.00 


2.38 


2.33 



a. From these data, obtain the multiple regression equa- 
tion estimating overall rating on the basis of the 
other three variables; then determine and interpret 
the approximate 95% confidence interval for the 
mean overall rating of notebook computers having a 
$1000 street price, a performance score of 100, and 
a 2.50-hour battery life. 

b. Determine the 95% prediction interval for the overall 
rating of an individual notebook computer with a 
$1000 street price, a performance score of 100, and a 
2.50-hour battery life. 

16.19 In considering whether to implement a program 
where selected entering freshmen would be required to 
take a precalculus refresher course before moving on to 
calculus, a university's business college has given mathe- 
matical proficiency tests to a sample of entering freshmen. 
The college hopes to use this score and the Scholastic 
Aptitude Test (SAT) quantitative score in predicting how 
well a student would end up scoring on the standardized 
final exam in calculus later in his or her studies. Data are 
available for a sample of nine entering freshmen, and 
their scores are shown here. 



Student: 


1 


2 


3 


4 


5 


Math Proficiency: 


72 


96 


68 


86 


70 


SAT, Quantitative: 


462 


545 


585 


580 


592 


Calculus Final: 


71 


92 


72 


82 


74 



Student: 


6 


7 


8 


9 


Math Proficiency: 


73 


91 


75 


76 


SAT, Quantitative: 


516 


638 


615 


596 


Calculus Final: 


71 


100 


87 


81 



a. From these data, obtain the multiple regression equa- 
tion estimating calculus final exam score on the basis 
of the other two variables; then determine and inter- 
pret the approximate 90% confidence interval for the 
mean calculus final exam score for entering freshmen 
who score 70 on the math proficiency test and 500 
on the quantitative portion of the SAT. 

b. Determine the 90% prediction interval for the calcu- 
lus final exam score for an entering freshman who 
scored 70 on the math proficiency test and 500 on 
the quantitative portion of the SAT. 

16.20 For the multiple regression equation obtained in 
Exercise 16.11, determine the approximate 

a. 95% confidence interval for the mean number of new 
visitors for clubs using a 5 -column-inch ad and offer- 
ing an $80 discount. 

b. 95% prediction interval for the number of new visi- 
tors to a given club when the ad is 5 column-inches 
and the discount is $80. 

16.21 For the multiple regression equation obtained in 
Exercise 16.12, determine the approximate 

a. 95% confidence interval for the mean overall rating 
of cars that receive ratings of 8 on ride, 7 on 
handling, and 9 on driver comfort. 

b. 95% prediction interval for the overall rating of a 
car that has received ratings of 8 on ride, 7 on 
handling, and 9 on driver comfort. 

16.22 For the multiple regression equation obtained in 
Exercise 16.13, determine the approximate 

a. 95% confidence interval for the mean crispness rating 
of pies that are baked 5.0 minutes at 300 degrees. 

b. 95% prediction interval for the crispness rating of a 
pie that is baked 5.0 minutes at 300 degrees. 



MULTIPLE CORRELATION ANALYSIS 1 6.4 



The purpose of the multiple correlation analysis is to measure the strength of 
the relationship between the dependent (y) and the set of independent (x) vari- 
ables. This strength can be measured on an overall basis by the coefficient of mul- 
tiple determination. 
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The Coefficient of Multiple Determination 

The coefficient of multiple determination (R 2 , with 0 s R 2 < 1) is the proportion 
of the variation in y that is explained by the multiple regression equation. Its pos- 
itive square root is the coefficient of multiple correlation (R), a measure that is less 
important than its counterpart from Chapter 15. 

Computation of the coefficient of multiple determination is similar to that for 
the coefficient of determination for simple linear regression, and R 2 can be both 
defined and calculated in terms of the total variation in y versus the unexplained 
variation in y: 



The coefficient of multiple determination: 

^ 2 _ ^ Error variation, which is not explained by the regression equation 

Total variation in the y values 

, 2{y, - y,) 2 , sse 



As in Chapter 15, the total variation in y is the sum of the explained variation 
plus the unexplained (error) variation, or 

Total variation Variation explained by Variation not explained by 
in y values = the regression equation + the regression equation 
(SST) (SSR) (SSE) 

For the fast-food multiple regression equation, both SST (the sum of the 
squared deviations of y from its mean) and SSE (the sum of the squared deviations 
between observed and predicted values of y) were calculated in Table 16.2. Sub- 
stituting SST = 1.7960 and SSE = 0.4096 into the expression above, the coeffi- 
cient of multiple determination is 



SSE 0.4096 

R 2 = 1 =1 = 1 - 0.2281 = 0.7719 

SST 1.7960 

For the 10 franchises, 77.19% of the variation in net profit is explained by 
the multiple regression equation. This value of R 2 is listed in the computer 
output segments in Table 16.3. Note the inclusion of an "adjusted" R 2 that 
takes into account the number of data points and the number of independent 
variables. The formula used in obtaining the adjusted value is shown in the 
footnote. 1 

The printout segments in Table 16.3 list the value of the multiple standard 
error of estimate (sj calculated earlier. They also provide the regression, error 
(residual), and total sums of squares, along with the number of degrees of freedom 
associated with each. The remainder of the analysis of variance table will be dis- 
cussed in the significance tests of Section 16.5. 



Expressed as a proportion, the adjusted R 2 typically included in computer printouts of multiple re- 
gression analyses is calculated as: 

,^,2 , SSE/(n -k-1) 

adjusted R A = 1 t— — — 

SST/(n - 1) 
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E F 


G 


3 


Regression Statistics 




4 


Multiple R 


0.8786 




5 


R Square 


0.7719 




6 


Adjusted R Square 


0.7067 




7 


Standard Error 


0.2419 




8 


Observations 


10 




9 








10 


ANOVA 






11 




df 


SS 


12 


Regression 


2 


1 .3864 


13 


Residual 


7 


0.4096 


14 


Total 


9 


1 .7960 



S = 0.241912 R-Sq = 77.2% R-Sq(adj) = 70.7% 
Analysis of Variance 



TABLE 16.3 



These portions of the Excel 
and Minitab printouts for 
the fast-food franchise 
analysis include the multiple 
standard error of estimate 
and the coefficient of 
multiple determination. 
The full printouts and the 
procedures for obtaining 
them are shown in 
Computer Solutions 16.1. 



Source DF SS MS 

Regression 2 1.38635 0.69318 

Residual Error 7 0.40965 0.05852 

Total 9 1.79600 



F 

11.84 



P 

0.006 



EXEfCISES 



16.23 What is the coefficient of multiple determination, 
and what does it tell us about the relationship between y 
and the independent variables? 

16.24 Among the results of a multiple regression analysis 
are the following sum-of-squares terms: SST, SSR, and 
SSE. What does each term represent, and how do the 
terms contribute to our understanding of the relationship 
between y and the set of independent variables? 

16.25 Given the printout in Exercise 16.15, identify the 
coefficient of multiple determination and interpret its 
value in terms of that exercise. 



/ data Set / Note: Exercises 16.26 and 16.27 require 
a computer and statistical software. 

16.26 For the multiple regression equation obtained in 
Exercise 16.12, what is the coefficient of multiple 
determination and exactly what does it mean? 

16.27 For the multiple regression equation obtained in 
Exercise 16.11, what is the coefficient of multiple 
determination and exactly what does it mean? 



SIGNIFICANCE TESTS IN MULTIPLE 1 6.5 

REGRESSION AND CORRELATION 



In the simple linear regression (y= b 0 + b x x) of Chapter 15, several tests were 
available regarding the relationship between the variables, but their results were 
the same. With just one independent variable, concluding that the population 
slope could be zero was equivalent to concluding that the population coefficient of 
correlation could be zero. However, in multiple regression there are two or more 
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independent variables, and it is necessary to separately test (1) the overall signifi- 
cance of the multiple regression equation and (2) each of the partial regression 
coefficients in the equation. 



Testing the Significance of the Regression Equation 

The multiple regression equation, y = b 0 + b 1 x l + b 2 x 2 + • • ■ + b k x k , is based on 
sample data and is an estimate of the (unknown) population multiple regression 
equation, E(y) = /3 0 + fi 1 x 1 + P 2 x 2 + ••• + P k X k . If there really is no relationship 
between y and any of the x variables, all of the partial regression coefficients 
(/3 l5 j6 2 , /3^) in the population regression equation will be zero, and this is the 
basis on which we test the overall significance of our regression equation. 

The test uses analysis of variance in examining the "explained" or regression 
sum of squares (SSR) versus the "unexplained" or error sum of squares (SSE). 
Some computer statistical packages carry out the entire test and report a p-value 
for its significance. Others may provide summary information (e.g., SSR, SSE, 
SST, and their degrees of freedom) that can be used in making the final calcula- 
tions and interpreting the results. In any event, the procedure is as follows. 

ANOVA test for the overall significance of the multiple regression equation: 

• Null hypothesis 

H 0 : j3j = /3 2 = •■• = = 0 The regression equation is not 

significant. 

• Alternative hypothesis 

One or more of the /3 ; (i = 1, 2, k) ¥= 0 The regression equa- 
tion is significant. 

• Test statistic 

F = — — — : — where SSR = regression sum of squares 

SSEKn - k — 1) CCE . £ 

v ' SSE = error sum of squares 

n = number of data points 

k = number of independent (%) variables 

• Critical value of the F statistic 

F for significance level specified and df (numerator) = k, 
and df (denominator) = n — k — 1 

In the ANOVA test, the number of degrees of freedom associated with the nu- 
merator of the F statistic will be k, the number of independent variables used in 
estimating y. For the denominator, df = n — k — 1, oin — (k + 1), since (k + 1) 
quantities (b 0 , along with b v b 2 , b k ) have been determined on the basis of 
sample data. 

For the fast-food franchise data, we have calculated SST = 1.7960 and SSE = 
0.4096. Since SST = SSR + SSE, SSR will be 1.7960 - 0.4096, or SSR = 1.3864. 
There are n = 10 data points and k = 2 independent variables in our analysis, so 
the test statistic can be calculated as 

SSR/k 1.3864/2 0.6932 

- 11.85 



SSE/(n - k - 1) 0.4096/(10 - 2 - 1) 0.0585 
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TABLE 16.4 





E 


F 


G 


H 


I 


J 


K 


10 


ANOVA 














11 




df 


SS 


MS 


F 


Significance F 




12 


Regression 


2 


1.3864 


0.6932 


1 1 .8448 


0.0057 




13 


Residual 


7 


0.4096 


0.0585 








14 


Total 


9 


1.7960 










15 
















16 




Coefficients 


Standard Error 


f Stat 


P- value 


Lower 95% 


Upper 95% 


17 


Intercept 


-0.2159 


0.2643 


-0.8169 


0.4409 


-0.8409 


0.4091 


18 


$Counter 


0.0855 


0.0438 


1.9516 


0.0920 


-0.0181 


0.1890 


19 


$Driveup 


0.1132 


0.0385 


2.9368 


0.0218 


0.0220 


0.2043 



MINITAB 



Predi ctor 


Coef 


SE Coef 




T 




P 


Constant 


-0.2159 


0.2643 


-0 


82 


0 


441 


SCounter 


0.08547 


0.04380 


1 


95 


0 


092 


$Dri veup 


0.11315 


0.03853 


2 


94 


0 


022 



S = 0.241912 R-Sq = 77.2% R-Sq(adj) = 70.7% 
Analysis of Variance 



Source DF SS MS F P 

Regression 2 1.38635 0.69318 11.84 0.006 

Residual Error 7 0.40965 0.05852 

Total 9 1.79600 



These portions of the Excel and Minitab printouts for the fast-food franchise show the results of tests 
for (1) the overall significance of the regression equation and (2) the significance of each of the partial 
regression coefficients. The full printouts and the procedures for obtaining them are shown in 
Computer Solutions 16.1. 



Using the F distribution tables to identify the critical value for F, the number 
of degrees of freedom for the numerator will be k = 2; J/' for the denominator will 
be (n — k — 1), or (10 — 2 — 1) = 7. Conducting the test at the 0.01 level of sig- 
nificance, the critical value is F = 9.55. The calculated test statistic (F = 11.85) 
exceeds the critical value, and H 0 is rejected. At the 0.01 level, the multiple regres- 
sion equation is significant. 

Table 16.4 shows the Excel and Minitab printout segments applicable to this 
test. The two printouts show the p-value for the overall significance of the equa- 
tion as 0.0057 and 0.006, respectively. 



Testing the Partial Regression Coefficients 

In testing the significance of each partial regression coefficient (b v b 2 , and so on), 
the null hypothesis is that the corresponding population value is really zero — i.e., 
that the observed value differs from zero merely by chance. The test is a two-tail 
Mest with the following format. 
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N 


O 


T E 


S 



t-test for the significance of a partial regression coefficient, b-. 

• Null hypothesis 

H Q : /3 ; = 0 The population coefficient is 0. 

• Alternative hypothesis 

H-r. /6 ; ¥= 0 The population coefficient is not 0. 

• Test statistic 

b s - 0 



t = — ^ where b ; = the observed value of the regression 

b 

' coefficient 

Sy = the estimated standard deviation of b i 

1. We can also test H Q : (3 i = /3- 0 versus H^. /3 ( # j8- 0 where /3 i0 is any value of in- 
terest. With the exception of jS ;0 replacing zero, the test is the same as described 
here. 

2. Critical values of t are ± t for the level of significance desired and df = n — k — 1 
(n = number of data points, k = number of independent variables). 

The printout segments in Table 16.4 list the value and the standard deviation 
for each partial regression coefficient in the fast-food regression equation. In 
demonstrating these tests, we will use the number of decimal places shown in the 
Minitab portion of Table 16.4, and the 0.05 level of significance. 

1. Test of the null hypothesis, fi Q : y3 1 = 0 versus H^. /3 1 ¥= 0. 

b x - 0 0.08547 - 0 



Calculated test statistic, t 



0.04380 



1.95 



2. 



For df = n — k — 1, or (10 — 2 — 1) = 7, critical values for the 0.05 level 
of significance are t = —2.365 and t = +2.365. The calculated t (1.95) 
falls within these limits, and we are unable to reject the null hypothesis that 
[3 1 = 0. At this level, b t = 0.08547 does not differ significantly from zero. 
Had the test been done at the 0.10 level (with critical values of t = —1.895 
and t = +1.895), the conclusion would be reversed, and the null hypothesis 
would have been rejected. 

From our t distribution table, the most accurate statement we can make 
about the exact f-value for the test is that it is between 0.05 and 0.10. As 
shown in Table 16.4, the actual p-value is 0.092. If there were no linear rela- 
tionship between profit and counter sales (i.e., /3 1 = 0), there would be just a 
0.092 probability of obtaining a sample b^ this different from zero. 
Test of the null hypothesis, H Q : /3 2 = 0, versus Hf. /3 2 + 0. 



Calculated test statistic, t 



0 0.11315 - 0 



0.03853 



2.94 



As with the preceding test, df = n — k — 1, or 7, and critical values for the 
0.05 level of significance are t = —2.365 and t = +2.365. The calculated t ex- 
ceeds these limits, and we are able to reject the null hypothesis that /3 2 = 0. At 
this level, b 2 = 0.11315 differs significantly from zero. Had the test been done 
at the more demanding 0.02 level (having critical t values of£ = —2.998 and 
t = +2.998), the conclusion would have been reversed, and the null hypothe- 
sis would not have been rejected. 
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From the t distribution table in Appendix A, the most accurate statement 
we can make about the exact p-value for this test is that it is between 0.05 and 
0.02. As shown in Table 16.4, the actual p-value is 0.022. If there were no lin- 
ear relationship between profit and drive-through sales (i.e., /6 2 = 0), there 
would be just a 0.022 chance of obtaining a sample b 2 this different from 
zero. 

We have just duplicated the procedure used by Excel and Minitab to test each 
of the partial regression coefficients. As expected, our results are consistent with 
those produced by the computer. The printouts in Table 16.4 go beyond our text 
discussion by including a i-test and p-value for the constant in the regression equa- 
tion. The y-intercept, b Q = —0.2159 does not differ very significantly from zero 
(p-value = 0.441). As noted in Chapter 15, testing the y-intercept is generally not 
of practical importance. 

Interval Estimation for the Partial 
Regression Coefficients 

In addition to playing a role in significance testing, the estimated standard deviation 
of bj is used in constructing a confidence interval for its population counterpart jS^ 
The method, which is similar to the procedure used in Chapter 15, is described 
below. 

Confidence interval for a partial regression coefficient, /3 ( : 
The interval is 

b t ± t\s b | where b i = partial regression coefficient in the 
sample regression equation 
t = t value for the confidence level desired, 
and df = n — k — 1 
s b = estimated standard deviation of b i 

Both bj and the estimated standard deviation of b i will typically be included in 
the printout whenever the regression analysis is done with the assistance of a com- 
puter statistical package. Applying these to our fast-food example, we will con- 
struct the 95% confidence intervals for fi l and jS ? . 

1. 95% confidence interval for j8 1 . Referring to the Minitab portion of Table 16.4, 
b x (the partial regression coefficient for x v or counter sales) is 0.08547, and the 
standard deviation of by is 0.04380. There are n = 10 observations in the data 
and k = 2 independent variables. For df = n — k — 1, or 7 degrees of freedom, 
the value of t for a 95% confidence interval is t = 2.365. The confidence interval 
will be 

b x ± tls bi ) = 0.08547 ± 2.365(0.04380) 

= 0.08547 ± 0.10359, or from -0.01812 to 0.18906 

We have 95% confidence that {3 1 is somewhere from —0.01812 to 
0.18906. Note that ^3-^ = 0 falls within the interval. This is consistent with our 
earlier finding that, at the 0.05 level, b x = 0.08547 is not significantly different 
from zero. As we've seen before, an x% confidence interval and a nondirec- 
tional significance test at the (100 — x)/l00 level will give the same conclusion 
regarding a hypothesized value for a population parameter. 
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2. 95% confidence interval for /S 2 . Referring to the Minitab portion of Table 
16.4, b 2 (associated with drive-through sales) is 0.11315, and the standard de- 
viation of b 2 is 0.03853. With n = 10 observations and k = 2 independent 
variables, df = n — k — 1, or 7 degrees of freedom, and the value of t for the 
95% confidence interval will be t = 2.365. The confidence interval is 

b 2 ± t(s b J = 0.11315 ± 2.365(0.03853) 

= 0.11315 ± 0.09112, or from 0.02203 to 0.20427 

We have 95% confidence that /3 2 is somewhere from 0.02203 to 
0.20427. Note that /3 2 = 0 does not fall within the interval. This is consistent 
with our earlier finding that, at the 0.05 level, b 2 = 0.11315 differs signifi- 
cantly from zero. 

If you're using Excel, confidence intervals for the partial regression coeffi- 
cients are included in the output, as shown in Table 16.4. Unless you specify oth- 
erwise, 95% confidence intervals are provided. In the section exercises that 
follow, Excel users can readily obtain many of the requested confidence intervals 
without the need for further calculations. If you wish, you can use a pocket calcu- 
lator and the techniques in this section to verify the confidence intervals your 
Excel spreadsheet package has so thoughtfully carried out for you. 




16.28 What is the relationship between (a) the results 

of the hypothesis test examining whether fc 3 differs signif- 
icantly from zero at the 0.02 level and (b) the 98% confi- 
dence interval for /3 3 ? 

16.29 For the multiple regression equation obtained in 
Exercise 16.11: 

a. At the 0.05 level, is the overall regression equation 
significant? 

b. Use the 0.05 level in concluding whether each partial 
regression coefficient differs significantly from zero. 

c. Interpret the results of the preceding tests in the con- 
text of the variables described in that exercise. 

d. Construct a 95% confidence interval for each partial 
regression coefficient in the population regression 
equation. 

16.30 For the multiple regression equation obtained in 
Exercise 16.12: 

a. At the 0.05 level, is the overall regression equation 
significant? 

b. Use the 0.05 level in concluding whether each partial 
regression coefficient differs significantly from zero. 

c. Interpret the results of the preceding tests in the con- 
text of the variables described in that exercise. 

d. Construct a 95% confidence interval for each partial 
regression coefficient in the population regression 
equation. 



16.31 Given the printout in Exercise 16.15, (a) determine 
the 90% confidence interval for each partial regression 
coefficient, and (b) interpret each significance test in the 
context of that exercise. 

/ data set/ Note: Exercises 16.32-16.35 require a 
computer and statistical software. 

16.32 Referring to the least-squares regression equation 
and printout obtained in Exercise 16.18: 

a. At the 0.10 level, is the overall regression equation 
significant? 

b. Use the 0.10 level in concluding whether each partial 
regression coefficient differs significantly from zero. 

16.33 Referring to the least-squares regression equation 
and printout obtained in Exercise 16.19: 

a. At the 0.05 level, is the overall regression equation 
significant? 

b. Use the 0.05 level in concluding whether each partial 
regression coefficient differs significantly from zero. 

16.34 The computer database GROWCO describes the 
characteristics of 100 companies identified by Fortune as 
among the fastest growing. Three of the variables listed 
are (1) estimated stock price/earnings ratio next year, 

(2) annual revenue growth (as a percentage), and 

(3) annual earnings per share growth (as a percentage). 
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source: "Fortune's 100 Fastest-Growing Companies," Fortune, 
September 4, 2000, pp. 142-158. 

a. Determine the least-squares regression equation for y 
= next year's price/earnings ratio estimate as a func- 
tion of Xj = annual revenue growth and x 2 
annual earnings per share growth. Interpret each of 
the partial regression coefficients. 

b. At the 0.05 level, is the overall regression equation 
significant? 

c. Use the 0.05 level in concluding whether each partial 
regression coefficient differs significantly from zero. 

d. Construct a 95% confidence interval for each partial 
regression coefficient in the population regression 
equation. 

16.35 Information describing 98 of the top automobile 
dealership groups in the United States is provided in com- 
puter database file DEALERS. For each group, the data in- 



clude (1) total group revenue during 1999, (2) the num- 
ber of retail vehicles sold that year, and (3) the number of 
dealerships in the group, source: Automotive News, May 1, 2000, 
pp. 54-60. 

a. Determine the least-squares regression equation for 

y = total group revenue as a function of x l = number 
of retail vehicles sold and x 2 = number of dealers in 
the group. Interpret each of the partial regression 
coefficients. 

b. At the 0.02 level, is the overall regression equation 
significant? 

c. Use the 0.02 level in concluding whether each partial 
regression coefficient differs significantly from zero. 

d. Construct a 98% confidence interval for each partial 
regression coefficient in the population regression 
equation. 



OVERVIEW OF THE COMPUTER 1 6.6 

ANALYSIS AND INTERPRETATION 



A Summary of the Results 

In the preceding sections, it was useful to segment the Excel and Minitab print- 
outs and discuss each portion as it became relevant. We'll now reverse the process 
and put the pieces back together. 

Table 16.5 is the Excel multiple regression printout for the fast-food example. 
In addition to the standard Excel printout shown in Computer Solutions 16.1, it 
includes the confidence and prediction interval estimates that were generated in 
Computer Solutions 16.2. 

The Minitab printout in Table 16.6 includes the segments in Computer Solu- 
tions 16.1 and 16.2, plus two standard Minitab items we have not discussed. 





E I F 


G 


H 




J 


K 


1 


SUMMARY OUTPUT 












2 






0.95 


Prediction Interval 




3 


Regression Statistics 




Predicted value = 


0.750 




4 


Multiple R 


0.8786 




Lower limit = 


0.146 




5 


R Square 


0.7719 




Upper limit = 


1.353 




6 


Adjusted R Square 


0.7067 












7 


Standard Error 


0.2419 




0.95 


Confidence Interval Estimate 


8 


Observations 


10 




Lower limit = 


0.558 




9 








Upper limit 


0.941 




10 


ANOVA 














11 




df 


SS 


MS 


F 


Significance F 




12 


Regression 


2 


1.3864 


0.6932 


1 1 .8448 


0.0057 




13 


Residual 


7 


0.4096 


0.0585 








14 


Total 


9 


1.7960 










15 
















16 




Coefficients 


Standard Error 


f Sfaf 


P-value 


Lower 95% 


Upper 95% 


17 


Intercept 


-0.2159 


0.2643 


-0.8169 


0.4409 


-0.8409 


0.4091 


18 


$Counter 


0.0855 


0.0438 


1.9516 


0.0920 


-0.0181 


0.1890 


19 


$Driveup 


0.1132 


0.0385 


2.9368 


0.0218 


0.0220 


0.2043 



TABLE 16.5 



Excel printout for the 
multiple regression analysis 
of the fast-food franchise 
data. In addition to the 
standard Excel printout 
from Computer Solutions 
16.1, it includes the 
confidence and prediction 
intervals generated in 
Computer Solutions 16.2. 
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TABLE 16.6 



Minitab printout for the 
multiple regression analysis 
of the fast-food franchise 
data. It includes the 
standard Minitab printout 
in Computer Solutions 16.1 
plus the confidence and 
prediction intervals in 
Computer Solutions 16.2. 



Regression Analysis: $Profit versus $Counter, $Driveup 

The regression equation is 

SProfit = - 0.216 + 0.0855 SCounter + 0.113 SDriveup 

Predictor Coef SE Coef T P 

Constant -0.2159 0.2643 -0.82 0.441 

SCounter 0.08547 0.04380 1.95 0.092 

SDriveup 0.11315 0.03853 2.94 0.022 

S = 0.241912 R-Sq = 77.2% R-Sq(adj) = 70.7% 

Analysis of Variance 



Source 


DF 




ss 




MS 


Regression 


2 


1 


,38635 


0 


.69318 


Residual Error 


7 


0 


40965 


0 


.05852 


Total 


9 


1 


79600 






Source DF 


Seq 


ss 









SCounter 
SDriveup 



0.88160 
0.50475 



F 

11.84 



P 

0.006 



Unusual Observations 
Obs SCounter SProfit 



Fit SE Fit Residual St Resid 
0.3961 2.06R 



8 8.6 1.3000 0.9039 0.1468 

R denotes an observation with a large standardized residual. 
Predicted Values for New Observations 
New 

Obs Fit SE Fit 95% CI 95% PI 

1 0.7496 0.0811 (0.5578, 0.9413) (0.1462, 1.3529) 

Values of Predictors for New Observations 



New 

Obs SCounter SDriveup 
1 6.00 4.00 



The first one is the "Unusual Observations" comment at the bottom, where fran- 
chise number 8 is identified as a data point that is especially distant from the re- 
gression equation. We'll expand on this shortly. 

The other portion of Table 16.6 that has not been previously discussed is the 
information in the "SEQ SS" column near the bottom. In this section, Minitab 
shows how much the error sum of squares (SSE) is reduced by each x variable 
when they are sequentially introduced into the regression equation. The first row 
in the "SEQ SS" column indicates that, compared to an "equation" in which only 
a constant is used as a predictor for y, the inclusion of x 1 reduces SSE by 0.88160. 
Once x 1 has been included, bringing x 2 into the regression equation reduces SSE 
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by another 0.50475. Note that 0.88160 + 0.50475 = 1.38635, the regression (or 
explained) sum of squares. 

In describing our overall conclusions for the fast-food franchise regression 
analysis, we will use the number of decimal places shown in Table 16.5. This is to 
avoid confusion, since the two printouts do not always present findings to the 
same number of decimal places. 

• Dependent and independent variables 

y = net profit x 1 = counter sales x 2 = drive-through sales 

• Multiple regression equation 

y = -0.2159 + 0.0855^ + 0.1132x 2 

• Partial regression coefficients 

b x = 0.0855 and b 2 = 0.1132 

• For a given level of drive-through sales, another $1.00 of counter sales in- 
creases the estimated net profit by $0.0855. 

• For a given level of counter sales, another $1.00 of drive-through sales in- 
creases the estimated net profit by $0.1132. 

• Coefficient of multiple determination 

R 1 = 0.7719 

Changes in x^ and x 2 explain 77.19% of the variation in y. 

• Significance test for the overall regression equation The regression equation 
is quite significant. For the analysis of variance test of H Q : /3 1 = /3 2 = 0, the 
p-value is just 0.0057. 

• Significance of the individual partial regression coefficients 

• b 1 (0.0855) differs from 0 at the p-value = 0.0920 level of significance 

• b 2 (0.1132) differs from 0 at the p-value = 0.0218 level of significance 

• Confidence intervals for the partial regression coefficients These are provided 
by Excel but not Minitab. We have 95% confidence that (3^ is between 
-0.0181 and 0.1890, and that /3 2 is between 0.0220 and 0.2043. 

• Confidence and prediction intervals for the dependent variable These inter- 
vals are associated with x x = $6 million in counter sales and x 2 = $4 million 
in drive-through sales. When x^ = $6 million in counter sales and x 2 = $4 
million in drive-through sales, the point estimate for y = profit is $0,750 mil- 
lion. The corresponding interval estimates are: 

• Confidence interval: For all franchises having x^ = $6 million in counter 
sales and x 2 = $4 million in drive-through sales, we are 95% confident 
that the mean profit for all such franchises is between $0,558 and 
$0,941 million. 

• Prediction interval: For any individual franchise having x 1 = $6 million in 
counter sales and x 2 = $4 million in drive-through sales, we are 95% 
confident that the profit for this single franchise is between $0,146 and 
$1,353 million. 

Statistics in Action 16.1, mentioned in this chapter's opening vignette, describes 
a multiple regression analysis in which traffic fatalities during the New Year's holiday 




Predicting Holiday Traffic Fatalities 

Among other data, the National Safety Council (NSC) holidays preceding it, we used NSC figures and multiple 

reports the number of U.S. traffic fatalities during holiday regression to analyze Thanksgiving, Christmas, and New 

periods. To see if we might be able to predict New Year's fa- Year's fatalities during the period 1981-1990. A portion of 

talities based on the number during each of the two major the Minitab printout is shown. 



The regression equation is 

NEWYRS = - 72.2 + 0.571 THKSCV + 0.396 CHRMAS 



Predi ctor 


Coef 


SE Coef 


T 


P 


Constant 


-72.19 


91.78 


-0.79 


0.457 


THKSCV 


0.5709 


0.2203 


2.59 


0.036 


CHRMAS 


0.39593 


0.08175 


4.84 


0.002 


S = 23.32 


R-Sq = 84.5% 


R-Sq(adj) 


= 80.1% 




Analysis of 


Variance 








Source 


DF 


SS 


MS 


F 


Regression 


2 


20753 


10376 


19.08 


Residual Error 7 


3806 


544 




Total 


9 


24559 







The overall regression equation is highly significant (the p- the New Year's holiday. Over this time period, 84.5% of the 

value, rounded, is 0.001), and x 2 = Christmas fatalities (p- variation in New Year's fatalities is explained by the regres- 

value = 0.002) is easily the better of the two independent sion equation. 

variables in predicting the number that will occur during Source: National Safety Council, Accident Facts 1 991 Edition p. 66. 



period are estimated based on the number of fatalities during the Thanksgiving and 
Christmas periods. The Minitab printout is also included, and it is similar in 
appearance to the one we've discussed for the fast-food franchise data. 



Residual Analysis 

Like most computer statistical software, Excel and Minitab give you an optional 
summary of the residuals, as shown in Table 16.7. As discussed in Chapter 15, 
this information can be used in finding data points that are especially distant from 
the regression equation as well as in identifying residual "patterns" that suggest 
our model may not be valid. 

Refer to the Minitab portion of Table 16.7. For each of the 10 data points, ob- 
served and estimated ("fitted") values of y are printed, along with their difference 
(the residual). Each residual has been "standardized," or expressed in terms of the 
number of standard deviation multiples it represents. The standard deviations for 
the estimated y values, listed in the "SE Fit" column, differ for the various rows. 
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EXCEL 





E 


F 


G 


H 


23 


RESIDUAL OUTPUT 






24 










25 


v^/jl/oc?/ v c* nisi i 


Predicted UPmfit 

i i rju/vicu i^rf twin 


Residuals 


Standard Residuals 


26 


1 


1.373 


0.127 


0.594 


27 


2 


0.575 


0.225 


1.053 


28 


3 


1.230 


-0.030 


-0.142 


29 


4 


1.521 


-0.121 


-0.569 


30 


5 


0.457 


-0.257 


-1.206 


31 


6 


0.985 


-0.185 


-0.869 


32 


7 


0.452 


0.148 


0.696 


33 


8 


0.904 


0.396 


1.857 


34 


9 


0.515 


-0.115 


-0.538 


35 


10 


0.787 


-0.187 


-0.874 



TABLE 16.7 



Using the procedures in 
Computer Solutions 16.3, 
we can obtain a listing of 
the residuals like the Excel 
and Minitab printouts 
shown here. Minitab and 
Excel differ slightly in their 
method of determining the 
values of the standardized 
residuals. 



MINITAB 



Obs 


SCounter 


SProfit 




Fit 


SE Fit 


Residual 


St Resid 


1 


8 


4 


1 


5000 


1 


3734 


0.1279 


0 


1266 


0.62 


2 


3 


3 


0 


8000 


0 


5754 


0.1564 


0 


2246 


1.22 


3 


5 


8 


1 


2000 


1 


2303 


0.1756 


-0 


0303 


-0.18 


4 


10 


0 


1 


4000 


1 


5214 


0.1592 


-0 


1214 


-0.67 


5 


4 


7 


0 


2000 


0 


4574 


0.1157 


-0 


2574 


-1.21 


6 


7 


7 


0 


8000 


0 


9854 


0.0919 


-0 


1854 


-0.83 


7 


4 


5 


0 


6000 


0 


4516 


0.1172 


0 


1484 


0.70 


8 


8 


6 


1 


3000 


0 


9039 


0.1468 


0 


3961 


2.06R 


9 


5 


9 


0 


4000 


0 


5147 


0.1228 


-0 


1147 


-0.55 


10 


6 


3 


0 


6000 


0 


7865 


0.0797 


-0 


1865 


-0.82 



R denotes an observation with a large standardized residual. 



This is because estimates become more uncertain when they are based on x 1 and 
x 2 values that are farther away from their respective means. 

Franchise number 8 has been "flagged" as an outlier because its standardized 
residual is more than 2 standard deviation multiples away from zero. The net profit 
for this franchise was considerably higher than estimated, and the franchise may 
have some unique characteristic that explains its strong performance versus the 
regression equation. Perhaps the manager is an excellent cost cutter or employee 
motivator, or the franchise may have a very attractive location, such as near an office 
building or high school. 

Among many other possibilities for residual analysis, we can (1) construct a 
histogram of the residuals as a rough check to see whether they are approxi- 
mately normally distributed, (2) examine a normal probability plot to check for 
normality, (3) plot the residuals versus the predicted values of y, (4) plot the 
residuals versus the order in which the observations are arranged, (5) plot the 
residuals versus x t to see whether they exhibit some kind of cycle or pattern with 
respect to this variable, and (6) repeat plot 5 for each of the other independent 
variables. 
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computer solutions 1 G.3 




Residual Analysis in Multiple Regression 
EXCEL 

Excel Residual Analysis in Multiple Regression 

Follow the procedure in Computer Solutions 16.1, but in step 2 select all four items in the Residuals section and se- 
lect Normal Probability Plots. The tabular results will be as shown in the Excel portion of Table 16.7. Three of the 
resulting Excel residual analysis plots are shown in Figure 16.2. 



MINITAB 

Minitab Residual Analysis in Multiple Regression 

With two exceptions, the procedure is as described in Computer Solutions 16.1 . (1) To get the printout of residuals 
shown in the Minitab portion of Table 1 6.7, click Results, then select In addition, the full table of fits and resid- 
uals. (2) To get the plots shown in Figure 16.3 and discussed in the text: Click Graphs. In the Residuals for plots 
menu, select Regular. In the Residual Plots menu, select all of the options. Enter C2 C3 (include the space between 
them) into the Residuals versus the variables box. 



In testing for normality, we can also apply the chi-square test that was intro- 
duced in Chapter 13 (n must be ^32) or the Kolmogorov-Smirnov test that was 
introduced in Chapter 14. In possibility number 4 of the preceding paragraph, 
we are concerned about the possible presence of autocorrelation, in which the 
value of a residual is related to the values of those which precede or follow it. 
The topic of autocorrelation is especially relevant to regression models fitted to a 
time series, and is discussed further in Chapter 18. 

Computer Solutions 16.3 shows the Excel and Minitab procedures for getting 
residuals listings like those in Table 16.7, as well as the procedures involved in 
generating Excel and Minitab residual analysis plots like those in Figures 16.2 and 
16.3 (pages 627-629). The Minitab residual analysis plots in Figure 16.3 include the 
following features: 

• Histogram of residuals. There are only 10 franchises. Given this histogram, it 
looks like the 10 residuals could have come from a normal distribution with 
a mean of 0. 

• Normal probability plot. The normal probability plot is not terribly nonlin- 
ear, again supporting the assumption that the residuals could have come from 
a normal distribution with a mean of 0. 

• Residuals versus predicted y values. The residuals seem to be somewhat ran- 
domly scattered, so it does not look like there is any relationship between the 
residuals and the predicted profits for the 10 franchises. 

• Residuals versus order of the observations. There seems to be a bit of a U shape 
here. It would appear that the order in which the franchises are listed in the 
original data may have something to do with whether they have made (a) a 
higher profit than the equation predicts or (b) a lower profit than the equation 
predicts. For example, franchise #5 has "underperformed," earning a much 
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FIGURE 16.2 
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SDriveup Residual Plot 
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Several of the plots that Excel provides for analysis of the residuals, generated using the procedures in 
Computer Solutions 16.3 

lower profit ($0.2 million) than the $0.4574 million that the equation predicts, 
a residual of —$0.2574 million. 

• Residuals versus x v Looks somewhat like a U-shaped curve here, too. The 
residuals might not be independent of the level of counter sales. 

• Residuals versus x 2 - Although the residuals seem to be a bit more scattered for 
franchises with lower levels of drive-through sales, there are not very many 
data points, and this slight pattern could have happened by chance. 
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FIGURE 16.3 



Minitab plots for analysis of 
the residuals, discussed in 
the text and generated 
using the procedures in 
Computer Solutions 16.3 
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Residuals Versus $Counter 

(response is $Profil) 




Residuals Versus $Driveup 

(response is $Profit) 
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16.36 What role does the normal probability plot play in 
examining whether the multiple regression model is 
appropriate for a given set of data? 

16.37 Discuss residual analysis in terms of the assumptions 
required for using the multiple regression model. 

16.38 Given the printout in Exercise 16.15, briefly sum- 
marize the overall results in the context of that exercise: 

a. Identify and interpret each of the partial regression 
coefficients. 

b. What proportion of the variation in y is explained by 
the equation? 

c. At what p-value is the overall regression equation 
significant? 

d. For each of the partial regression coefficients, at what 
p-value does the coefficient differ from zero? Which 
independent variables appear to be most and least 
useful in explaining changes in y? 

/ data Set / Note: Exercises 16.39-16.45 require a 
computer and statistical software. 

16.39 For the regression equation obtained in Exercise 

16.11, analyze the residuals by (a) constructing a 
histogram, (b) utilizing the normal probability plot, and 
(c) plotting the residuals versus each of the independent 
variables. Do any of the assumptions of the multiple 
regression model appear to have been violated? 

16.40 For the regression equation obtained in Exercise 

16.12, analyze the residuals by (a) constructing a 
histogram, (b) utilizing the normal probability plot, and 
(c) plotting the residuals versus each of the independent 
variables. Do any of the assumptions of the multiple 
regression model appear to have been violated? 

16.41 For the regression equation obtained in Exercise 

16.13, analyze the residuals by (a) constructing a 
histogram, (b) utilizing the normal probability plot, and 
(c) plotting the residuals versus each of the independent 
variables. Do any of the assumptions of the multiple 
regression model appear to have been violated? 

16.42 For the regression equation obtained in Exercise 

16.18, analyze the residuals by (a) constructing a 
histogram, (b) utilizing the normal probability plot, and 
(c) plotting the residuals versus each of the independent 
variables. Do any of the assumptions of the multiple 
regression model appear to have been violated? 

16.43 For the regression equation obtained in Exercise 

16.19, analyze the residuals by (a) constructing a 
histogram, (b) utilizing the normal probability plot, and 
(c) plotting the residuals versus each of the independent 
variables. Do any of the assumptions of the multiple 
regression model appear to have been violated? 



16.44 An efficiency expert wants to determine whether 
the time an assembly worker requires to perform a stan- 
dard task might be related to (1) the number of years on 
the job and (2) the score on a pre-employment aptitude 
test. The following data have been collected. 



y 


= Time, seconds: 


52 


48 


56 


52 


48 


45 


42 




= Years on job: 


7 


14 


8 


6 


10 


6 


10 


X 2 


= Test score: 


73 


78 


71 


75 


78 


78 


80 


y 


= Time, seconds: 


51 


51 


53 


47 


45 


49 






= Years on job: 


1 1 


10 


6 


8 


12 


12 




X 2 


= Test score: 


72 


79 


77 


79 


84 


72 





b. 



d. 



Determine the multiple regression equation and 
interpret the partial regression coefficients. 
Determine the 95% confidence interval for the 
population partial regression coefficients, p 1 and /6 2 . 
Interpret the coefficient of determination and the 
result of each significance test done by the computer. 
Analyze the residuals. Does the analysis support the 
applicability of the multiple regression model to these 
data? 



16.45 A test of 9 micro-size radar detectors included 
laboratory evaluation as well as the warning distance 
provided under various driving conditions. The following 
data represent over-the-hill trap warning distance, typical 
price, x-band lab sensitivity, and weight, source: Csaba 
Csere and Don Sherman, "Micro Magic Boxes," Car and Driver, April 1987, 
pp. 95-108. 



Over-the-Hill 
Trap Warning 
Detector Distance, y 



0.675 miles 

0.660 

0.640 

0.560 

0.540 

0.640 

0.540 

0.645 

0.670 



Typical 
Price, 

*1 

$289 
295 
240 
249 
260 
200 
199 
220 
250 



Laboratory 
Sensitivity, Weight, 

x, x. 



1 08 dBm/cm 2 

110 

108 

103 

107 

108 

109 

108 

112 



3.8 oz 

6.1 

5.8 

6.6 

6.0 

5.8 

5.9 

5.8 

6.2 



b. 



d. 



Determine the multiple regression equation and inter- 
pret the partial regression coefficients. 
Determine the 95% confidence interval for the popu- 
lation partial regression coefficients, jSj, p 2 , and /3 3 . 
Interpret the coefficient of determination and the re- 
sult of each significance test done by the computer. 
Analyze the residuals. Does the analysis support the 
applicability of the multiple regression model to these 
data? 
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ADDITIONAL TOPICS IN MULTIPLE 1 6.7 

REGRESSION AND CORRELATION 



Although we can't cover all facets of multiple regression and correlation analysis, 
two additional topics are deserving of our attention. These include (1) using a 
dummy variable to incorporate qualitative data into the analysis, and (2) a discus- 
sion of the potential problem known as multicollinearity. 

Including Qualitative Data: The Dummy Variable 

There may be situations where some of our data do not meet the requirements of 
either the interval or the ratio scales of measurement. In such cases, we can still 
include the data in our analysis by using one or more dummy variables. A dummy 
variable will have a value of either one or zero, depending on whether a given 
characteristic is present or absent. 

For example, if some of the 10 franchise managers were college graduates and 
others were not, we could include an variable reflecting this information. De- 
pending on the presence or absence of a college degree, x 3 could be recorded as 
either one or zero: 

x 3 = 1 if the franchise is managed by a college graduate 

and 

x 3 = 0 if the manager is not a college graduate 

The dummy variable has a partial regression coefficient, in this case b 3 . As an 
example of its interpretation, if b 3 = 0.300, the estimated profit will be $0,300 
million higher if a franchise is managed by a college graduate. 

Dummy variables can represent information such as marital status, gender, 
city versus urban location, trained versus untrained employees, or even whether 
someone wears contact lenses. As with other independent variables, computer 
analysis tests the significance of the partial regression coefficient for a dummy 
variable. For example, if b 3 were to differ significantly from zero, we would con- 
clude that whether a manager is a college graduate is an important predictor for 
the net profit of a franchise. If b 3 were positive and significant, this would indicate 
that franchises managed by college graduates are significantly more profitable 
than those managed by nongraduates. 

To further demonstrate how dummy variables can be employed, we will use 
actual data for a sample of 10 personal printers. 2 Some of the printers can print 
in color, while others are monochrome, and the models vary in how fast they can 
generate text-only materials. The data are in file CX16PRIN and represent the fol- 
lowing variables: 

PRICE = The street price for the printer 
SPEED = Printing speed for text, pages per minute 
COLOR = A dummy variable reflecting color versus monochrome printer 
(1 = color, 0 = monochrome) 



2 Source: "Top 10 Printers," PC World, April 1997, p. 211. 
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Using PRICE as the dependent variable, the estimation equation is: 



Estimated street 
price of the 
printer (dollars) 



For each additional page 
that can be printed per 
minute, the estimated 
price goes up by $28.86. 



If the printer is color 
instead of monochrome, 
the estimated price goes 
up by $27.03. 



PRICE = 250.88 + 28.86 SPEED + 27.03 COLOR 

Point and interval price estimates, confidence intervals, significance testing, 
and the coefficient of multiple determination are treated the same way as they 
were in earlier portions of the chapter. For example, a machine that prints five text 
pages per minute (SPEED = 5) and is capable of color printing (COLOR = 1) 
would have an estimated price of 250.88 + 28.86(5) + 27.03(1), or $422.21. 



Multicol linearity 

Multicollinearity is a situation in which two or more of the independent variables 
are highly correlated with each other. When this happens, the partial regression 
coefficients become both statistically unreliable and difficult to interpret. This is 
because they are not really "saying" entirely different things about y. If our fast- 
food example had included x 3 = number of vehicles using the drive-through, this 
would be very highly correlated with x 2 = drive-through sales. As a result, these 
two variables would be the mathematical equivalent of the old-time comedians 
Laurel and Hardy carrying a piano. They would be very unsure of how much of 
the "load" each should carry, and the piano would tend to be quite unstable. 

Multicollinearity is not a problem when we simply substitute x values into the 
regression equation in order to obtain an estimated value for y. However, it is a 
problem when we're trying to interpret the partial regression coefficients for the x 
variables that are highly correlated. 

When multicollinearity is present, strange things can happen. Along with a 
high correlation between two or more independent variables, the following are 
clues to its presence: 

1. An independent variable known to be an important predictor ends up having 
a partial regression coefficient that is not significant. 

2. A partial regression coefficient that should be positive turns out to be nega- 
tive, or vice versa. 

3. When an independent variable is added or removed, the partial regression 
coefficients for the other independent variables change drastically. 

One way to deal with multicollinearity is to avoid having a set of independent 
variables in which two or more variables are highly correlated with each other. In 
the fast-food example used throughout this chapter, the inclusion of a third inde- 
pendent variable representing the number of vehicles using the drive-through 
would likely cause multicollinearity problems, because x 2 = drive-through sales 
would probably be highly correlated with x 3 = drive-through traffic count — each 
variable is essentially describing the "bigness" of the drive-through aspect of the 
restaurant. 

In Chapter 17, we will discuss stepwise regression, a regression model in which 
the independent variables enter the regression analysis one at a time. The first x 
variable to enter is the one that explains the greatest amount of variation in y. The 
second x variable to enter will be the one that explains the greatest amount of the 
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remaining variation in y, and so on. At each "step," the computer decides which, 
if any, of the remaining x variables should be brought in. Every step results in a 
new regression equation and an updated printout of this equation and its analysis. 
The general idea of stepwise regression is the balancing act of trying to (1) explain 
the most possible variation in y, while (2) using the fewest possible x variables. 





16.46 What is a dummy variable, and how is it useful to 
multiple regression? Give an example of three dummy 
variables that could be used in describing your home town. 

16.47 A multiple regression equation has been developed 
for y = daily attendance at a community swimming 
pool, x 1 = temperature (degrees Fahrenheit), and x 2 = 
weekend versus weekday. (x 2 = 1 for Saturday and 
Sunday, and 0 for the other days of the week.) For the 
regression equation shown below, interpret each partial 
regression coefficient. 



Pounds Months as 



y = 100 + 



SX-y + 



15 Ox, 



16.48 For the regression equation in Exercise 16.47, the 
estimated number of persons swimming on a zero-degree 
weekday would be 100 persons. Since this level of pool 
attendance is unlikely on such a day, does this mean that 
an error was made in constructing the regression 
equation? Explain. 

16.49 What is multicollinearity, and how can it adversely 
affect multiple regression analysis? How can we tell 
whether multicollinearity is present? 

/ data set/ Note: Exercises 16.50-16.51 require a 
computer and statistical software. 

16.50 For 12 recent clients, a weight-loss clinic has 
collected the following data. (Session is coded as 1 for 
day, 0 for evening. Gender is coded as 1 for male, 0 for 
female.) Using a computer statistical package, carry out 
a multiple regression analysis, interpret the partial 
regression coefficients, and discuss the results of each 
significance test. 



Client 


Lost 


a Client 


Session Gender 


1 


31 


5 


1 1 


2 


49 


8 


1 1 


3 


12 


3 


1 0 


4 


26 


9 


0 0 


5 


34 


8 


0 1 


6 


1 1 


2 


0 0 


7 


4 


1 


0 1 


8 


27 


8 


0 1 


9 


12 


6 


1 1 


10 


28 


9 


1 0 


1 1 


41 


6 


0 0 


12 


16 


6 


0 0 



16.51 A safety researcher has measured the speed of au- 
tomobiles passing his vantage point near an interstate 
highway. He has also recorded the number of occupants 
and whether the driver was wearing a seat belt. The data 
are shown in the following table with seat belt users 
coded as 1, nonusers as 0. Using a computer statistical 
package, carry out a multiple regression analysis on these 
data, interpret the partial regression coefficients, and 
discuss the results of each significance test. 



speed (mph): 
occupants: 
belt usage: 

speed (mph): 
occupants: 
belt usage: 



61 
2 
1 

51 
2 
1 



63 
1 
1 

57 
2 
1 



55 
2 
1 

55 
2 
1 



59 59 

1 3 

0 0 

54 49 

3 1 

0 1 



52 61 

1 2 

1 0 

61 73 

1 1 

1 0 



SUMMARY 1 6.8 



• Multiple regression model 

Unlike its counterpart in Chapter 15, multiple regression and correlation analysis 
considers two or more independent (x) variables. The model is of the form y i = /3 0 + 
fi\x^ - + P 2 x 2i + " ' + Pk x ki + € i an ^' f° r a gi ven set °f x values, the expected value of 
y is given by the regression equation, E(y) = /3 0 + $\x x ■ + j8 2 x 2j + • • • + /3 k x kj . For 
each x term in the estimation equation, the corresponding jS is referred to as the 
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partial regression coefficient. Each /3 I (i = 1, 2, . . ., k) is a slope relating changes in 
E(y) to changes in one x variable when all other x's are held constant. The multiple 
standard error of estimate expresses the amount of dispersion of data points about the 
regression equation. 

• Estimation and computer assistance 

Calculations involving multiple regression and correlation analysis are tedious 
and best left to a computer package such as Excel or Minitab. The chapter 
includes formulas for determining the approximate confidence interval for the 
conditional mean of y as well as the approximate prediction interval for an indi- 
vidual y value, given a set of x values. The exact intervals can be obtained using 
Excel or Minitab. 

• Coefficient of multiple determination 

Multiple correlation measures the strength of the relationship between the depen- 
dent variable and the set of independent variables. The coefficient of multiple 
determination (R 2 ) is the proportion of the variation in y that is explained by the 
regression equation. 

• Significance testing and residual analysis 

The overall significance of the regression equation can be tested through the use of 
analysis of variance, while a f-test is used in testing the significance of each partial re- 
gression coefficient. The results of these tests are typically included when data have 
been analyzed using a computer statistical package. Most computer packages can also 
provide a summary of the residuals, which may be analyzed further in testing the re- 
gression model and identifying data points especially distant from the regression 
equation. 

• Dummy variables and multicollinearity 

A dummy variable has a value of either one or zero, depending on whether a given 
characteristic is present, and it allows qualitative data to be included in the anal- 
ysis. If two or more independent variables are highly correlated with each other, 
multicollinearity is present, and the partial regression coefficients may be unreli- 
able. Multicollinearity is not a problem when the only purpose of the equation is 
to predict the value of y. 

• Stepwise regression 

In stepwise regression, independent variables enter the equation one at a time, 
with the first one entered being the x variable that explains the greatest amount of 
the variation in y. At each step, the variable introduced is the one that explains the 
greatest amount of the remaining variation in y. Stepwise regression will be dis- 
cussed more completely in Chapter 17. 



The Multiple Regression Model 

y, =Po + Pl x li + 02*2, + + Pk X ki + e , 

where y ; = a value of the dependent variable, y 
f) 0 = a constant 

x \i> x 2i' x ki = values of the independent variables, x t , x 2 , . .., x k 
jSj, /3 2 , fik = partial regression coefficients for the independent 
variables, x v x 2 , x k 
e ■ = random error, or residual 
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The Sample Multiple Regression Equation 

y = b 0 + fcjXj + b 2 x 2 + ■•■ + b k x k 

where y = the estimated value of the dependent variable, y 
for a given set of values for Xj, x 2 , ■ ■ ■ , x k 
k = the number of independent variables 
b 0 = the y-intercept; this is the estimated value of y when 

all of the independent variables are equal to 0 
b: = the partial regression coefficient for x 

The Multiple Standard Error of Estimate 

s e = — — where y i = each observed value of y in the data 
n — k — 1 y . = the value of y that would have been 

estimated from the regression equation 
n = the number of data points 
k = the number of independent (x) variables 

Approximate Confidence Interval for the Conditional Mean of y 

5 

y ± t—j= where y = the estimated value of y based on 
n the set of x values provided 

t = t value from the t distribution table 
for the desired confidence level and 
df = n — k — 1 (n = number of data 
points, k = number of x variables) 
s e = the multiple standard error of estimate 

Approximate Prediction Interval for an Individual y Observation 

y ± ts e where y = the estimated value of y based on 
the set of x values provided 
t = t value from the t distribution table 
for the desired confidence level and 
df = n — k — 1 (n = number of data points, 
k = number of x variables) 
s e = the multiple standard error of estimate 

Coefficient of Multiple Determination 

^ 2 _ Error variation, which is not explained by the regression equation 

Total variation in the y values 

= 2(y, -y,) 2 _ SSE 

Ziy.-yf OT SST 

As in Chapter 15, the total variation in y is the sum of the explained variation plus the 
unexplained (error) variation, or 

Total variation Variation explained by Variation not explained by 

in y values = the regression equation + the regression equation 
(SST) (SSR) (SSE) 

ANOVA Test for the Overall Significance of the Multiple Regression Equation 

• Null hypothesis 

H 0 : /3 1 = /3 2 = • • • = P k = 0 The regression equation is not significant. 

• Alternative hypothesis 

Hj: One or more of the /3 ; (i = 1, 2, . . . , k) + 0 The regression 

equation is significant. 
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• Test statistic 

SSR/k 

t = — where SSK = regression sum or squares 

' n ' SSE = error sum of squares 

n = number of data points 
k = number of independent 
(x) variables 

• Critical value of the F statistic 

F for significance level specified and df (numerator) = k, 
and df (denominator) = n — k — 1 

t-Test for the Significance of a Partial Regression Coefficient, fa ; 

• Null hypothesis 

H Q : fi i = 0 The population coefficient is 0. 

• Alternative hypothesis 

Hy f) i + 0 The population coefficient is not 0. 

• Test statistic 



- 0 



where b t = the observed value of the regression coefficient 
s b = the estimated standard deviation of b i 



NOT 



1. We can also test H Q : fi i = f) j0 versus H t : ji t + fi j0 , where f) j0 is any value of interest. 
With the exception of /3 (0 replacing zero, the test is the same as described here. 

2. Critical values of t are ±t for the level of significance desired and df = n — k — 1 
(n = number of data points, k = number of independent variables). 

Confidence Interval for a Partial Regression Coefficient, /J ( 

The interval is b l ± J | 5 i.) where b i = partial regression coefficient in 

the sample regression equation 
t = t value for the confidence level 
desired, and df = n — k — 1 
s h = estimated standard deviation of b t 

Excel provides these confidence intervals as part of the regression analysis printout. 




/ data set/ Note: Exercises 16.52-16.58 require a 
computer and statistical software. 

16.52 Interested in the possible relationship between the 
size of his tip versus the size of the check and the number 
of diners in the party, a food server has recorded the fol- 
lowing for a sample of 8 checks: 



Observation 
Number y = Tip x 1 = Check x 2 = Diners 



1 


$7.5 


$40 


2 


2 


0.5 


15 


1 


3 


2.0 


30 


3 


4 


3.5 


25 


4 


5 


9.5 


50 


4 


6 


2.5 


20 


5 


7 


3.5 


35 


5 


8 


1.0 


10 


2 
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a. Determine the multiple regression equation and inter- 
pret the partial regression coefficients. 

b. What is the estimated tip amount for 3 diners who 
have a $40 check? 

c. Determine the 95% prediction interval for the tip left 
by a dining party like the one in part (b). 

d. Determine the 95% confidence interval for the mean 
tip left by all dining parties like the one in part (b). 

e. Determine the 95% confidence interval for the partial 
regression coefficients, /3 1 and /3 9 . 

f. Interpret the significance tests in the computer 
printout. 

g. Analyze the residuals. Does the analysis support the 
applicability of the multiple regression model to these 
data? 

16.53 Annual per-capita consumption of all fresh fruits 
versus that of apples and grapes from 1998 through 2003 
was as shown in the table, source: Bureau of the Census, Statistical 
Abstract of the United States 2006, p. 137. 

y = AII 



y = Starting x 1 = Grade 
Salary Point 
Graduate (Thousands) Average 



x 2 = Number 
of 

Activities 



Year 


Fresh Fruits 


x 1 = Apples 


x 2 = Grapes 


1998 


1 28.9 lb/person 


1 9.0 lb/person 


7.1 lb/person 


1999 


129.8 


18.7 


8.1 


2000 


128.0 


17.6 


7.4 


2001 


125.7 


15.8 


7.7 


2002 


126.9 


16.2 


8.7 


2003 


126.7 


16.7 


7.5 



a. Determine the multiple regression equation and inter- 
pret the partial regression coefficients. 

b. What is the estimated per-capita consumption of all 
fresh fruits during a year when 17 pounds of apples 
and 6 pounds of grapes are consumed per person? 

c. Determine the 95% prediction interval for per-capita 
consumption of fresh fruits during a year like the one 
described in part (b). 

d. Determine the 95% confidence interval for mean per- 
capita consumption of fresh fruits during years like 
the one described in part (b). 

e. Determine the 95% confidence interval for the popu- 
lation partial regression coefficients, /3 1 and /6 9 . 

f. Interpret the significance tests in the computer 
printout. 

g. Analyze the residuals. Does the analysis support the 
applicability of the multiple regression model to these 
data? 

16.54 A university placement director is interested in the 
effect that grade point average (GPA) and the number of 
university activities listed on the resume might have on the 
starting salaries of this year's graduating class. He has col- 
lected these data for a sample of 10 graduates: 



1 


$40 


3.2 


2 


2 


46 


3.6 


5 


3 


38 


2.8 


3 


4 


39 


2.4 


4 


5 


37 


2.5 


2 


6 


38 


2.1 


3 


7 


42 


2.7 


3 


8 


37 


2.6 


2 


9 


44 


3.0 


4 


10 


41 


2.9 


3 



a. Determine the multiple regression equation and inter- 
pret the partial regression coefficients. 

b. Dave has a 3.6 grade point average and 3 university 
activities listed on his resume. What would be his 
estimated starting salary? 

c. Determine the 95% prediction interval for the start- 
ing salary of the student described in part (b). 

d. Determine the 95% confidence interval for the mean 
starting salary for all students like the one described 
in part (b). 

e. Determine the 95% confidence interval for the popu- 
lation partial regression coefficients, fi l and /3 2 . 

f . Interpret the significance tests in the computer 
printout. 

g. Analyze the residuals. Does the analysis support the 
applicability of the multiple regression model to these 
data? 

16.55 An admissions counselor, examining the usefulness 
of Xj = total SAT score and x 2 = high school class rank 
in predicting y = freshman grade point average (GPA) at 
her university, has collected the sample data shown here 
and listed in file XR16055. Rank in high school class is 
expressed as a cumulative percentile, i.e., 100.0% reflects 
the top of the class. 







High 






High 


Freshman 


SAT, 


School 


Freshman 


SAT, 


School 


GPA 


Total 


Rank 


GPA 


Total 


Rank 


2.66 


1 153 


61.5 


2.45 


1 136 


45.5 


2.10 


1086 


84.5 


2.50 


966 


60.2 


3.33 


1 141 


92.0 


2.29 


1023 


74.0 


3.85 


1237 


94.4 


2.24 


976 


86.4 


2.51 


1205 


89.5 


1.81 


1066 


73.0 


3.22 


1205 


97.0 


2.99 


1076 


55.2 


2.92 


1 163 


95.9 


3.14 


1 152 


72.1 


1.95 


1 121 


64.1 


1.86 


955 


51.0 



a. Determine the multiple regression equation and inter- 
pret the partial regression coefficients. 
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b. What is the estimated freshman GPA for a student 
who scored 1100 on the SAT exam and had a cumu- 
lative class rank of 80%? 

c. Determine the 95% prediction interval for the GPA of 
the student described in part (b). 

d. Determine the 95% confidence interval for the mean 
GPA of all students similar to the one described in 
part (b). 

e. Determine the 95% confidence interval for the popu- 
lation partial regression coefficients, fi 1 and f}-,. 

f. Interpret the significance tests in the computer 
printout. 

g. Analyze the residuals. Does the analysis support the 
applicability of the multiple regression model to these 
data? 

1 6.56 Data file XR1 6056 lists the following information for 
a sample of local homes sold recently: selling price (dollars), 
lot size (acres), living area (square feet), and whether the 
home is air conditioned (1 = A/C, 0 = noA/C). 

a. Determine the multiple regression equation and interpret 
the partial regression coefficients. 

b. What is the estimated selling price for a house sitting 
on a 0.9-acre lot, with 1800 square feet of living area 
and central air conditioning? 

c. Determine the 95% prediction interval for the selling 
price of the house described in part (b). 

d. Determine the 95% confidence interval for the mean 
selling price of all houses like the one described in 
part (b). 

e. Determine the 95% confidence interval for the popu- 
lation partial regression coefficients, fi v j8 2 , and /3 3 . 



integrate 
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f. Interpret the significance tests in the computer printout. 

g. Analyze the residuals. Does the analysis support the 
applicability of the multiple regression model to these 
data? 

16.57 In Exercise 16.56, what would be the estimated 
selling price of a house occupying a 0.1-acre lot, with 
100 square feet of living area and no central air 
conditioning? Considering the nature of this "house," 
does this selling price seem reasonable? If not, might the 
computer have made an error in obtaining the regression 
equation? Explain. 

16.58 Data file XR16058 lists the following data for a 
sample of automatic teller machine (ATM) customers: 
time (seconds) to complete their transaction, estimated 
age, and gender (1 = male, 0 = female). 

a. Determine the multiple regression equation and 
interpret the partial regression coefficients. 

b. What is the estimated time required by a female 
customer who is 45 years of age? 

c. Determine the 95% prediction interval for the time 
required by the customer described in part (b). 

d. Determine the 95% confidence interval for the mean 
time required by customers similar to the one 
described in part (b). 

e. Determine the 95% confidence interval for the popu- 
lation partial regression coefficients, f) 1 and j3 2 - 

f . Interpret the significance tests in the computer 
printout. 

g. Analyze the residuals. Does the analysis support the 
applicability of the multiple regression model to these 
data? 




Thorndike Sports Equipment 



For several years, Thorndike Sports Equipment has been a 
minority stockholder in the Snow Kingdom Ski Resort and 
Conference Center. The Thorndikes visit Snow Kingdom 
several times each winter to meet with management and 
find out how business is going. In addition, the visits give 
them a chance for informal discussions with customers 
and potential customers of Thorndike ski clothing and 
equipment. Luke claims that many good product ideas 
have been inspired by a warm drink in the Snow Kingdom 
lodge. 

On their current visit, Ted and Luke are asked by Snow 
Kingdom management to lend a hand in analyzing some 
data that might help in predicting how many customers to 
expect on any given day. Overall business has been rather 
steady over the past several years, but the daily customer 
count seems to have ups and downs. The people at Snow 



Kingdom are curious as to what factors might be causing 
the seemingly random levels of patronage from one day to 
the next. 

In response to Ted's request, management supplies data 
for a random sample of 30 days over the past two seasons. 
The information includes the number of skiers, the high 
temperature (degrees Fahrenheit), the number of inches of 
snow on the ground at noon, and whether the day fell on a 
weekend (1 = weekend, 0 = weekday). Data are shown 
here and are also in file THORN16. 

Using multiple regression and correlation analysis, do 
you think this information appears to be helpful in ex- 
plaining the level of daily patronage at Snow Kingdom? 
Help Ted in interpreting the associated computer printout 
for an upcoming presentation to the Snow Kingdom 
management. 



Chapter 16: Multiple Regression and Correlation 



639 



Snow Temperature Snow Temperature 



Day 


Skiers 


Weekend 


(Inches) 


(Degrees) 


Day 


Skiers 


Weekend 


(Inches) 


(Degre 


1 


402 


0 


22 


24 


16 


648 


0 


14 


8 


2 


337 


0 


17 


25 


17 


540 


0 


17 


1 1 


3 


471 


0 


17 


28 


18 


614 


1 


36 


6 


4 


610 


0 


39 


21 


19 


796 


1 


25 


29 


5 


620 


0 


1 1 


18 


20 


477 


0 


13 


5 


6 


545 


1 


24 


17 


21 


532 


0 


35 


30 


7 


523 


0 


25 


29 


22 


732 


0 


36 


15 


8 


563 


0 


34 


17 


23 


618 


0 


18 


1 1 


9 


873 


1 


18 


1 1 


24 


728 


1 


19 


28 


10 


358 


0 


28 


6 


25 


620 


0 


29 


8 


1 1 


568 


0 


14 


9 


26 


551 


0 


18 


6 


12 


453 


0 


12 


18 


27 


816 


0 


31 


13 


13 


485 


0 


27 


27 


28 


765 


0 


24 


19 


14 


767 


1 


37 


16 


29 


650 


1 


22 


27 


15 


735 


1 


12 


6 


30 


732 


0 


1 1 


24 



Springdale Shopping Survey 



This case involves the variables shown here and described 
in data file SHOPPING. Information like that gained from 
this case could provide management with useful insights 
into how two or more variables (including dummy vari- 
ables) can help describe consumers' overall attitude to- 
ward a shopping area. 

• Variables 7-9 Overall attitude toward the shopping 
area. The highest possible rating is 5. In each analysis, 
one of these will be the dependent variable. 

• Variables 18-25 The importance (highest rating = 7) 
placed on each of 8 attributes that a shopping area 
might possess. In each analysis, these will be among 
the independent variables. 

• Variables 26 and 28 The respondent's gender and mari- 
tal status. These independent variables will be dummy 
variables. Recode them so that (1 = male, 0 = female) 
and (1 = married, 0 = single or other). 

The necessary commands will vary, depending on your 
computer statistical package. Be careful not to save the re- 
coded database, because the original values will be over- 
written if you do. 

1. With variable 7 (attitude toward Springdale Mall) as 
the dependent variable, perform a multiple regression 
analysis using variables 21 (good variety of 
sizes/styles), 22 (sales staff helpful/friendly), 26 
(gender), and 28 (marital status) as the four indepen- 
dent variables. If possible, have the residuals and the 
predicted y values retained for later analysis. 

a. Interpret the partial regression coefficient for 
variable 26 (gender). At the 0.05 level, is it 



significantly different from zero? If so, what does 
this say about the respective attitudes of males 
versus females toward Springdale Mall? Interpret 
the other partial regression coefficients and the 
results of the significance test for each. 

b. At the 0.05 level, is the overall regression 
equation significant? At exactly what p-value is 
the equation significant? 

c. What percentage of the variation in y is explained 
by the regression equation? Explain this percent- 
age in terms of the analysis of variance table that 
accompanies the printout. 

d. If possible with your computer package, generate 
a plot of the residuals (vertical axis) versus each 
of the independent variables (horizontal axis). 
Evaluate each plot in terms of whether patterns 
exist that could weaken the validity of the regres- 
sion model. 

e. If possible with your computer statistical package, 
use the normal probability plot to examine the 
residuals. 

2. Repeat Question 1, but with variable 8 (attitude 
toward Downtown) as the dependent variable. 

3. Repeat Question 1, but with variable 9 (attitude 
toward West Mall) as the dependent variable. 

4. Compare the regression equations obtained in 
Questions 1, 2, and 3. For which one of the 
shopping areas does this set of independent 
variables seem to do the best job of predicting 
shopper attitude? Explain your reasoning. 
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Easton Realty Company (A) 




Sam Easton started out as a real estate agent in Atlanta ten 
years ago. After working two years for a national real estate 
firm, he transferred to Dallas, Texas, and worked for an- 
other realty agency. His friends and relatives convinced him 
that with his experience and knowledge of the real estate 
business, he should open his own agency. He eventually ac- 
quired his broker's license and before long started his own 
company, Easton Realty Company, in Fort Worth, Texas. 

Two salespeople at the previous company agreed to 
follow him to the new company. Easton currently has eight 
real estate agents working for him. Before the real estate 
slump, the combined residential sales for Easton Realty 
amounted to approximately $15 million annually. 

Recently, the Dallas-Fort Worth (DFW) metroplex 
and the state of Texas have suffered economic problems 
from several sources. Much of the wealth in Texas was 
generated by the oil industry, but the oil industry has fallen 
on hard times in recent years. Many savings and loan 
(S&L) institutions loaned large amounts of money to the 
oil industry and to commercial and residential construc- 
tion. As the oil industry fell off and the economy weak- 
ened, many S&Ls found themselves in difficulty as a result 
of poor real estate investments and the soft real estate mar- 
ket that was getting worse with each passing month. With 
the lessening of the Cold War, the federal government 
closed several military bases across the country, including 
two in the DFW area. Large government contractors, such 
as General Dynamics, had to trim down their operations 
and lay off many workers. This added more pressure to the 
real estate market by putting more houses on an already 
saturated market. Real estate agencies found it increas- 
ingly difficult to sell houses. 

Two days ago, Sam Easton received a special delivery 
letter from the president of the local Board of Realtors. 
The board had received complaints from two people who 
had listed and sold their homes through Easton Realty in 
the past month. The president of the Board of Realtors 
was informing Sam of these complaints and giving him 
the opportunity to respond. Both complaints were trig- 
gered by a recent article on home sales that appeared in 
one of the local newspapers. The article contained the 
table shown here. 

Typical Home Sale, DFW Metroplex 

Average Sales Price $1 04,250 

Average Size 1 860 sq. ft. 

Note: Includes all homes sold in the Dallas-Fort Worth 
metroplex over the past 1 2 months. 




The two sellers charged that Easton Realty Company 
had underpriced their homes in order to accelerate the 
sales. The first house is located outside of the DFW area, is 
four years old, has 2190 square feet, and sold for $88,500. 
The second house is located in Fort Worth, is nine years 
old, has 1848 square feet, and sold for $79,500. Both 
houses in question are three-bedroom houses. Both sellers 
believe that they would have received more money for 
their houses if Easton Realty had priced them at their true 
market value. 

Sam knew from experience that people selling their 
homes invariably overestimate the value. Most sellers be- 
lieve they could have gotten more money from the sale of 
their homes. But Sam also knew that his agents would not 
intentionally underprice houses. However, in these bad eco- 
nomic times, many real estate companies, including Easton 
Realty, had large inventories of houses for sale and needed 
to make sales. One quick way to unload these houses is to 
underprice them. On a residential sale, an agent working 
under a real estate broker typically makes about 3% of the 
sales price if he originally listed the property. Dropping the 
sales price of a $100,000 home down to $90,000 would 
speed up the sale and the agent's commission would fall 
only from $3,000 to $2,700. Some real estate agents might 
consider sacrificing $300 in order to get their commission 
sooner, but it is unethical because the agent is supposed to 
be representing the seller and acting in the seller's best inter- 
ests. Sam had to convince the two sellers and the Board of 
Realtors that there was no substance to the complaints. The 
question was, How was he going to do it? 

First, he needed to obtain recent residential sales data. 
Unfortunately, the local multiple listing service (MLS) did 
not contain actual sales prices of homes. However, Pat 
McCloskey, a local real estate appraiser, did maintain a 
database that had the sales information Sam needed. Phon- 
ing Pat, Sam found that she indeed had the data he 
required, but she would have to merge her personal 
database with data downloaded from the MLS in order to 
give Sam the necessary information. Fortunately, this was a 
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relatively simple task, and Pat could get the data to Sam the 
next day. 

Sam had asked Pat to give him all the data she had 
on home sales that had taken place in the DFW area over 
the previous three months. Although Pat's database did 
not contain all home sales in the DFW metroplex over 
that period of time, she felt the data she had were repre- 
sentative of the entire population. The data for each 
home sold included the sale month, the sale price, the 
size of the home (in square feet of heated floor space), 
the number of bedrooms, the age of the house, whether 
the house was located in Dallas versus Fort Worth or 
elsewhere within the metroplex, and the real estate com- 
pany that made the sale. 

Assignment 

The real estate data compiled by Pat McCloskey for Sam 
Easton are contained in the file EASTON on the text CD. 
The Data Description section provides a partial listing of 
this data file along with definitions of the variables. Using 
this data set and other information given in the case, help 
Sam Easton respond to the underpricing claims of his for- 
mer clients. The Case Questions will assist you in your 
analysis of the data. Use important details from your anal- 
ysis of the data to support your recommendation. 

Data Description 

The EASTON file contains data on home sales over the 
past three months in the DFW metroplex. A partial listing 
of the data is shown at the bottom of the page. 
The variables are defined as follows: 



MONTH: 



PRICE: 

SQFEET: 

BEDROOMS: 

AGE: 

DALLAS: 



Month in which the sale took place: 

4, if April 

5, if May 

6, if June 

Sale price of the house in dollars 
Square feet of heated floor space 
Number of bedrooms in the house 
Age of the house in years 
Area within the DFW metroplex 
where the house is located: 
1, if in Dallas 

0, if in Fort Worth or elsewhere 
in the metroplex 



EASTON: 1, if Easton Realty Company sold 
the house 
0, otherwise 

Case Questions 

1. Considering the claim that the two houses in question 
did not sell for their fair market value: 

a. Compare the selling prices of the two houses to the 
average selling price of all houses sold in the most re- 
cent three-month period. Does the difference appear 
to be substantial? 

b. Provide at least two reasons why the comparison in 
part (a) is not fair — i.e., describe at least two pric- 
ing factors that are not being considered in drawing 
the comparison in part (a). 

c. In making their argument, the complaining sellers 
are relying heavily on the average selling price 
($104,250) stated in the article for all homes sold 
in the area during the previous twelve months dur- 
ing a weakening housing market. Compare the 
$104,250 average with that for all houses sold in 
the area during the most recent three months, then 
comment on how this comparison might affect the 
validity of their argument. 

2. Use a multiple regression model to estimate PRICE as 
a function of SQFEET, BEDROOMS, AGE, DALLAS, 
and EASTON. 

a. Interpret the partial regression coefficients in the 
equation. Specifically, comment on the sign and 
numerical value of the partial regression coeffi- 
cient for EASTON in terms of the claim that Eas- 
ton Realty Company has been underpricing its 
residential properties relative to other real estate 
companies. 

b. For each of the two houses that are the subject of 
complaints, construct and interpret the 95% pre- 
diction interval for a comparable house that is 
being sold by a realtor other than Easton. 

c. On the basis of your analyses in question 1 and 
in parts (a) and (b) of this question, prepare a 
brief, convincing response to the claims of under- 
pricing. 



Month 

4 
4 
4 
4 
4 



Price 

$73,800 
69,200 
98,500 
82,200 

100,300 



SqFeet 

1284 
1919 
2316 
1821 
1703 



Bedrooms 

2 
3 
3 
4 
3 



Age 

6 
8 
7 
4 
4 



Dallas 

1 

0 
0 
0 

1 



Easton 

0 
0 
0 
0 
0 
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Circuit Systems, Inc. (C) 




In Chapters 11 and 14, we visited Circuit Systems, Inc., a 
company that was concerned about the effectiveness of its 
new program for reducing the cost of absenteeism among 
hourly workers. Besides allowing employees to be paid for 
unused sick leave, the program also pays $200 toward 
membership in a local health club for those employees 
who wish to join. Thomas Nelson, who introduced the 
new program, has collected data describing the 233 em- 
ployees who have been with the company for both the year 
preceding the new program and the year following its im- 
plementation. The information Mr. Nelson has collected is 
in the CIRCUIT data file on the text CD, and the variables 
and description of his program are as described in Circuit 
Systems, Inc. (A), in Chapter 11. 

1. Use a multiple regression model to estimate the 

number of days of sick leave this year as a function of 
two variables: days of sick leave taken last year and 
whether the employee is a participant in the exercise 
program. Interpret the partial regression coefficients in 
the equation. Specifically, comment on the sign and 




numerical value of the partial regression coefficient for 
the dummy variable representing participation in the 
exercise program. On this basis, does the exercise pro- 
gram seem worthy of continuation? 
2. What percentage of the variation in days of sick leave 
this year is explained by the regression equation you 
generated for question 1 ? To the extent that there is 
unexplained variation in the number of sick days 
taken this year, what other variables — including pos- 
sible variables not in the database — might influence 
the number of days of sick leave taken this year? 



The NFL Passer Rating 



To many fans, one of the more mysterious facets of the National Football League 
(NFL) is its numerical rating of quarterbacks. It's something the sports announcers 
and "Monday morning quarterbacks" like to talk about, but they spend little or no 
time talking about how this rating is determined. Media listings of passer ratings 
are usually accompanied by other performance statistics, such as passing yards, 
completion percentage, touchdowns, interceptions, and so on. 

In this chapter, we learn about stepwise regression, a method by which the 
candidate predictor variables are mathematically evaluated before being invited into 
the equation. As in the NFL, not all of them make the team. At each round of the 
evaluations, just one is invited in, and the invitations stop when we run out of 
variables that are able to make a meaningful contribution to 

estimating the dependent variable. ^ rea | thing, but a 



examining the NFL passer rating. The idea is to 
find out which variables are actually used, and 
how they might be combined. Although the passer 
rating is an interesting and useful measure, the NFL 
is quick to point out that such statistics "do not 
reflect leadership, play-calling, and other intangible 
factors that go into making a successful professional 
quarterback." 



Later in the chapter, you can use 
Exercise 17.73 and stepwise regression in 




Source: NFL.com, June 30, 2003. 
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Build polynomial regression models to describe curvilinear relationships. 

Apply qualitative variables representing two or more categories. 

Use logarithmic transformations in constructing exponential and multiplicative 
models. 

Identify and compensate for multicollinearity in the data. 

Apply stepwise regression in selecting which variables to use in a model. 

Determine which of several competing models might be most suitable for the 
data and the situation. 



17.1 INTRODUCTION 



In Chapter 15, we applied the simple linear regression model in estimating the 
value of a dependent variable on the basis of a single independent variable. In 
Chapter 16, we expanded this model to include two or more independent vari- 
ables. Both models assume a linear relationship, and the multiple regression 
model assumes that the independent variables are relatively independent from 
each other. In this chapter, we will examine more sophisticated models for the 
estimation of y. These will include curvilinear models as well as models in which 
qualitative variables are considered. We will also consider models in which the in- 
dependent variables are related to each other. We will introduce a regression 
method that allows us to strategically choose a subset of the independent vari- 
ables that has nearly the predictive power of the original, larger set. 

The final section focuses on an important concept that is considered throughout 
the chapter: the importance of selecting a model that is appropriate to both the data 
and our analysis objectives. As an introductory example, note the scatter diagrams 
in parts (a) and (b) of Figure 17.1. As x increases, y initially increases, then it de- 
creases, and then it increases again. As part (a) indicates, a linear model is not a very 
good fit to the data. In part (b), we use one of the models that will be presented in 
Section 17.2, and it does a much better job of estimating y for a given value of x. 



1 7.2 POLYNOMIAL MODELS WITH ONE 

QUANTITATIVE PREDICTOR VARIABLE 



By extending the simple linear regression model of Chapter 15 to include terms 
that include the independent (or predictor) variable x to a power greater than 1, 
we can construct models to fit data that are decidedly nonlinear. Such models can 
fit even data like those shown in Figure 17.1, where y responds to increases in x 
by initially increasing, then decreasing, then increasing again. The equation 
shown in part (b) of Figure 17.1 includes both an x 2 term and an x 3 term, and is 
a member of the family of polynomial models discussed in this section. 

The General Polynomial Model 

The polynomial model includes p terms that contain the independent variable x, 
with the size of the exponent increasing in consecutive terms from 1 to a maximum 



learning 
objectives * 

After reading this 
chapter, you should 

be able to: » 
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(a) Excel scatter diagram and fitted linear equation 
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(b) Excel scatter diagram and fitted nonlinear equation 




FIGURE 17.1 



It is important to select an 
appropriate model for a 
given set of data. In part (a), 
the simple regression model 
of Chapter 1 5 is applied 
to the data. The model in 
part (b), to be introduced 
in Section 17.2, is a much 
better fit. 



of p. There is only one independent, or predictor variable (x), but each x term in 
the equation is actually being treated as if it were a separate independent variable. 
The general form of the model is shown here: 

The general polynomial model: 

Vi = £o + Pi x , + Pi x f + ^ + ■-+ Pp x f + e , 

where y . = a value of the dependent variable, y 
)3 0 = a constant 

x i = a value of the independent (predictor) variable, x 
jSj, jS 2 , . . . ,pp = partial regression coefficients 
e ; = random error, or residual 
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In the general polynomial model, y. = /3 0 + /3 1 x t + /3 2 xj + fi^xf + ••• + 
Ppxf + e-, the order of the polynomial is the integer p. As in simple and multiple 
regression, the constant (/3 0 ) and the partial regression coefficients (j8 l5 /3 2 , P p ) 
must be estimated on the basis of sample data. Our discussion in the remainder of 
this section pertains to polynomial models in which p = 1, 2, or 3, because prac- 
tical situations rarely call for a model that is higher than the third order. The first-, 
second-, and third-order polynomial models will be discussed individually. 

First-Order Polynomial Model 

The first-order polynomial model is the same as the linear regression model pre- 
sented in Chapter 15, and is appropriate when there is a linear relationship 
between y and x. With E(y) as the expected value of y for a given value of the pre- 
dictor variable x, the format is 

First-order polynomial model: 

E(y) =f3 0 + p x x 

As shown in part (a) of Figure 17.1, this model is not suitable when the rela- 
tionship between y and x is nonlinear. When the relationship does happen to be 
linear, the equation can have either a positive slope > 0) or a negative slope 
(j3 1 < 0), as shown in Figure 17.2. If you were driving along the plotted line in a 
car, you would always be driving straight. 

Second-Order Polynomial Model 

The second-order polynomial model has a parabolic shape in that the curve can 
resemble a "U" that is either upright or upside-down. The format for this 
model is 

Second-order polynomial model: 

E(y) =(3 0 + p lX + p 2 x 2 

In this model, the fi x x term provides a linear component while the /3 2 x 2 term 
provides a nonlinear component that allows the line to curve. The shape of the 
curve will be determined by the relative sizes of /3 1 and /3 2 and on their signs, and 
typical curves for this model are shown in Figure 17.3. To the extent that any 



FIGURE 17.2 



In the first-order polynomial 
model, the expected value 
of y is E(y) = p Q + 0,x. This 
is the same as the simple 
regression model in Chap- 
ter 1 5, and the slope can 
be either positive > 0) 
or negative (/3, < 0). 
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FIGURE 17.3 



For the second-order 
polynomial model, the 
expected value of y will 
be f(y) = 0 O + 0,x + 0 2 x 2 , 
and the shape of the curve 
will depend on both the 
signs and the relative 
sizes of the coefficients 
j8 1 and0 2 . Withx>0, 
combinations of positive 
and negative values of 
these coefficients can 
result in curves like these. 



given second-order polynomial exhibits a curve, it will always curve in the same 
direction. 

Third-Order Polynomial Model 

As with the second-order model, the third-order polynomial model allows the 
plotted line to curve. However, it makes it possible for the line to reverse the di- 
rection in which it is curving. The format for this model is 



Third-order polynomial model: 

E (y) = /3 0 + P x x + /3 2 x 2 + /3 3 x 3 



In this model, the (3^ term provides a linear component, the /3 2 x 2 term pro- 
vides a nonlinear component allowing the line to curve, and the P 3 x 3 term pro- 
vides a nonlinear component enabling the line to reverse the direction of curva- 
ture. The sign of [3 1 determines the initial slope of the line for small positive values 
of x. If /3 2 and /3 3 have opposite signs, the line will be able to reverse the direction 
it is curving. If /3 2 and /3 3 have the same sign, the effect of the /3 3 x 3 term will 
be to make the line curve more strongly in the direction it is already curving. 
Figure 17.4 shows example plots of the third-order model when /3, and /3 3 have 
opposite signs. 



£(v) = 0o + 0i* + 0 2 * i + 0i* j 



/3 2 > 0 and 0 3 < 0 


02 < 0 and 0, > 0 


E(y) 




E(y) 




0 


0 



FIGURE 17.4 



In the third-order 
polynomial model, the 
expected value of y is f(y) = 
0 O + 0,x + 0jX 2 + 0 3 x 3 . 
When 0 2 and 0 3 have oppo- 
site signs, the fitted line can 
reverse its direction of curva- 
ture. For positive values of x 
and fi v these are examples 
of curves when 0 2 > 0 and 
0 3 < 0, and vice versa. 
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example 



Fitting a Polynomial Model 

U.S. News & World Report compiles information each year regarding leading U.S. 
graduate schools in a variety of fields. Data file CX17MBA includes the magazine's 
overall score for each of 50 leading MBA programs along with the average starting 
salary and bonus for the program's most recent graduating class. 1 In our analysis, we 
will use overall score as a predictor variable to estimate starting salary and bonus. 

SOLUTION 

Computer Solutions 17.1 describes the procedures and shows the results when we 
use Excel and Minitab in fitting a polynomial model to a set of data. With either 
software package, it is very easy to select whether the model will be first, second, or 
third order. Note that Minitab refers to these as linear, quadratic, or cubic, 
respectively. In Computer Solutions 17.1, we fit a second-order polynomial model 



computer solutions 17.1 



Fitting a Polynomial Regression Equation, One Predictor Variable 
EXCEL 
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Fitting a Polynomial Regression Equation with Excel 

1. Using file CX17MBA.XLS, with MBA average starting salary and bonus data (in thousands of dollars) in A2:A51, U.S. 
News & World Report overall scores in F2:F51 and labels in A1 and F1 : Click Tools. Click Data Analysis. Select Re- 
gression. Click OK. 

2. Enter A1:A51 into the Input Y Range box. Enter F1:F51, into the Input X Range box. Select Labels. Enter H1 into 
the Output Range box. Select Line Fit Plots. Click OK. 



Source: U.S. News & World Report: America's Best Graduate Schools 2001, p. 23. 
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3. To show the trend line: Right-click on one of the data points. Click Add Trendline. We will use a second-order poly- 
nomial model in this example. Within the Type menu, select Polynomial and enter 2 into the Order box. Within the 
Options menu, select Display Equation and Display R-Squared Click OK. Following minor formatting adjust- 
ments, the scatter diagram and polynomial regression equation appear as shown here. 

Note: To obtain the Excel printout in Table 17.1 : In step 2, above, enter F1:G51 into the Input X Range box and do not 
select the Lines Fit Plot. Omit step 3. (The squares of the overall scores are in column G.) 



MINITAB 

Fitting a Polynomial Regression Equation with Minitab 



Fitted Line Plot 

StartSal= - 52.15 + 2.785 USNscor 
- 0.01233 USNscor**2 
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Using file CX17MBA.MTW, with MBA average starting salary and bonus data (in thousands of dollars) in C1 and U.S. News & 
World Report overall scores in C6: Click Stat. Select Regression. Click Fitted Line Plot. Enter C1 into the Response (Y) box. 
Enter C6 into the Predictor (X) box. We will be using the second-order polynomial model in this example. Within the Type of 
regression model box, select Quadratic. Click OK. The Minitab plot, fitted equation, and R 2 items appear as shown here. 



Note: The Minitab printout in Table 
described above. 



7.1 is automatically generated when we carry out the Fitted Line Plot procedure 



to the data. With x = overall score and y = average starting salary and bonus (thou- 
sands of dollars), the estimation equation and coefficient of determination, R 2 , are 

y = -52.152 + 2.7854* - 0.0123x 2 , with R 2 = 0.8748 

Interpreting R 2 = 0.8748, we can say that 87.48% of the variation in average 
starting salary and bonus is explained by the predictor variable, overall score, 
through the fitted equation. Table 17.1 (page 650) shows the printouts obtained 
from performing a multiple regression analysis with x and x 1 treated as separate 
variables. These printouts provide greater detail, and we can see that the regres- 
sion equation is significant at the 0.000 level (rounded). The procedures for ob- 
taining the printouts in Table 17.1 are described in the Excel and Minitab portions 
of Computer Solutions 17.1. 



650 



Part 5: Regression, Model Building, and Time Series 



TABLE 17.1 



Regression printouts corre- 
sponding to the plots in 
Computer Solutions 17.1. 
In each case, we have 
created a new variable: 
the square of the U.S. 
News & World Report 
overall score. The p-value 
for the regression equation 
is 0.000 (rounded), and 
Excel reports it in scientific 
notation (6.27 x 10 -22 ). 



EXCEL PRINTOUT 





H I 
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L 


M 
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1 


SUMMARY OUTPUT 












2 














3 


Regression Statistics 












4 


Multiple R 


0.935 












5 


R Square 


0.875 












6 


Adjusted R Square 


0.869 












7 


Standard Error 


4.465 












8 


Observations 


50 












9 
















1 0 


ANOVA 














11 




df 


SS 


MS 


F 


Significance F 




12 


Regression 


2 


6543.31 


3271 .65 


164.13 


6.27E-22 




13 


Residual 


47 


936.88 


19.93 








14 


Total 


49 


7480.19 










15 
















16 




Coefficients 


Standard Error 


tStat 


P-vaiue 


Lower 95% 


Upper 95% 


17 


Intercept 


-52.1522 


29.6476 


-1.7591 


0.085 


-111.7955 


7.4911 


18 


USNscor 


2.7854 


0.7766 


3.5868 


0.001 


1.2231 


4.3476 


19 


USNscor^sq 


-0.0123 


0.0049 


-2.4945 


0.016 


-0.0223 


-0.0024 



MINITAB PRINTOUT 

Polynomial Regression Analysis: StartSal versus USNscor 

The regression equation is 

StartSal = - 52.15 + 2.785 USNscor - 0.01233 USNscor**2 



S = 4.46470 R-Sq = 87.5% R-Sq(adj) = 86.9% 
Analysis of Variance 

Source DF SS MS F P 

Regression 2 6543.31 3271.65 164.13 0.000 
Error 47 936.88 19.93 

Total 49 7480.19 



If we use the procedures in Computer Solutions 17.1 to fit first-order and 
third-order polynomial models, the resulting equations and their R 2 values are as 
shown below 

First-order polynomial: 

y = 21.2597 + 0.8519x, with R 2 = 0.858 

Third-order polynomial: 

y = -188.674 + 8.101x - 0.0802x 2 + 0.000285% 3 , with R 2 = 0.876 

The third-order polynomial, with R 2 = 0.876, explains a greater percentage 
of the variation in average starting salary and bonus than either of the other mod- 
els. This is not surprising, because it includes two more terms than the first-order 
model and one more term than the second-order model. However, its R 2 advan- 
tage over the second-order model is extremely slight (R 2 = 0.876 versus R 2 = 
0.8748, respectively), and the greater simplicity of the second-order model may 
make it the most preferable of the polynomial models. In Section 17.8, we will 
further discuss the process of selecting the appropriate model for a set of data. 
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17.1 The models in this section include the first-, second-, 
and third-order polynomials. What would be the form of 
a fifth-order polynomial? Would such a polynomial 
model tend to be practical for use with business data? 

17.2 As x increases, y increases, but at a decreasing rate. 
If a second-order polynomial model were fitted to the 
scatterplot of the data, what would be the signs of the 
partial regression coefficients in the model? 

17.3 As x increases, y decreases, and at an increasing 
rate. If a second-order polynomial model were fitted to 
the scatterplot of the data, what would be the signs of the 
partial regression coefficients in the model? 

17.4 As the temperature increases, an experimental 
lubricant initially decreases in viscosity (resistance to 
flow), then increases, then decreases again. Of the 
polynomial models discussed in this section, which one 
would tend to be most applicable for such data? What 
would be the signs of the partial regression coefficient(s) 
in the model? 

17.5 A communications firm employs a large number of 
technical sales representatives who make personal visits 
to potential customers. For data collected on y = average 
amount of sales per call and x = number of years of 
experience, the accompanying scatter diagram has been 
constructed. If a third-order polynomial model were fitted 
to these data, what would be the signs of the partial 
regression coefficients in the model? 
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17.6 As the number of cable television subscribers has 
increased, the number of subscribers to satellite televi- 
sion services has increased as well. For y = number of 
satellite subscribers and x = number of cable subscribers, 
the scatter diagram shown here has been constructed. 
Of the polynomial models discussed in this section, 
which one would tend to be most applicable for such 
data, and what would be the signs of the partial 
regression coefficients ) in the model? source: USA Today, 
May 3, 2000, p. 26A. 
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/ data set/ Note: Exercises 17.7-17.10 require a 
computer and statistical software. 

17.7 Smith Travel Research has reported average 
occupancy and room rates for hotels in the largest U.S. 
hotel markets. The data are also in file XR17007. For 

y = average room rate (dollars) and x = average 
occupancy rate (percent), fit a second-order polynomial 
model to the data and determine the percentage of the 
variation in average room rates that is explained by the 
model. Given the nature of these variables, comment on 
causation possibilities that might be present, source: Wall 
Street Journal Almanac 1999, p. 648. 

17.8 International Data Corporation has reported the 
following costs per gigabyte of hard-drive storage space 
for the period 1995-2000. The data are also in file 
XR17008. Using x = 1 through 6 to represent years 
1995-2000, fit a second-order polynomial model to the 
data and estimate the cost per gigabyte for the year 2008. 
source: Jon L. Jacobi, "Cheaper by the Gigabyte," PC World, July 2000, p. 59. 



Year 



x = Year code 



Cost/gigabyte 



1995 
1996 
1997 
1998 
1999 
2000 



$261.84 
137.94 
69.68 
29.30 
13.09 
6.46 



17.9 An early edition of Car and Driver magazine 
reported the engine horsepower, top speed (mph), curb 
weight (pounds), and 0-60 mph acceleration time (seconds) 
for a selection of automobiles that ranged from large 
American "muscle cars" to small economy imports. The 
data are in file XR17009. Using y = 0-60 time and 
x = horsepower, fit a second-order polynomial model 
and predict the 0-60 time for a car with 300 horsepower. 
At the 0.05 level of significance, is the regression equation 
significant? source: 1964 Car and Driver Yearbook, pp. 14-63. 
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17.10 Repeat Exercise 17.9, but for a third-order 
model and using y = top speed (mph) as the dependent 
variable. During the 1960s, manufacturers of the so- 
called "muscle cars" were accused of deliberately 
understating the massive amount of horsepower these 
engines produced. Examining the curve for the third- 
order model, would it appear that these accusations 
might have been justified? 

17.11 For the 20 top-grossing U.S. films of the 1990s, 
data have been reported for y = foreign gross ticket sales 
and x = domestic gross ticket sales, both in millions of 
dollars. The printout here shows the results for a second- 
order polynomial model fitted to the data. What amount 
of foreign gross ticket sales would be predicted for a film 



that generated $400 million in domestic gross ticket 
sales? At the 0.05 level of significance, is this model 
significant? source: "Overseas Grosses Outdoing North America," 
USA Today, December 13, 2000. 
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R Square 
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Adjusted R Square 
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Standard Error 


112.603 












8 


Observations 


20 












9 
















10 


ANOVA 














11 




df 


SS 


MS 


F 


Significance F 




12 


Regression 
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697099.23 


348549.61 


27.49 


4.71 E-06 




13 


Residual 


17 


215551.88 


12679.52 
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Total 


19 


912651.11 
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Coefficients 


Standard Error 


[ Stilt 


P-value 


Lower 95% 


Upper 95% 


17 


Intercept 


860.8444 


244.9641 


3.5142 


0.0027 


344.0146 


1377.6743 


18 


DomGross 


-4.1517 


1.4392 


-2.8848 


0.0103 


-7.1882 


-1.1153 


19 


DomGross_sq 


0.0077 


0.0019 


4.0038 


0.0009 


0.0036 


0.0117 



1 7.3 POLYNOMIAL MODELS WITH TWO 

QUANTITATIVE PREDICTOR VARIABLES 



In this section, we consider polynomial models in which two quantitative predic- 
tor variables are used in estimating the value of the dependent variable. Our dis- 
cussion will include three such models. 

First-Order Model with Two Variables 

The first-order polynomial model with two quantitative predictor variables is the 
same as the two-variable multiple regression model of Chapter 16. Accordingly, 
the format is simply 

First-order model with two predictor variables: 

E(y) = P 0 + P\ x \ + 02 x 2 

In this model, the expected value of y is based on x 1 and x 2 , as in the multiple 
regression models of Chapter 16. The model is applicable when there is no inter- 
action between x 1 and x 2 . Interaction between x 1 and x 2 is absent whenever the 
following conditions are met: 

1. For a given value of x v a 1-unit increase in x 1 will cause E(y) to increase by /3 2 . 

2. For a given value of x 2 , a 1-unit increase in x 1 will cause E(y) to increase by j8 1 . 

In other words, interaction between the two predictor variables is absent 
when the effect on E(y) of a 1-unit increase in x 2 does not depend on the value of 
x v and vice versa. Figure 17.5 shows a situation in which interaction is absent. 
For the estimation equation y = 10 + Sx-y + 3x 2 , we have plotted three separate 
linear relationships between y and x^. one for each of the assumed values of x 2 . It 
is very important to note that the lines are parallel — regardless of the value of x 2 , 
the slope is the same. 

First-Order Model with an Interaction Term 

When we believe there may be interaction between the two predictor variables, 
we can enhance the first-order model just discussed by adding an interaction term. 
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y = 10 + 5x[ + 3x 2 



when x-y = 3 




FIGURE 17.5 



In this first-order polynomial 
model with two quantita- 
tive predictor variables, 
there is no interaction be- 
tween x, and x 2 . Regardless 
of the value of x 2 , the slope 
of the line relating y to x, 
remains the same. 



5> = 50 + 4xj + 3jt 2 + 5x1*2 




when x, = 2 



FIGURE 17.6 



In this first-order polynomial 
model with an interaction 
term included, there is 
interaction between x, 
and x 2 . The slope of the 
relationship between y 
and x, depends on the 
value of x,. 



The format for this model is 

First-order model with two predictor variables and interaction: 
E(y) = Po + + Pl x 2 + /3 3 Xi*2 

In this model, the interaction term is $tX x x 2 . The term includes the partial re- 
gression coefficient /3 3 and the cross product x 1 x 2 - Because the sum of the expo- 
nents in the cross product is 2, this model can technically be considered a second- 
order model. However, to avoid confusion with what is typically referred to as the 
"full" second-order model discussed next, we will consider this to be a first-order 
model that has had an interaction term added to it. 

In this model, interaction is present because the effect on E(y) of a 1-unit in- 
crease in x 2 will depend on the value of x v and vice versa. Figure 17.6 shows 
a situation in which interaction is present. For the estimation equation 
y = 50 + Axy + 3x 2 + 5x^x 2 , we have plotted three separate linear relationships 
between y and x^: one for each of the assumed values of x 2 . Note that the lines are 
not parallel and that the slope of the relationship between y and x 1 depends on 
the value of x 2 . 

Second-Order Model with Interaction 

When, in addition to the possibility of interaction between the predictor vari- 
ables, we believe there could be a nonlinear relationship between y and each of 



654 



Part 5: Regression, Model Building, and Time Series 



the predictor variables, we can use the second-order model with interaction. The 
format for this model is 

Second-order model with interaction: 

E(y) = /3 0 + yS 1 ^c 1 + /3 2 x 2 + j3 3 x 2 + + Ps x i x 2 

In addition to the interaction term jSytj^, this model includes the quadratic 
terms P 3 x 2 an d /3 4 *2- As mentioned previously, this model is typically referred 
to as the "full" second-order model — not only do the exponents in the interac- 
tion term sum to 2, but each predictor variable is in a term where it is squared. 



example 



mm 




Polynomial Model with Two Predictor Variables 

Residents of cold climates are familiar with the wind chill index that estimates 
how cold it feels. This index is based on two variables: air temperature and wind 
speed. File CX17WIND contains data showing the wind chill index for various 
combinations of air temperature and wind speed. 2 The file also includes, as sepa- 
rate variables, the temperature-squared, wind-speed-squared, and (temperature X 
wind speed) values. In Computer Solutions 17.2, we use Excel and Minitab to ob- 
tain a second-order polynomial model with two predictor variables and an inter- 
action term. The Minitab equation is 



WChill = -37.7 + 1.38Temp + 0.040Wind + 0.00050Temp_Sq 
- 0.00227Wind_Sq + 0.00146(Temp X Wind) 

SOLUTION 

Interpreting the R 2 portion of the printouts, we can say that 78.6% of the varia- 
tion in wind chill is explained by this model. However, if we apply either of the 
two more simple models discussed in this section, we end up obtaining the same 
R 2 value for these data. The procedures for applying these models are similar to 
the steps described in Computer Solutions 17.2, and the resulting Minitab equa- 
tions and R 2 values are 

First-order polynomial: 

WChill = -36.5 + 1.42Temp - 0.073Wind, with R 2 = 0.786 



First-order polynomial, with interaction term added: 



WChill = -36.5 + 1.38Temp - 0.073Wind + 0.00152(Temp X Wind), 
with R 2 = 0.786 



Neither the square terms nor the interaction term makes very much difference 
in the resulting equation. Because the simplest model does as well as the others in 
its ability to explain variation in the wind chill values, it would be the preferred 
model to fit to these data. 



2 "Some Experts Say Wind Chill Index Is Hype," Pittsburgh Tribune Review, February 5, 2000, p. Dl. 
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Fitting a Polynomial Regression Equation, Two Predictor Variables 



EXCEL 





G H 


| 


j 


K 


L 


M 


1 


SUMMARY OUTPUT 












2 














3 


Regression Statistics 












4 


Multiple R 


0.887 














R Square 


0.786 












Q 


Adjusted R Square 


0.778 












7 


Standard Error 


16.374 












8 


Observations 


135 












9 
















10 


ANOVA 














11 




df 


ss 


MS 


F 


Significance F 




12 


Regression 


5 


127254.53 


25450.91 


94.93 


1.67E-41 




13 


Residual 


129 


34586.77 


268.11 








14 


Total 


134 


161841.30 










15 
















16 




Coefficients 


Standard Error 


tStat 


P-value 


Lower 95% 


Upper 95% 


17 


Intercept 


-37.7396 


5.63098 


-6.70214 


0.00000 


-48.88062 


-26.59857 


18 


Temp 


1.3811 


0.14424 


9.57547 


0.00000 


1.09576 


1 .66652 


19 


Wind 


0.0404 


0.49789 


0.08105 


0.93553 


-0.94473 


1 .02544 


20 


Temp_Sq 


0.0005 


0.00343 


0.14576 


0.88434 


-0.00628 


0.00728 


21 


Wind_Sq 


-0.0023 


0.00971 


-0.23361 


0.81566 


-0.02149 


0.01695 


22 


Temp'Wind 


0.0015 


0.00514 


0.28438 


0.77657 


-0.00871 


0.01164 



Fitting a Polynomial Regression Equation, Two Predictors, with Excel 



1. The wind chill data are in CX17WIND.XLS, with wind chills in A2:A136, temperatures (degrees Fahrenheit) in B2:B1 36, 
wind speeds (mph) in C2:C136, and their labels in A1 :C1 . The squares of the temperatures and wind speeds are in 
columns D and E, respectively, and the cross-product values for (temperature x wind speed) are in column F. The 
values in columns D, E, and F were obtained by using the Excel drag-and-fill feature. For example, the cross-product 
column can be obtained by clicking on F2, then entering = B2*C2. Click at the lower right corner of F2 (cursor turns 
to "+" sign), then drag and fill to obtain the F3:F136 values. 

2. We will generate the regression equation for the second-order model with interaction: Click Tools. Click Data 
Analysis. Select Regression. Click OK. 

3. Enter A1:A136 into the Input Y Range box. Enter B1:F136 into the Input X Range box. Select Labels. Enter G1 into 
the Output Range box. Click OK. After slight adjustments to the column widths and the number of decimal places, 
the results are as shown above. 



MINITAB 

Fitting a Polynomial Regression Equation, Two Predictors, with Minitab 

Regression Analysis: WChill versus Temp, Wind, ... 

The regression equation is 

WChill = - 37.7 + 1.38 Temp + 0.040 Wind + 0.00050 Temp_Sq - 0.00227 Wind_Sq 
+ 0.00146 Temp-'Wind 



Predictor 


Coef 


SE Coef 




T 




P 


Constant 


-37.740 




5.631 


-6 


70 


0 


000 


Temp 


1.3811 




0.1442 


9 


58 


0 


000 


Wi nd 


0 . 0404 




0.4979 


0 


08 


0 


936 


Temp_Sq 


0.000500 


0 


.003429 


0 


15 


0 


884 


Wi nd_Sq 


-0.002269 


0 


.009715 


-0 


23 


0 


816 


Temp*Wi nd 


0.001463 


0 


.005143 


0 


28 


0 


777 


S = 16.3742 


R-Sq = 78 


6% 


R-Sq(adj) 




77 


8% 





(continued) 
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Analysis of Variance 



Source 


DF 


SS 


MS 


F P 


Regression 


5 


127255 


25451 


94.93 0.000 


Residual Error 


129 


34587 


268 




Total 


134 


161841 







Using file CX17WIND.MTW, with wind chills in C1, temperatures (degrees Fahrenheit) in C2, wind speeds (mph) in C3, 
temperature-squared values in C4, wind-squared values in C5, and (temperature x wind) values in C6: Click Stat, Select 
Regression. Click Regression. Enter C1 into the Response box. Enter C2-C6 into the Predictors box. Click OK. 



exercises 



17.12 A researcher suspects that the dependent variable 
y is linearly related to both x 1 and x 2 , but believes there 
is little or no interaction between the two predictor vari- 
ables. Of the models in this section, which one would 
likely be most suitable for this situation? 

17.13 In Exercise 17.12, suppose the researcher wishes to 
use one of the models in this section, but is uncertain as to 
whether y might be nonlinearly related to Xj and/or x 2 . 
Also, the researcher doesn't have any idea as to whether 
interaction might exist between x 1 and x 2 . Of the models 
in this section, which one would result in the highest R 2 
value while at the same time reducing the possibility that 
the researcher might "miss" an important aspect of the re- 
lationship between the dependent and predictor variables? 




i ViX .in 




200 + 5x 



2x 2 has 



17.14 The regression model y 

been fitted to a set of data. If x 2 = 10, what will be the 
effect on y if Xj increases by 1 unit? Will your answer 
change if x 2 = 20? Given the preceding, comment on 
whether interaction appears to exist between x 1 and x 2 . 

I data set/ Note: Exercises 17.15-17.20 require a 
computer and statistical software. 

17.15 The personnel director of the communications firm 
in Exercise 17.5 has just provided another variable describ- 
ing the company's technical sales representatives: their 
scores on a sales aptitude test they were recently given. 
Using data file XR17015, with y = average dollar sales per 
call, x x = years of experience, and x 2 = score on the sales 
aptitude test, fit a second-order model with interaction to 
the data. Interpret the R 2 value for the model and indicate 
whether, at the 0.05 level, the model is significant. 

17.16 Auto Rental News has reported U.S. operations 
data for 15 top rental-car companies. Using data file 
XR17016, with y = rental revenue (millions of dollars) for 
the most recent year of record, Xj = thousands of cars in 
service, and x 2 = number of rental locations: source: Wall 
Street Journal Almanac 1999, p. 649. 

a. Fit a first-order model of the form E(y) = f) 0 + f3 1 x 1 + 
P 2 x 2 . Interpret the R 2 value for the model and indicate 
whether, at the 0.01 level, the model is significant. 



b. Repeat part (a), but with an interaction term 

included. Interpreting the R 2 value, to what extent 
has inclusion of the interaction term improved the 
model's explanatory power? 

17.17 The Air Transport Association has reported the 
following kinds of information describing 23 individual 
U.S. airlines for the most recent year of record: total 
operating revenue ($millions), number of aircraft, 
number of employees, and number of aircraft departures. 
Using data file XR17017, with y = total operating revenue 
(millions of dollars), x 1 = number of employees, and 

x 2 = number of aircraft departures: source: Wall Street 
Journal Almanac 1999, p. 644. 

a. Fit a first-order model of the form E(y) = /3 0 + 
jSjXj + f) 2 x 2 . Interpret the R 2 value for the model 
and indicate whether, at the 0.01 level, the model is 
significant. 

b. Repeat part (a), but with an interaction term included. 
Interpreting the R 2 value, to what extent has inclusion 
of the interaction term improved the model's explana- 
tory power? 

17.18 Repeat Exercise 17.17, using the data in file XR17017, 
but with y = total operating revenue (millions of dollars), 
x 1 = number of aircraft, and x 2 = number of employees. 

17.19 For the Car and Driver information described 
in Exercise 17.9, it would seem reasonable that the 
number of seconds required to accelerate to 60 mph 
would depend on both a car's weight and its horsepower. 
Further, it would be logical to assume that horsepower 
and weight might interact. Using data file XR17009, with 
y = 0-60 mph acceleration time (seconds), x l = horse- 
power, and x 2 = curb weight (pounds), fit a model of 
the form E(y) = jS 0 + f) 1 x 1 + P 2 x 2 + fijX^x^ Interpret 
the R 2 value for the model and indicate whether, at the 
0.02 level, the model is significant. 

17.20 Repeat Exercise 17.19, but using a second-order 
model with interaction. Interpreting the R 2 value, to what 
extent has inclusion of the squared terms improved the 
model's explanatory power? 
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QUALITATIVE VARIABLES 



In Chapter 16, we briefly discussed the idea of including a binary qualitative vari- 
able in the regression analysis. Such a variable has two possible values, typically 
coded as 1 and 0. In examining home prices, whether a home has central air con- 
ditioning can be an informative variable. In studying employee salaries and 
wages, using gender as a qualitative variable can help in determining whether dis- 
crimination might exist within a company's workforce. In this section, we begin 
by reviewing the inclusion of a qualitative variable representing two possible cat- 
egories, then go on to consider qualitative variables that happen to have more 
than two possible values. In regression analysis, qualitative variables are some- 
times referred to as dummy variables. 

Qualitative Variable Representing 
Two Possible Categories 

One of the most important variables in determining a vehicle's highway fuel econ- 
omy is weight. However, body style could also be important, since some types of 
vehicles are not very aerodynamic and would tend to get fewer miles per gallon at 
highway speeds. In a multiple regression of highway fuel economy on (1) weight 
and (2) whether the vehicle is a sports-utility vehicle, the format of the fitted equa- 
tion is 

HwyMpg = b 0 + fcjWeight + b 2 SUV?, where HwyMpg = highway miles 

per gallon 
Weight = curb weight 
(pounds) 
SUV? = sports-utility 

vehicle dummy 
variable (1 if 
vehicle is an 
SUV, 0 if not) 



Example 


■7.. -. \ 







Qualitative Variable with Two Categories 



For a sample of vehicles described in Consumer Reports' New Car Preview 
2001, data include highway fuel economy test results, vehicle weights, and ve- 
hicle types. 3 Data file CX17FUEL describes 53 sports-utility vehicles, large or 
family sedans, and small cars in terms of their highway fuel economy and their 
curb weight. With the variable SUV? coded as 1 for sports-utility vehicles 
and 0 for other vehicles, we obtain the following fitted regression equation 
and R 2 : 

HwyMpg = 52.3 - 0.00534Weight -6.01SUV?, with R 2 = 0.784 



17.4 



^Consumer Reports, New Car Preview 2001. 
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SOLUTION 

Interpreting this equation, we can say that the estimated highway fuel economy 
will decrease by 0.00534 mpg for each additional pound of curb weight. The co- 
efficient for the SUV? variable indicates that, for a given curb weight, the esti- 
mated highway fuel economy will be 6.01 mpg lower if the vehicle is an SUV. The 
regression equation explains 78.4% of the variation in highway fuel economy for 
the vehicles in the sample. 



Qualitative Variables Representing 
More Than Two Possible Categories 

With slight modification, the strategy we've just employed can also be used when 
a qualitative variable happens to represent more than two possible categories. For 
the new-car data described previously, we have n = 3 types of vehicles in data file 
CX17FUEL, but we will need only (n — 1) = 2 qualitative variables to represent 
them. Using SUV? and FamSedan? as the two qualitative variables, our coding 
scheme is as follows: 



SUV? = 



FamSedan? - 



Represents: 



sports-utility vehicle 
large/family sedan 
small car 



A portion of the data from file CX17FUEL is shown here; it represents the first 
five vehicles in the listing. 



HwyMpg 


Weight 


SUV? 


FamSedan? 


26 


3225 


1 


0 


33 


3455 


0 


1 


30 


4070 


0 


1 


38 


3325 


0 


1 


39 


2780 


0 


0 



sports-utility vehicle 



large/family sedan 



small car 



example 



Qualitative Variable with More Than Two Categories 

For the sample of vehicles in data file CX17FUEL, and using the coding scheme de- 
scribed above, we carry out a multiple regression with HwyMpg as the depen- 
dent variable, and Weight, SUV?, and FamSedan? as the predictor variables. The 
procedures and results for Excel and Minitab are shown in Computer Solutions 
17.3. The equation can be expressed as 



HwyMpg = 50.54 - 0.00428Weight - 8.62SUV? 
with R 2 = 0.799 



2.58FamSedan?, 



SOLUTION 

Interpreting this equation, we can say that the estimated highway fuel economy 
will decrease by 0.00428 mpg for each additional pound of curb weight. The 
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Estimated HwyMpg = 50.54 - 0.00428Weight - 8.62SUV? - 2.58FamSedan? 



For small cars: 

Estimated HwyMpg = 50.54 - 0.00428Weight 



Estimated 
HwyMpg 



For large/family sedans: 

Estimated HwyMpg = 47.96 - 0.00428Weight 

For sports-utility vehicles: 

Estimated HwyMpg = 41 .92 - 0.00428Weight 



FIGURE 17.7 



Given the multiple regres- 
sion equation developed in 
Computer Solutions 17.3, 
these are the highway 
fuel-economy estimation 
equations for the individual 
types of vehicles in the 
analysis. 



Weight 



estimated highway fuel economy will be 8.62 mpg lower if the vehicle is an SUV 
and 2.58 mpg lower if it is a large or family sedan. If the vehicle is a small car (i.e., 
SUV? = 0 and FamSedan? = 0), the estimated highway fuel economy will be 
50.54 - 0.00428Weight - 8.62(0) - 2.58(0), or simply 50.54 - 0.00428Weight. 
The equation explains 79.9% of the variation in highway fuel economy. 

Figure 17.7 describes the linear estimation equations when we consider the 
three types of vehicles individually. For the equation estimating highway fuel econ- 
omy only for small cars, SUV? = 0, FamSedan? = 0, and the constant in the equa- 
tion is 50.54. The constants in the corresponding sports-utility vehicle and 
large/family sedan equations will be smaller. As shown in Figure 17.7, the constant 
in the estimation equation for sports-utility vehicles is 50.54 — 8.62, or 41.92, and 
the constant in the equation for large/family sedans is 50.54 — 2.58, or 47.96. 




Multiple Regression with Qualitative Predictor Variables 
EXCEL 






G H 


I 


J 


K 


L 


M 


1 


SUMMARY OUTPUT 












2 














3 


Regression Statistics 












4 


Multiple R 


0.894 












5 


R Square 


0.799 












6 


Adjusted R Square 


0.787 












7 


Standard Error 


3.089 












8 


Observations 


53 












9 
















10 


ANOVA 














11 




df 


SS 


MS 


F 


Significance F 




12 


Regression 


3 


1856.24 


618.75 


64.86 


0.000 




13 


Residual 


49 


467.46 


9.54 








14 


Total 


52 


2323.70 










15 
















16 




Coefficients 


Standard Error 


t Stat 


P- value 


Lower 95% 


Upper 95% 


17 


Intercept 


50.5445 


2.506 


20.168 


2.237E-25 


45.508 


55.581 


18 


Weight 


-0.0043 


0.001 


-4.738 


1.886E-05 


-0.006 


-0.002 


19 


SUV? 


-8.6198 


1.764 


-4.885 


1.145E-05 


-12.166 


-5.074 


20 


FamSedan? 


-2.5828 


1.348 


-1.915 


6.128E-02 


-5.292 


0.127 



(continued) 
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Excel Multiple Regression with Qualitative Predictor Variables 

1 . The new-car data are in CX17FUEL.xls with highway fuel economy (mpg) in A2:A54, curb weight (pounds) in B2:B54, 
whether the vehicle is an SUV (coded as 1 if yes, 0 if no) in C2:C54, whether the vehicle is a large or family sedan 
(coded as 1 if yes, 0 if no) in D2:D54, and their labels in A1 :D1 . Click Tools. Click Data Analysis. Select Regression. 
Click OK. 

2. Enter A1:A54 into the Input Y Range box. Enter B1:D54 into the Input X Range box. Select Labels. Enter G1 into 
the Output Range box. Click OK. After slight adjustments to the column widths and the number of decimal places, 
the results are as shown in the printout. 



MINITAB 

Minitab Multiple Regression with Qualitative Predictor Variables 
Regression Analysis: HwyMpg versus Weight, SUV?, FamSedan? 

The regression equation is 

HwyMpg = 50.5 - 0.00428 Weight - 8.62 SUV? - 2.58 FamSedan? 



Predictor Coef 


SE Coef 




T 




P 


Constant 50.545 




2.506 


20 


17 


0. 


000 


Weight -0.0042766 


0.I 


D009026 


-4. 


74 


0. 


000 


SUV? -8.620 




1.764 


-4 


.89 


0. 


000 


FamSedan? -2.583 




1.348 


-1 


.92 


0. 


061 


S = 3.08868 R-Sq = 79. 


9% 


R-Sq(adj) 


= 78.7% 






Analysis of Variance 














Source DF 




SS 


MS 




F 




Regression 3 


1856 


.24 618 


.75 


64. 


86 


i 


Residual Error 49 


467 


.46 9 


.54 








Total 52 


2323 


.70 











Using file CX17FUELMTW, with highway fuel economy (mpg) in C1, curb weight (pounds) in C2, whether the vehicle is an 
SUV (coded as 1 if yes, 0 if no) in C3, and whether the vehicle is a large or family sedan (coded as 1 if yes, 0 if no) in C4: 
Click Stat. Select Regression. Click Regression. Enter C1 into the Response box. Enter C2-C4 into the Predictors box. 
Click OK. 



exercises 




17.21 For a sample of 100 recent credit-card customers 
of a company's on-line store, the following data are 
available: the dollar amount of merchandise ordered 
during the visit, how much time (minutes) the customer 
spent at the site during the purchase visit, and whether the 
customer used a Visa credit card. Describe the format of a 
linear regression equation for estimating the dollar 
amount of the order on the basis of the other variables 
described above. If any qualitative variables are used, indi- 
cate the strategy for coding them. 



17.22 Repeat Exercise 17.21, but assume that more 
information is available regarding the specific type of 
credit card that was used for the purchase, that is, 
Visa, MasterCard, or other. 

17.23 Data to be used in a linear regression analysis 
involving clients of an investment counselor include infor- 
mation regarding the risk level each client has specified: 
low, medium, or high. Determine the number of qualita- 
tive independent variables that will be required to 
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represent this information and describe the coding 
scheme for each. 

17.24 Repeat Exercise 17.23, but for clients whose risk 
levels have been specified as low, medium, high, or very 
high. 

17.25 The manager of an ice cream store has developed a 
regression equation for estimating the number of 
customers on the basis of two independent variables: 

JCj = the high temperature (degrees Fahrenheit) for the 
day, and x 2 = whether the day is during the weekend 
(coded as 1) or is a weekday (coded as 0). For the estima- 
tion equation y = 100 + 5xj + 50x 2 , interpret the par- 
tial regression coefficients, then estimate the number of 
customers on an 80-degree Saturday. 

17.26 The dean of a law school has developed a 
regression equation for estimating the starting salary 
(thousands of dollars) of a new graduate on the basis of 
two independent variables: x 1 = the score achieved on 
the Law School Admission Test (LSAT) at the time of 
application, and x 2 — whether the new graduate's 
position is in the private sector (coded as 1) or the 
public sector (coded as 0). For the estimation equation 

y = 25 + O.lXj + 30x 2 , interpret the partial regression 
coefficients, then estimate the starting salary for a new 
graduate with an LSAT score of 160 who is entering the 
private sector. 

/ data Set / Note: Exercises 17.27-17.30 require a 
computer and statistical software. 

17.27 In a PC World comparison of computer hard 
drives, data describing the drives included the following 
variables: price (dollars), disk capacity (gigabytes), and 
rotational speed (RPM). The data are in file XR17027. 
With y = price, x 1 = disk capacity, and x 2 = rotational 
speed (coded as 1 = 7200 RPM and 0 = 5400 RPM), 
determine the linear regression equation for estimating 
price as a function of the two predictor variables and 
interpret the partial regression coefficients. For a given 
storage capacity, how much extra would one expect to 
pay for a hard drive with the higher rotational speed? 
source: Stan Miastkowski, "LivirV Large," PC World, March 2001, p. 109. 

17.28 A researcher has observed 50 automatic teller ma- 
chine (ATM) users outside the gift shop of a museum, 
and the data are in file XR17028. With y = amount with- 
drawn (dollars), Xj = time required for the transaction 
(seconds), and x 2 = gender (1 = female, 0 = male), 
determine the linear regression equation for estimating 
the withdrawal amount as a function of the two predictor 
variables and interpret the partial regression coefficients. 



According to the equation, how much difference does 
gender tend to make in the amount withdrawn? 

17.29 An efficiency expert has studied 12 employees who 
perform similar assembly tasks, recording productivity 
(units per hour), number of years of experience, and which 
one of three popular assembly methods the individual has 
chosen to use in performing the task. Given the data 
shown here, determine the linear regression equation for 
estimating productivity based on the other variables. For 
any qualitative variables that are used, be sure to specify 
the coding strategy employed for each. Comment on the 
explanatory power of the regression equation and interpret 
the partial regression coefficients. 



Productivity Years of Assembly 
Worker units/hour experience method 



1 


75 


7 


A 


2 


88 


10 


C 


3 


91 


4 


B 


4 


93 


5 


B 


5 


95 


11 


C 


6 


77 


3 


A 


7 


97 


12 


B 


8 


85 


10 


C 


9 


102 


12 


C 


10 


93 


13 


A 


11 


112 


12 


B 


12 


86 


14 


A 



17.30 For a small sample of suburban homes recently 
sold, the selling price ($), living space (square feet), and 
suburban area (Sunny Haven, Easy Acres, or Rolling Hills) 
have been recorded. Given the data shown here, determine 
the linear regression equation for estimating selling price 
based on the other variables. For any qualitative variables 
that are used, be sure to specify the coding strategy 
employed for each. Comment on the explanatory power 
of the regression equation and interpret the partial 
regression coefficients. 



Home 


Price 


Space 


Area 


1 


167,900 


2860 


Easy 


2 


184,800 


2050 


Sunny 


3 


161,000 


21 10 


Rolling 


4 


160,800 


2950 


Easy 


5 


148,200 


1570 


Sunny 


6 


128,600 


1860 


Rolling 


7 


145,700 


2620 


Rolling 


8 


178,500 


2010 


Sunny 


9 


146,100 


2710 


Easy 


10 


146,300 


2200 


Rolling 
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17.5 DATA TRANSFORMATIONS 



When the relationship between the dependent variable and one or more predictor 
variables is not linear, one of the polynomial models discussed in Sections 17.2 
and 17.3 will often provide satisfactory results. However, we sometimes en- 
counter variables for which the underlying relationship is better described by 
other models that can be converted to linear form. This is possible through data 
transformation, which is what we did in Section 17.2 when we complemented x 
by forming the x 2 and % 3 terms in the third-order model. Many other types of 
transformations are possible, such as converting x to its square root, its inverse, 
or its logarithm. We will examine two nonlinear models that rely on logarithmic 
transformations to achieve linearity. 

For the exponential model with estimation equation y = b 0 b\, conversion to 
a linear form involves taking the logarithms of both sides of the equation. Either 
natural logarithms (base e = 2.71828...) or common logarithms (base 10) can be 
used, as long as we are consistent. We will be using common logarithms. The ex- 
ponential estimation equation is converted to linear form as follows: 

Conversion of the exponential equation, y = b 0 b* , to linear form: 
log y = log b 0 + x log b v or log y = log b 0 + (log bjx 

As a brief review of logarithms, recall that a number is converted to a com- 
mon logarithm by expressing it in terms of the power to which 10 must be raised. 
Also, multiplying two numbers is equivalent to adding their logarithms, and rais- 
ing a quantity to a power is the same as multiplying its logarithm by that power. 
For example, 



• Logarithms and multiplication 



10 2 




j q0.69897 




jq2.69897 



■For 100 • 5 = 500, log 100 + log 5 = log 500. 



100 • 5 = 500 



■ Logarithms and exponentiation 



(10 ,2MI2 ) 2 



10 2 



For 16 2 = 256, 2(log 16) = log 256. 



16' = 256 



The second model we will consider is the multiplicative model. If there are 
two predictor variables, the estimated value of the dependent variable will be 
y = b 0 x^x2 2 - The multiplicative equation is converted to linear form as shown next. 

Conversion of the multiplicative equation, y = b^x^x^i to linear form: 
log y = log b Q + b x log x 1 + b 2 log x 2 
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As an example of the logarithmic transformation of data, we will use data file 
CX17ACC, which lists the following variables for a sample consisting of sports cars 
and sedans described in an early issue of the Car and Driver Yearbook: 4 

y = 0-60 mph acceleration time (seconds) 
x 1 = engine horsepower 
x 2 = curb weight (pounds) 

Considering the possibility that the time to reach 60 mph might be directly pro- 
portional to horsepower and inversely proportional to curb weight (in essence, a mul- 
tiplicative model with estimation equation y = b^pc^x^ 2 )^ we can use logarithms in 
converting this model to linear form. This can be done with either Excel or Minitab, 
and the procedures and results are shown in Computer Solutions 17.4. The equation 
and the result when it is converted back to multiplicative form are shown here: 

Logarithmic form: Log_0-60 = -0.0528 - 0.8607 log_hp + 0.8691 log_wt 

Multiplicative form: 0-60 = 0.8855 X hp -0 - 8607 X curbwt 0 - 8691 

In this conversion, the antilog of -0.0528 is 10~ 0 - 0528 = 0.8855. The esti- 
mated 0-60 mph time for a 100-horsepower car weighing 2000 pounds is calcu- 
lated as 0.8855 X lOO" 0 8607 X 2000 0 8691 = 12.44 seconds. 

In applying the models in this section, all of the original data values must be 
greater than zero. Also, because the dependent variable is log y instead of y, a com- 
parison of R 2 values with those from regression models having y as the dependent 
variable may not be appropriate because of the differing units of measurement. 5 
The transformations in this section are but two of the many that can be done. For 
a more complete discussion of the possibilities and their applications, the interested 
reader may refer to one of the more advanced treatments cited in the footnote. 6 




Transformation of the Multiplicative Model 
EXCEL 





G H 


I 


J 


K 


L 


M 


1 


SUMMARY OUTPUT 












2 














3 


Regression Statistics 












4 


Multiple R 


0.923 












5 


R Square 


0.852 












6 


Adjusted R Square 


0.836 












7 


Standard Error 


0.066 












8 


Observations 


21 












9 
















10 


ANOVA 














11 




df 


SS 


MS 


F 


Significance F 




12 


Regression 


2 


0.455 


0.228 


52.014 


0.000 




13 


Residual 


18 


0.079 


0.004 








14 


Total 


20 


0.534 










15 
















16 




Coefficients 


Standard Error 


tStat 


P-value 


Lower 95% 


Upper 95% 


17 


Intercept 


-0.0528 


0.5531 


-0.0954 


0.925 


-1.2148 


1.1092 


18 


Log_hp 


-0.8607 


0.1062 


-8.1022 


0.000 


-1.0839 


-0.6375 


19 


Log_wt 


0.8691 


0.2170 


4.0042 


0.001 


0.4131 


1.3251 



(continued) 

4 1964 Car and Driver Yearbook, pp. 14-63. 

5 T. Dielman, Applied Regression Analysis, Pacific Grove, CA: Duxbury Press, 1996, p. 240. 

''See Dielman or J. Neter, M. Kutner, C. Nachtsheim, and W. Wasserman, Applied Linear Statistical 

Models, Homewood, IL: Irwin, 1996. 
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Excel Transformation of the Multiplicative Model 

1 . The data are in CX17ACC.XLS, with 0-60 mph times (seconds) in A2:A22, horsepower ratings in B2:B22, curb weights 
(pounds) in C2:C22, and their labels in A1:C1. 

2. Click to select cell D2. Click the f x (Insert Function) button on the toolbar. In the Category box, select Math & Trig. 
Within the Function box, select LOG10. Click OK. Enter A2 into the Number box. Click OK. The logarithm of the first 
0-60-mph time will now be in cell D2. 

3. Click the lower right corner of cell D2 (cursor turns to "+" sign), then drag and fill to F2. Similarly, click the lower right cor- 
ner of F2, then drag and fill to F22. Enter the column labels (Log_0-60, Log_hp, and Log_wt, respectively) into D1 :F1 . 

4. Click Tools. Click Data Analysis. Select Regression. Click OK. 

5. Enter D1:D22 into the Input Y Range box. Enter E1:F22 into the Input X Range box. Select Labels. Enter G1 into 
the Output Range box. Click OK. After slight adjustments to the column widths and the number of decimal places, 
the results are as shown above. 



MINITAB 

Minitab Transformation of the Multiplicative Model 
Regression Analysis: Log_0-60 versus Log_hp, Log_wt 

The regression equation is 

Log_0-60 = - 0.053 - 0.861 Log_hp + 0.869 Log_wt 

Predictor Coef SE Coef T P 

Constant -0.0528 0.5531 -0.10 0.925 

Log_hp -0.8607 0.1062 -8.10 0.000 

Log_wt 0.8691 0.2170 4.00 0.001 

S = 0.0661493 R-Sq = 85.2% R-Sq(adj) = 83.6% 

Analysis of Variance 

Source DF SS MS F P 

Regression 2 0.45520 0.22760 52.01 0.000 

Residual Error 18 0.07876 0.00438 
Total 20 0.53396 

1 . Using file CX17ACC.MTW, 0-60 times in C1, horsepower ratings in C2, and curb weights in C3: Click Calc. Click Calcu- 
lator. In the Functions box, select Log base 10. Within the parentheses in the Expression box, enter C1. Enter C4 into 
the Store result in variable box. Click OK. 

2. Repeat this procedure two times, placing the logarithms for C2 into column C5 and placing the logs for C3 into C6. 

3. Enter the Log_0-60, Log_hp, and Log_wt labels at the top of columns C4-C6, respectively. Click Stat. Select Regres- 
sion. Click Regression. Enter C4 into the Response box. Enter C5-C6 into the Predictors box. Click OK. 




17.31 Following data transformation, regression 
analysis results in the estimation equation 

log y = —0.179 + 0.140^. Transform this equation 
to the equivalent exponential model with estimation 
equation y = b a b\. (The base of the logarithms is 10.) 

17.32 Following data transformation, regression 
analysis results in the estimation equation 



logy = 3.15 + 0.473 log x 1 + 0.354 log x 2 . Trans- 
form this equation to the equivalent multiplicative model 
with estimation equation y = b^x^x^ 1 - (The base of the 
logarithms is 10.) 

/ data set/ Note: Exercises 17.33-17.36 require a 
computer and statistical software. Use logarithms with 
base 10. 
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17.33 For the situation and variables in Exercise 17.7, 
fit an exponential model and use it to estimate the 
average room rate for a market in which the average 
occupancy rate is 70%. The underlying data are in file 
XR17007. 

17.34 For the situation and variables in Exercise 17.8, 
fit an exponential model and use it to estimate the cost 
per gigabyte of storage space for the year 2008. The under- 
lying data are in file XR17008. 



17.35 For the situation and variables in Exercise 17.17, fit 
a multiplicative model and use it to estimate the total oper- 
ating revenue for an airline with 2000 employees and 
10,000 departures. The underlying data are in file XR17017. 

17.36 For the situation and variables in Exercise 17.16, 
fit a multiplicative model and use it to estimate the rental 
revenue for a company with 80,000 cars in service and 
2500 rental locations. The underlying data are in file 
XR17016. 



MULTICOLLINEARITY 1 7.6 



In Chapter 16, we briefly discussed multicollinearity, a condition in which two or 
more of the independent variables are highly correlated with each other. When this 
happens, partial regression coefficients can become both statistically unreliable and 
difficult to interpret. This is because they are not really "saying" different things 
about y. We likened multicollinearity to a situation where old-time comedians Lau- 
rel and Hardy are carrying a piano, and each comedian is unsure as to how much 
of the load he should carry. As a result, the piano will tend to be very unstable. Mul- 
ticollinearity is not a problem if we are simply substituting x values into the equa- 
tion and attempting to estimate y. However, it is a problem when we're trying 
to interpret the partial regression coefficients for the x variables that are highly 
correlated. 

Besides the high correlation between two or more independent variables, 
there can be other clues to the presence of multicollinearity: (1) an independent 
variable known to be an important predictor ends up having a partial regression 
coefficient that is not significant; (2) a partial regression coefficient that should be 
positive turns out to be negative, or vice versa; or (3) when an independent vari- 
able is added or removed, the partial regression coefficients for the other indepen- 
dent variables change drastically. 

A practical way to identify multicollinearity is to generate a correlation ma- 
trix. This is a matrix showing the correlation of each variable with each of the 
other variables. Computer Solutions 17.5 shows the correlation matrix for the 
following y and x variables for a sample consisting of 34 large automobile deal- 
erships. 7 The data are in file CX17DEAL. 

y = TOTSALES, total sales generated during the year (thousands of dollars) 
x 1 = PARTSERV, parts and service sales for the year (thousands of dollars) 
x 2 = RETUSED, number of used cars sold at retail 
x 3 = RETNEW, number of new cars sold at retail 
x 4 = FLEET, number of cars sold to fleet customers 
x s = YRSINBUS, number of years the dealership has been in business 

With a total of six variables, there are (6 • 5)/2 = 15 possible pairs of vari- 
ables. As shown in the first column of the correlation matrix, several of the 



7 Crain Communications, Inc., Automotive News 1988 Market Data Book, pp. 95-108. 
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The Correlation Matrix 
EXCEL 





G 


H 


I 


J 


K 


L 


M 


1 




TOTSALES 


PARTSERV 


RETUSED 


RETNEW 


FLEET 


YRSINBUS 


2 


TOTSALES 


1.000 












3 


PARTSERV 


0.794 


1.000 










4 


RETUSED 


0.780 


0.386 


1.000 








5 


RETNEW 


0.759 


0.333 


0.835 


1.000 






6 


FLEET 


0.358 


0.322 


0.238 


0.201 


1.000 




7 


YRSINBUS 


0.190 


0.275 


-0.085 


-0.130 


0.336 


1.000 



Excel Correlation Matrix 

1 . The dealership data are in CX17DEAL.XLS, with total sales in A2:A35, parts and service sales in B2:B35, used retail units 
in C2:C35, new retail units in D2:D35, fleet units in E2:E35, years in business in F2:F35, and their labels in A1 :F1 . Click 
Tools. Click Data Analysis. Select Correlation. Click OK. 

2. Enter A1:F35 into the Input Range box. In the Grouped By menu, select Columns. Select Labels in First Row. Enter 
G1 into the Output Range box. Click OK. After adjusting the column widths and the number of decimal places, the 
results are as shown here. 



MINITAB 

Minitab Correlation Matrix 

Correlations: TOTSALES, PARTSERV, RETUSED, RETNEW, FLEET, YRSINBUS 

TOTSALES PARTSERV RETUSED RETNEW FLEET 
PARTSERV 0.794 

RETUSED 0.780 0.386 

RETNEW 0.759 0.333 0.835 

FLEET 0.358 0.322 0.238 0.201 

YRSINBUS 0.190 0.275 -0.085 -0.130 0.336 

Cell Contents: Pearson correlation 

Using file CX17DEAL.MTW, with total sales in C 1 , parts and service sales in C2, used retail units in C3, new retail units in 
C4, fleet units in C5, and years in business in C6: Click Stat. Select Basic Statistics. Click Correlation. Enter C1-C6 into 
the Variables box. Click OK. 



independent variables are highly correlated with the dependent variable, TOT- 
SALES. However, there is a very high correlation (0.835) between the number of 
used (RETUSED) and new (RETNEW) cars sold at retail. This high correlation 
between two of the independent variables is an example of multicollinearity. 

We can reduce the impact of multicollinearity by breaking up "partners" like 
the RETUSED and RETNEW variables and not including both of them in the 
equation at the same time. For example, we could decide to include only the 
"partner" with the higher correlation with y. An alternative approach involves 
the stepwise regression method described in the next section. 
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17.37 What is a correlation matrix, and what role does it 
play in multiple regression? 

17.38 One of the indicators that multicollinearity is pre- 
sent is when a variable that is known to be an important 
predictor ends up having a partial regression coefficient 
that is not significant. What are some of the other signals 
for the presence of multicollinearity? 

17.39 In linear regression models, for what types of uses 
or applications would multicollinearity tend to be a prob- 
lem? Are there any applications or uses for which 
multicollinearity would not be a problem? 

17.40 A linear regression analysis was performed using x l 
as a predictor variable. The p-value associated with the re- 
gression coefficient for x 1 was 0.02, and the sign of the 
coefficient was negative. Another analysis was performed, 
this time using both x 1 and x 2 as predictor variables. In 
the second analysis, the partial regression coefficient for 
x l had a positive sign and the p-value associated with it 
was 0.39. Given these results, would multicollinearity 
appear to be present? 

17.41 Four predictor variables are being considered for 
use in a linear regression model. Given the accompanying 
correlation matrix, does it appear that multicollinearity 
could be a problem? 





E 


F 


G 


H 


I 


1 




x1 


x2 


x3 


x4 


2 


x1 


1.000 








3 


x2 


0.483 


1.000 






4 


x3 


0.952 


0.516 


1.000 




5 


x4 


-0.164 


-0.202 


-0.113 


1.000 



/ data SEt / Note: A computer and statistical 
software are required for Exercises 17.42-17.45. 

17.42 For the situation and variables in Exercise 17.16, use 
data file XR17016 and generate a correlation matrix for the 
independent variables. Examine the correlation matrix and 
comment on the possible presence of multicollinearity. 

17.43 For the situation and variables in Exercise 17.17, 
use data file XR17017 and generate a correlation matrix 
for these independent variables: number of aircraft, num- 
ber of employees, and number of aircraft departures. Ex- 
amine the correlation matrix and comment on the possi- 
ble presence of multicollinearity. 



17.44 Circulation (millions) for Sunday newspapers versus 
morning and evening daily papers for 1996-2004 are 
shown here. The data are also in file XR17044. With Sunday 
circulation as the dependent variable, generate the correla- 
tion matrix for the independent variables and comment on 
the possible presence of multicollinearity. source: Bureau of the 
Census, Statistical Abstract of the United States 2002, p. 700 and Statistical 
Abstract of the United States 2006, p. 738. 



Year 


Sunday 


Daily, 
Morning 


Daily, 
Evening 


1996 


60.8 


44.8 


12.2 


1997 


60.5 


45.4 


11.3 


1998 


60.1 


45.6 


10.5 


1999 


59.9 


46.0 


10.0 


2000 


59.4 


46.8 


9.0 


2001 


59.1 


46.8 


8.8 


2002 


58.8 


46.6 


8.6 


2003 


58.5 


46.9 


8.3 


2004 


57.8 


46.9 


7.7 



17.45 For 1985-1995, production in the energy sector 
included the quantities shown here. The variables and 
their units are total nonrenewable energy production 
(quadrillion Btu), crude oil (million bbl per day), natural 
gas (trillion cubic feet), coal (million short tons), and 
nuclear power (billion kWh). The data are also in 
file XR17045. For these data, and with y = total 
nonrenewable energy production, generate the correlation 
matrix for the independent variables and comment on the 
possible presence of multicollinearity. source: Energy 
Information Administration, Annual Energy Review 1995, pp. 5, 141, 
189, 211, 261 





Total Non- 


Crude 


Natural 






Year 


renewable 


Oil 


Gas 


Coal 


Nuclear 


1985 


61.7 


8.97 


16.45 


883.6 


383.7 


1986 


61.1 


8.68 


16.06 


890.3 


414.0 


1987 


62.1 


8.35 


16.62 


918.8 


455.3 


1988 


63.5 


8.14 


17.10 


950.3 


527.0 


1989 


63.2 


7.61 


17.31 


980.7 


529.4 


1990 


64.7 


7.36 


17.81 


1029.1 


576.9 


1991 


64.4 


7.42 


17.70 


996.0 


612.6 


1992 


64.1 


7.17 


17.84 


997.5 


618.8 


1993 


62.2 


6.85 


18.10 


954.4 


610.3 


1994 


64.6 


6.66 


18.75 


1033.5 


640.4 


1995 


64.6 


6.53 


18.71 


1029.7 


673.4 
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1 7.7 STEPWISE REGRESSION 



In stepwise regression, the variables enter the regression analysis one at a time. 
The first x variable to enter is the one that explains the greatest amount of varia- 
tion in y. The second x variable to enter will be the one that explains the greatest 
amount of the remaining variation in y, and so on. At each "step," the computer 
decides which, if any, of the remaining x variables should be brought in. Every 
step results in a new regression equation and (if you choose) an updated printout 
of this equation and its analysis. 

Stepwise regression is especially useful when there are a great many x vari- 
ables. Under these conditions, a lot of independent variables may be highly corre- 
lated, and it can be difficult for us humans to sort out the confusion they 
represent. The general idea of stepwise regression is the balancing act of trying to 
(1) explain the most possible variation in y, while (2) using the fewest possible x 
variables. 

In general, it's advisable to test a model that has already been hypothesized (e.g., 
a multiple linear regression between y and x p x 3 , and x 6 ). If you don't have a spe- 
cific model in mind, however, stepwise regression is very useful when you have a 
great many independent variables and wish to determine which linear model might 
best apply. 



Example ' «9Hl 



Stepwise Regression 

As shown in Computer Solutions 17.5, two of the five independent variables (RE- 
TUSED and RETNEW) are highly correlated with each other. In viewing the appli- 
cation of stepwise regression to these data, we'll observe how the analysis is 
influenced by this situation in which multicollinearity is obviously present. 

SOLUTION 

Computer Solutions 17.6 shows the stepwise regression analysis of the underlying 
data for the six dealership variables. Although there are five independent variables, 
the procedure has gone through just four steps. The number of vehicles sold to fleet 
customers (FLEET) is denied entrance to the regression equation. Here is a brief 
discussion of the steps in the Computer Solutions 17.6 printout. 

Step 1 The first variable invited into the equation is PARTSERV, sales revenue 
from parts and service. This is no surprise, since the correlation matrix shows this 
to be the independent variable most highly correlated with TOTSALES. At this 
step, the regression equation is 

TOTSALES = -347.6 + 9.55PARTSERV 

This regression equation explains 63.04% of the variation in TOTSALES, and the 
multiple standard error of estimate is listed as 23143, or $23,143 million. 

Step 2 The next variable to enter is RETNEW, the number of new cars sold at 
retail. The regression equation now becomes 

TOTSALES = -6156.0 + 7.32PARTSERV + 10. 8 RETNEW 
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computer solutions 17.6 



Stepwise Regression 
EXCEL 





HI I I J 


K 


L 


M 


N 


1 


Results of stepwise regression 










2 












3 


Step 1 - Entering variable: PARTSERV 








4 














5 


Summary measures 












6 




Multiple R 


0.7940 










7 




R-Square 


0.6304 










8 




Adj R-Square 


0.6189 










9 




StErr of Est 


23143 










10 
















11 


ANOVA Table 












12 




Source 


df 


SS 


MS 


F 


p-value 


13 




Explained 


1 


29236853124 


29236853124 


54.6 


0.000 


14 




Unexplained 


32 


17138622464 


535581952 






15 
















16 


Regression coefficients 










17 






Coefficient 


Std Err 


t-value 


p-value 




18 




Constant 


-347.57 


7685.61 


-0.045 


0.964 




19 




PARTSERV 


9.55 


1.29 


7.388 


0.000 




20 
















21 


Step 2 - Entering variable: RETNEW 








22 














23 


Summary measures 




Change 


% Change 






24 




Multiple R 


0.9513 


0.1573 


%19.8 






25 




R-Square 


0.9050 


0.2746 


%43.6 






26 




Adj R-Square 


0.8989 


0.2800 


%45.2 






27 




StErr of Est 


11920 


-11223 


-%48.5 






28 
















29 


ANOVA Table 












30 




Source 


df 


SS 


MS 


F 


p-value 


31 




Explained 


2 


41971102084 


20985551042 


147.7 


0.0000 


32 




Unexplained 


31 


4404373504 


1 42076565 






33 
















34 


Regression coefficients 










35 






Coefficient 


Std Err 


t-value 


p-value 




36 




Constant 


-6156.03 


4005.73 


-1.537 


0.134 




37 




PARTSERV 


7.32 


0.71 


10.372 


0.000 




38 




RETNEW 


10.81 


1.14 


9.467 


0.000 




39 
















40 


Step 3 


- Entering variable: RETUSED 








41 
















42 


Summary measures 




Change 


% Change 






43 




Multiple R 


0.9632 


0.0119 


%1.3 






44 




R-Square 


0.9279 


0.0228 


%2.5 






45 




Adj R-Square 


0.9206 


0.0217 


%2.4 






46 




StErr of Est 


10561 


-1359 


-%1 1 .4 






47 
















48 


ANOVA Table 












49 




Source 


df 


SS 


MS 


F 


p-value 


50 




Explained 


3 


43029497732 


14343165911 


128.60 


0.000 


51 




Unexplained 


30 


3345977856 


111532595 






52 
















53 


Regression coefficients 










54 






Coefficient 


Std Err 


t-value 


p-value 




55 




Constant 


-5874.83 


3550.30 


-1.655 


0.108 




56 




PARTSERV 


6.91 


0.64 


10.812 


0.000 




57 




RETNEW 


6.46 


1.74 


3.724 


0.001 




58 




RETUSED 


10.20 


3.31 


3.081 


0.004 




59 
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60 


Step 4 - Entering variable: YRSINBUS 








61 














62 


Summary measures 




Change 


% Change 






63 




Multiple R 


0.9691 


0.0058 


%0.6 






64 




R-Square 


0.9391 


0.0112 


%1 .2 






65 




Adj R-Square 


0.9307 


0.0100 


%1.1 






66 




StErr of Est 


9871 


-690 


-%6.5 






67 
















68 


ANOVA Table 












69 




Source 


df 


SS 


MS 


F 


p-value 


70 




Explained 


4 


43549942148 


10887485537 


1 1 1 .74 


0.000 


71 




Unexplained 


29 


2825533440 


97432188 






72 
















73 


Regression coefficients 










74 






Coefficient 


Std Err 


t-value 


p-value 




75 




Constant 


-10235.89 


3817.27 


-2.681 


0.012 




76 




PARTSERV 


6.41 


0.64 


10.100 


0.000 




77 




RETNEW 


6.92 


1.63 


4.237 


0.000 




78 




RETUSED 


10.42 


3.10 


3.362 


0.002 




79 




YRSINBUS 


388.30 


168.01 


2.311 


0.028 





Excel Stepwise Regression 

1 . Using file CX17DEAL.XLS, with total sales in A2:A35, parts and service sales in B2:B35, used retail units in C2:C35, new 
retail units in D2:D35, fleet units in E2:E35, years in business in F2:F35, and their labels in A1:F1: Click on any cell 
within the data field. Click Tools. Click Data Analysis Plus. Select Stepwise Regression. Click OK. 

2. Check to verify that the cursor is on a cell within the data field, then click Continue. If there are no missing values 
within the data, click OK. Click on TOTSALES as the response variable and click OK. Holding down the Ctrl key, 
click on each of the five explanatory variables to be used: PARTSERV, RETUSED, RETNEW, FLEET, and YRSINBUS. 
Click OK. 

3. In the Stepwise regression parameters menu, click p-values and click OK. Accept the default p-to-enter and p-to- 

leave parameters (0.05 and 0.10, respectively) and click OK. Select any scatterplots that are desired, and click OK. In 
the Location of results menu, select Same worksheet as the data. Click OK. After adjusting the column widths and 
the number of decimal places, the results are as shown here. 



MINI TAB 

Minitab Stepwise Regression 

Stepwise Regression: TOTSALES versus PARTSERV, RETUSED, 

Alpha-to-Enter: 0.05 Alpha-to-Remove: 0.1 

Response is TOTSALES on 5 predictors, with N = 34 

Step 12 3 4 

Constant -347.6 -6156.0 -5874.8 -10235.9 

PARTSERV 9.55 7.32 6.91 6.41 

T-Value 7.39 10.37 10.81 10.10 

P-Value 0.000 0.000 0.000 0.000 

RETNEW 10.8 6.5 6.9 

T-Value 9.47 3.72 4.24 

P-Value 0.000 0.001 0.000 
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RETUSED 
T-Value 
P-Value 



10.2 
3.08 
0.004 



10.4 
3.36 
0.002 



YRSINBUS 

T-Value 

P-Value 



388 
2.31 
0.028 



S 

R-Sq 

R-Sq(adj) 
Mallows C-p 



23143 
63.04 
61.89 
140.1 



11920 
90.50 
89.89 
15.7 



10561 
92.79 
92.06 
7.2 



9871 
93.91 
93.07 
4.0 



1 . Using file CX17DEAL.MTW, with total sales in CI, parts and service sales in C2, used retail units in C3, new retail units 
in C4, fleet units in C5, and years in business in C6: Click Stat. Select Regression. Click Stepwise. Enter C1 into the 
Response box. Enter C2-C6 into the Predictors box. 

2. For consistency, we will use the same entry/removal parameters as the defaults in the Excel procedure: Click Methods. 
Select Use alpha values. Select Stepwise (forward and backward). Enter 0.05 into the Alpha to enter box. Enter 
0.10 into the Alpha to remove box. Click OK. Click OK. 



The inclusion of RETNEW has shaken things up a bit by changing the constant 
from —347.6 to —6156.0. This is not too unusual, since improving the equation 
means changing the equation. The percentage of variation in TOTSALES that is 
explained rises from 63.04% to 90.50%, a hefty increase. The estimating ability 
of the equation has also improved: the multiple standard error of estimate is re- 
duced to $11,920 million. 

Step 3 Here comes the Laurel and Hardy piano act. Recall that RETNEW and 
RETUSED are highly correlated (r = 0.835). However, of the variables not yet in- 
cluded, RETUSED explains the greatest amount of remaining variability in TOT- 
SALES, so it comes into the regression equation: 

TOTSALES = -5874.8 + 6.91PARTSERV + 6. 5 RETNEW + 10.2RETUSED 

The constant term and the partial regression coefficient for PARTSERV have 
changed slightly, but the partial regression coefficient for RETNEW has de- 
creased dramatically. (In the Laurel and Hardy analogy, RETNEW was carry- 
ing most of the piano in step 2, but now RETUSED has taken up part of the 
load.) Notice how the i-ratio for RETNEW has gone down from 9.47 to 3.72, 
another reflection of the new sharing of the load. Although RETUSED has a 
high correlation with TOTSALES (r = 0.780), it has not added much to the 
predictive power of the regression equation. The percentage of variation ex- 
plained has increased only slightly, from 90.50% to 92.79%. Likewise, the 
multiple standard error of estimate is reduced very little and is now $10,561 
million. 

If our sole purpose were the prediction of TOTSALES, there would be no 
problem in using the equations in this step and the next. On the other hand, if our 
purpose is to be able to interpret the partial regression coefficients, we should use 
only the equation that resulted from step 2. At step 3, multicollinearity has set in, 
and we have lost the ability to make meaningful interpretations of the partial re- 
gression coefficients. 
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Step 4 The next entrant is YRSINBUS, the number of years the dealership has 
been in business. The regression equation is now 

TOTSALES = -10,235.9 + 6.41PARTSERV + 6.9RETNEW 
+ 10.4RETUSED + 3 8 8.0 YRSINBUS 

Now that RETNEW and RETUSED have the piano under control, their partial 
regression coefficients have not changed much from the previous step. However, 
the constant term is now —10,235.9. Bringing YRSINBUS into the equation in- 
creases the explained percentage of variation in TOTSALES to 93.91%, and the 
multiple standard error of estimate decreases to $9,871 million. 



Once stepwise regression identifies a subset of the original independent vari- 
ables and lists them in a multiple regression equation, it is useful to include only 
these independent variables in a conventional multiple regression analysis. The pro- 
cedure described in this section is known as forward selection stepwise regression, 
since we begin with no predictor variables and add one predictor at each iteration. 
There is also a backward elimination stepwise regression, in which we begin with 
the entire set of predictor variables and eliminate one variable at each iteration. 




exercises 



17.46 What is stepwise regression, and when is it 
desirable to make use of this multiple regression 
technique? 

17.47 In general, on what basis are independent variables 
selected for entry into the equation during stepwise 
regression? 

17.48 The two largest values in a correlation matrix 
are the 0.850 correlation between y and x 4 , and the 
0.790 correlation between y and x 9 . During a stepwise 
regression analysis, x 4 is the first independent variable 
brought into the equation. Will x 9 necessarily be next? 
If not, why not? 

17.49 The following Minitab printout is for a stepwise 
regression with 10 independent variables and 50 
observations: 

Stepwise Regression: y versus xl, x2, x3, x4, 
x5, x6, x7, x8, x9, xlO 

Alpha-to-Enter: 0.05 AT pha-to-Remove : 0.1 



Response is 


y 


on 10 


aredictors, with N = 


Step 


l 


2 


3 


Constant 


83.91 


106.85 


124.76 


x5 


-0.33 


-0.35 


-0.30 


T-Value 


-2.39 


-2.65 


-2.30 


P-Value 


0.021 


0.011 


0.026 


x2 




-0.33 


-0.38 


T-Value 




-2.30 


-2.70 


P-Value 




0.026 


0.010 



x9 






-0.27 


T-Value 






-2.07 


P-Value 






0.044 


S 


13.8 


13.2 


12.8 


R-Sq 


10.63 


19.66 


26.52 


R-Sq(adj) 


8.76 


16.25 


21.73 


C-p 


3.9 


0.9 


-0.9 



a. List the order in which the independent variables 
have been introduced. 

b. What is the multiple regression equation after two 
independent variables have been introduced? 

c. As of step 3, three variables have been introduced. 
Which, if any, partial regression coefficients differ sig- 
nificantly from zero at the 0.05 level? At the 0.02 level? 

/ data SEt / Note: Exercises 17.50-17.57 require a 
computer and statistical software. We suggest that 
Minitab users specify alpha-to-enter as 0.05 and alpha- 
to-remove as 0.10. (See Computer Solutions 17.6.) 

17.50 For the data in Exercise 17.44, and using Sunday 
circulation as the dependent variable, perform 
conventional multiple regression and stepwise regression 
analyses on the data, then compare the results of the two 
approaches. Lhe data are also in file XR17044. 

17.51 For the data in Exercise 17.45, and using total non- 
renewable energy production as the dependent variable, 
perform conventional multiple regression and stepwise re- 
gression analyses on the data, then compare the results of 
the two approaches. The data are also in file XR17045. 
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17.52 For the situation, data, and variables described in 
Exercise 17.16, perform conventional multiple regression 
and stepwise regression analyses on the data, then 
compare the results of the two approaches. The data are 
in file XR17016. 

17.53 For the situation, data, and variables described in 
Exercise 17.15, perform conventional multiple regression 
and stepwise regression analyses on the data, then 
compare the results of the two approaches. The data are 
in file XR17015. 

17.54 For the situation and data described in Exercise 
17.17, perform conventional multiple regression and 
stepwise regression analyses on the data, then compare 
the results of the two approaches. Use total operating rev- 
enue as the dependent variable and number of aircraft, 
number of employees, and number of aircraft departures 
as the three independent variables. The data are also in 
file XR17017. 

17.55 An officer training program has 9 board members 
who provide ratings for the candidates. The high and low 
scores are dropped before the overall rating is computed. 
For a sample of 28 persons from the most recent class of 
candidates, the overall ratings and ratings by the individual 
board members are provided in data file XR17055. 

a. Construct the correlation matrix for all 
10 variables. 



b. With overall rating as the dependent variable, 
perform a stepwise regression and interpret the 
results. 

c. From the results in part (b), does it appear that the 
size of the board could be reduced without losing 
very much explanatory power? Explain. 

d. Using only the data provided by the first two board 
members introduced in part (b), perform a multiple 
regression analysis. Compare the results with the step- 
wise regression as it appeared following the introduc- 
tion of these board members. 

17.56 Using the data file for Exercise 17.55, (a) apply 
stepwise regression with the rating by board member 1 as 
the dependent variable and the ratings by the other 8 
members as the set of independent variables from which 
to select. Interpret the results, (b) Using only the indepen- 
dent variables introduced during the stepwise regression, 
perform a multiple regression analysis, (c) Compare the 
results in parts (a) and (b). 

17.57 Using the data file for Exercise 17.55, (a) apply 
stepwise regression with the rating by board member 9 as 
the dependent variable and the ratings by the other 8 
members as the set of independent variables from which 
to select, then interpret the results, (b) Including only the 
independent variables introduced during the stepwise re- 
gression, perform a multiple regression analysis, (c) Com- 
pare the results in parts (a) and (b). 



SELECTING A MODEL 1 7.8 



Given the variety of models discussed in this chapter, and in Chapters 15 and 16, 
it is natural to wonder which model is best for a given situation. However, the se- 
lection of which model to use is not quite so straightforward as the workings of 
the individual models themselves. Rather than feature a complex flow diagram 
and an elaborate discourse on each of its components, in this section we will con- 
sider a general sequence of steps and some of the more important considerations 
involved with each. For a more advanced discussion of the process of selecting the 
best model, the interested reader may wish to refer to a more advanced text, such 
as the excellent treatment given this topic by Kleinbaum, Kupper, Muller, and 
Nizam. 8 

1. Specify the dependent variable and determine the goal for the analysis. If the 

goal is simply to predict the value of the dependent variable based on a set of 
predictor variables, we need not worry about multicollinearity. Accordingly, 
we can use a large number of predictor variables without worrying about 
whether some of them might be highly correlated with each other. On the 



"Kleinbaum, D.G., L.L. Kupper, K.E. Muller, and A. Nizam, Applied Regression Analysis and Multi- 
variable Methods, Pacific Grove, CA: Duxbury Press, 1998. 
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other hand, if we are concerned with interpreting partial regression coeffi- 
cients in an attempt to better understand the relationship between the depen- 
dent variable and one or more of the predictor variables, it will be desirable to 
build a smaller model with predictor variables that are more independent 
from each other. 

2. Specify possible predictor variables. Based on a combination of experience 
and knowledge about the dependent variable and the environment in which 
it exists, develop a list of predictor variables that could be related to the de- 
pendent variable. In doing so, keep in mind the data-collection step that 
comes next. It may be not be practical or economically feasible to gather 
relevant data for all of the predictor variables that could be candidates for 
the model. 

3. Collect the data. This may involve primary data, secondary data, or both. The 
kinds of methods and data sources described in Chapter 4 could be utilized. 
Consider how large the eventual set of predictor variables might be, and col- 
lect data accordingly. As a rule of thumb, the number of observations should 
be at least five times the number of predictor variables in the model. 9 This is 
due to degrees-of-freedom considerations. The larger the number of predictor 
variables, the easier it is for the model to fit any given set of data, even data 
that are completely random. 

4. Specify the candidate models. The models under consideration could include 
any of those discussed in this chapter and the two that preceded, as well as 
models that are larger or more complex. In general, we should attempt to use 
the simplest possible model that satisfies the goal for which the model is 
being developed. 

5. For each candidate model, decide which of the available predictor vari- 
ables will be included in the model. Deciding which variable(s) to include 
can be done one variable at a time, as in the stepwise regression method 
presented in Section 17.7, and the approach can involve either forward se- 
lection or reverse elimination. However, with increasingly powerful com- 
puters and the greater sophistication of statistical software, more "brute 
force" methods are available. In an approach called best subsets regres- 
sion, the computer considers all of the possible combinations or subsets of 
the available predictor variables, then reports which models are the best 
performers for each given number of predictors. Implicit in this process is 
the criterion by which one model is judged to perform better than another. 
In this chapter, our focus has been on the coefficient of determination (_R 2 ) 
that indicates the proportion of variation in the dependent variable that is 
"explained" by the set of predictor variables, but many other criteria can 
be applied. 

6. Select a model and evaluate its suitability. Once a model has been selected, we 
can evaluate whether it satisfies the assumptions underlying its use. This eval- 
uation should include a residual analysis and related diagnostics. 

7. Test the reliability of the model by applying it to new or withheld data. The 
model can be tested by applying it to new data as time goes by. Another pos- 
sibility is to build the model on the basis of a portion of the available data, 
then test its performance on the "fresh" data that were initially withheld from 
the model. 



9 Ibid., p. 389. 
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SUMMARY 1 7.9 



• Models, variables, and transformations 

In this chapter, we go beyond the simple and multiple regression estimation mod- 
els in Chapters 15 and 16 to include curvilinear models as well as models where 
a variable is qualitative and can represent two or more categories. For underlying 
relationships that are not linear, logarithmic transformation methods can be em- 
ployed in constructing exponential and multiplicative models. 



• Multicollinearity 

If two or more independent variables are highly correlated with each other, mul- 
ticollinearity is present, and the partial regression coefficients may be unreliable. 
One way to identify multicollinearity is to generate a correlation matrix that 
shows the correlation of each variable with each of the other variables. Multi- 
collinearity is not a problem when the only purpose of the equation is to predict 
the value of the dependent variable, but it can be a problem when we are trying 
to interpret the meaning of the partial regression coefficients. 



• Stepwise regression 

Stepwise regression is introduced and the forward selection method is presented. 
In this approach, independent variables enter the equation one at a time, with the 
first one entering being the x variable that explains the greatest amount of the 
variation in y. At each step, the variable is introduced that explains the greatest 
amount of the remaining variation in y. 



• Model selection 

Determining the best model for a given situation can be a complex process, and a 
sequence of general steps is presented for model selection. 



Equations 




Polynomial models with one quantitative predictor variable 

General polynomial model: y i = f) 0 + fi^x^ P 2 x t 2 + PjX^ + • • • + fipxf + e, 
First-order polynomial model: E(y) = fi Q + fi x x 
Second-order polynomial model: E(y) = p 0 + /3jX + f) 2 x 2 
Fhird-order polynomial model: E(y) = p 0 + jSjX + f) 2 x z + /3 3 x 3 

Polynomial models with two quantitative predictor variables 

First-order model with two predictor variables: E(y) = /3 Q + j3 

First-order model with two predictor variables and interaction: 
E(y) = P 0 + P i x 1 + P 2 x 2 + 03X^2 

Second-order model with interaction: 
E(y) = /3 0 + P 1 x 1 + fi 2 x 2 + P 3 x\+ Pix\+ P 5 x x x 2 
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Conversion of the exponential equation, y = b 0 bf, to linear form 

logy = log b 0 + x log b v or logy = log b 0 + (log b t )x 

Conversion of the multiplicative equation, y =b 0 x^ix^ i to linear form 

logy = log b 0 + b 1 log x 1 + b 2 log x 2 




/data set /Note: Exercises 17.58-17.73 require a 
computer and statistical software. 

17.58 As reported by the Recording Industry Association 
of America, the percentage of recorded music dollar 
sales in CD format was as shown here for the years 
2000 through 2004. The data are also in file XR17058. 
Using x = 1 through 5 to represent years 2000-2004, 
fit a second-order polynomial model to the data and 
estimate the percentage of music sales in CD format for 
the year 2010. source: The World Almanac and Book of Facts 
2006, p. 276. 



Year 



x = Year Code 



CD Music 
Percentage 



2000 
2001 
2002 
2003 
2004 



89.3 
89.2 
90.5 
87.8 
90.3 



17.59 J.K. Rowling's popular Harry Potter books have 
been getting thicker with each new edition. The 
publication year and page count for U.S. editions for 
the first six books were as shown here. The data are 
also in file XR17059. Using year codes so that x = 1 
corresponds to 1998, the year the first book was 
published, fit a second-order polynomial model to the 
data and estimate the page count for a Harry Potter book 
published in 2010. source: Julie Snider, "Higher and Higher with 
Harry," USA Today, April 2, 2003, p. 1D, and harrypotterfanzone.com, 
August 25, 2006. 



17.60 Dun & Bradstreet have reported the number of 
business starts and the employment they have provided 
for the years shown here. The data are also in file 
XR17060. Generate a scatter diagram and apply an 
appropriate polynomial model to fit the data. Does 
there appear to be any relationship between the 
variables? How much of the variation in employment 
is explained by the model? source: Wall Street Journal 
Almanac 1999, p. 171. 



Year 


Starts 


Employment 


1985 


249,770 


1,657,383 


1986 


253,092 


1,650,537 


1987 


233,170 


1,552,708 


1988 


199,091 


1,333,426 


1989 


181,645 


1,212,464 


1990 


158,930 


827,012 


1991 


155,672 


731,621 


1992 


164,086 


800,827 


1993 


166,154 


780,804 


1994 


188,387 


758,134 


1995 


168,158 


738,606 


1996 


170,475 


846,973 


1997 


166,740 


939,310 



17.61 Each workstation in the final assembly area of a 
large manufacturing facility has a "backlog" area to 
hold units that are to be processed. A production engi- 
neer believes there could be a relationship between the 
number of units in the backlog area and the productivity 
of the worker. Specifically, the engineer thinks increasing 
the number of units in the backlog area might lead to 



Data for Exercise 17.59 
Title 



The Sorcerer's Stone 
The Chamber of Secrets 
The Prisoner ofAzkaban 
The Goblet of Fire 
The Order of the Phoenix 
The Half-Blood Prince 



Publication Year 



1998 
1999 
1999 
2000 
2003 
2005 



x = Year Code 



Page Count, U.S. Edition 



309 
341 
435 
734 
896 
762 
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workers being more in a hurry to "catch up," hence in- 
creasing their productivity. The data in file XR17061 
show the results of a test in which the engineer tried 
various backlog levels and measured productivity at 
50 different workstations. Generate a scatter diagram 
relating y = productivity (units/hour) to x = number of 
units in the backlog area, then apply an appropriate 
polynomial model to fit the data. Does there appear to 
be any relationship between the variables? How much 
of the variation in productivity is explained by the 
model? 

17.62 The data below show the number of years it took 
for selected technological innovations to spread to 25% 
of the U.S. population. The data are also in file XR17062. 
For y = years to spread and x = years past 1800 
(e.g., the code for 1975 is 175), generate a scatter 
diagram and apply an appropriate polynomial model to 
fit the data. Does there appear to be any relationship 
between the variables? How much of the variation in 
dissemination time is explained by the model? Using 
your model, predict how long it will take for a major 
technological invention that is invented in 2010 to 
spread to 25% of the U.S. population, source: Wall Street 
Journal Almanac 1999, p. 350. 





Year 


Years to 


Invention 


Code 


Spread 


Home electricity (1873) 


73 


46 


Telephone (1875) 


75 


35 


Automobile (1885) 


85 


55 


Radio (1906) 


106 


22 


Television (1925) 


125 


26 


VCR (1952) 


152 


34 


Microwave oven (1953) 


153 


30 


Personal computer (1975) 


175 


15 


Cellular phone (1983) 


183 


13 



17.63 The data in file XR17063 include the "apparent 
temperature," how warm it "feels" at different combi- 
nations of room temperature (degrees Fahrenheit), and 
relative humidity (%). For y = apparent temperature, 

x 1 = actual room temperature, and x 2 = relative humid- 
ity, fit a first-order polynomial model to the data and in- 
terpret the partial regression coefficients and the R 2 
value. Add an interaction term to the model. To what 
extent has the addition of the interaction term improved 
the explanatory power of the model? source: Time Almanac 
2001, p. 609. 

17.64 The American Bankers Association has reported 
the percentage of consumer loans in various categories 
that were at least 30 days overdue in the first quarter of 
each of the years shown in the table. The data are listed 
in file XR17064. For y = percent of home equity loans 
overdue, Xj = percent of bank credit card balances over- 
due, and x 2 = percent of direct auto loans overdue, fit a 



first-order polynomial model to the data and interpret 
the partial regression coefficients and the R 2 value. Add 
an interaction term to the model. To what extent has the 
addition of the interaction term improved the 
explanatory power of the model? source: Wall Street Journal 
Almanac 1999, p. 302. 





Home 


Bank 


Auto 


Year 


Equity 


Lards 


Loans 


1985 


2.07 


2.51 


1.63 


1 986 


1 .95 


2.86 


1 .82 


1987 


1.66 


2.33 


1.76 


1988 


1.77 


2.54 


1.75 


1989 


1.51 


2.54 


1.73 


1990 


1.47 


2.18 


1.94 


1991 


1.75 


3.34 


2.14 


1992 


1.73 


2.86 


2.37 


1993 


1.48 


2.74 


2.00 


1994 


1.51 


2.54 


1.53 


1995 


1.25 


3.18 


1.45 


1996 


1.44 


3.53 


1.74 


1997 


1.38 


3.51 


2.10 


1998 


1.30 


3.1 1 


2.06 



17.65 The production engineer for the manufacturing 
plant in Exercise 17.61 has just included another 
variable: the worker's gender, coded as 1 = female and 

0 = male. File XR17065 contains the expanded set of data. 
With y = productivity (units/hour), x 1 = number of units 
in the backlog area, and x 2 = gender, fit a first-order 
model with no interaction term. Interpret the partial 
regression coefficients and the R 2 value. At the 0.02 level, 
is the model significant? 

17.66 The sales director for an industrial supplies firm 
has collected information describing the performance 
and personal characteristics of 80 members of her 
sales force. The data in file XR17066 include the percent 
of sales quota the salesperson achieved last year, age, 
number of years with the firm, and personal appearance 
(very neat, about average, or a little sloppy). In the data 
file, a very neat person is coded as neat = 1, average = 0, 
a person of average neatness is coded as neat = 0, 
average = 1, and a sloppy person is coded as neat = 0, 
average = 0. With y = percent of sales quota, perform 

a linear regression using the other variables as predictors. 
Interpret the partial regression coefficients and the R 2 
value for the model. At the 0.05 level, is the model 
significant? 

17.67 For the temperature and relative humidity data 
described in Exercise 17.63 and listed in file XR17063, 
fit a multiplicative model and use it to estimate the ap- 
parent temperature for a room in which the actual tem- 
perature is 72 degrees Fahrenheit and the relative 
humidity is 50%. 
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17.68 The Dow Jones Global Indexes describe over 
3000 companies worldwide that represent more than 
80% of the equity capital in the world's stock markets. 
Data file XR17068 shows the quarterly indexes for the 
world as a whole, and indexes for the United States, the 
Americas, Europe, and the Asia/Pacific region from the 
first quarter of 1993 through the third quarter of 1998. 
Given that the dependent variable will be y = world 
index, construct a correlation matrix for the four 
possible predictor variables. Does multicollinearity 
seem to be present in the data? If so, would this present 
a problem if we were using the predictor variables to 
estimate the world index? If there is multicollinearity, 
would this present a problem if we were interpreting 
the partial regression coefficients in a linear model 
applied to the data? source: Wall Street Journal Almanac 1999, 

p. 264. 

Note: For Exercises 17.69-17.72, we suggest that 
Minitab users specify alpha-to-enter as 0.05 and alpha- 
to-remove as 0.10. (See Computer Solutions 17.6.) 

17.69 Data file XR17069 lists a variety of information 
for a selection of aircraft operated during 1999. The 
variables and their units are: operating cost (dollars per 
hour), number of seats, speed (mph), flight range 
(miles), and fuel consumption (gallons per hour). 
Considering operating cost per hour as the dependent 
variable, construct and interpret the correlation matrix 
for the possible predictor variables. Does it appear that 
multicollinearity could be present? For y = operating 
cost per hour, perform a stepwise regression analysis 
using the other variables as predictors and interpret the 
results. How much of the variation in operating costs is 
explained by the model? source: World Almanac and Book of 
Facts 2001, p. 217. 

17.70 Given the situation and variables described in Ex- 
ercise 17.66 and data file XR17066, use y = percent of 
sales quota in performing a stepwise regression using the 
other variables as predictors. Interpret the results, includ- 
ing a commentary on which variables seem to be of 
greater importance in predicting sales performance. How 
much of the variation in sales performance is explained 
by the model? 

17.71 The students in a training program have taken four 
tests along with a final exam, with the scores shown in 
the table. The data are also in file XR17071. 

a. Construct the correlation matrix for these vari- 
ables. Does it appear that multicollinearity 
could be a problem if all of the independent 
variables are used in estimating y = final exam 
score? In a stepwise regression, which indepen- 
dent variable would be the first one introduced 
into the equation, and why? 



Student 


Test 
1 


Test 
2 


Test 
3 


Test 
4 


Final 
Exam 


1 


59 


65 


53 


65 


62 


2 


69 


77 


57 


76 


78 


3 


79 


74 


81 


86 


82 


4 


87 


88 


83 


90 


96 


5 


92 


89 


97 


92 


94 


6 


65 


69 


60 


66 


68 


7 


71 


70 


68 


81 


78 


8 


93 


87 


85 


95 


94 


9 


83 


92 


89 


90 


91 


10 


71 


68 


69 


74 


77 


1 1 


64 


61 


74 


74 


73 


12 


76 


82 


79 


78 


86 


13 


65 


75 


67 


77 


79 


14 


90 


88 


75 


94 


89 


15 


69 


68 


74 


76 


72 


16 


85 


81 


99 


79 


96 



b. With y = final exam score and the independent 
variables as shown above, apply stepwise 
regression analysis to these data. 

c. Describe each step of the stepwise regression 
in terms of the variable introduced, the re- 
gression equation after its introduction, and the 
proportion of variation in y that has been 
explained. 

d. Using only the independent variables introduced 
during the stepwise regression, carry out a con- 
ventional multiple regression analysis on the 
data. What, if any, additional information does 
your computer statistical package provide that 
was not given in the printout for the stepwise 
regression? 

17.72 Repeat parts (b)-(d) of Exercise 17.71, with Test 1 
as the dependent variable, and Test 2, Test 3, Test 4, and 
Final Exam as the independent variables. 

17.73 As indicated in the chapter-opening vignette, the 
National Football League quarterback passer rating is 
somewhat of a mystery to most observers. Data file 
XR17073 includes the passer rating for 34 NFL 
quarterbacks during the 2000 season, along with these 
ten performance measurements for each quarterback: 
passing attempts, completions, completion percentage, 
yards gained, average yards per attempt, number of 
touchdowns, percentage of passes that were touchdowns, 
longest pass completed, number of interceptions, and per- 
centage of passes that were intercepted. With passer rat- 
ing as the dependent variable, use stepwise regression to 
find out which predictor variables are especially 
important in determining the passer rating. Comment on 
the results, including the percentage of the variation in 
passer ratings that is explained by the model. 

source: NFL.com. 
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Thorndike Sports Equipment 

Ted Thorndike has been asked to pay another visit to the 
statistics class taught by his friend, Mary Stuart. One of 
the university's track athletes has an excellent chance to 
make the Olympics in the 1500 meters, known as the 
"metric mile," and Ted has brought along his laptop com- 
puter, a statistical package, and data on the world record 
for the one-mile run. 

In 1954, Great Britain's Roger Bannister became the first 
person to run a mile in less than four minutes (3:59.4). Since 
then, the record has been broken 1 8 times, most recently by 
Morocco's Hicham El Guerrouj, at 3:43.13. The following 
data describe the record runs and the years in which they oc- 
curred. The data are also in file THORN17. source: Time Al- 
manac 2006, p. 985. 

Note These times are not recorded to the same number 
of decimal places for the seconds, because officiating poli- 
cies and timing technologies may have differed from one 
record run to the next. For convenience, you may wish to 
(1) code the times so that each is the number of seconds 
over 3 minutes and (2) code the years so that the year 
1901 = 1. 

1. Fit a simple linear regression (y = b 0 + b x x) to the 
time series. 

2. Fit a polynomial regression equation (y = b 0 + b x x + 
b 7 x 2 ) to the time series. 



; ■ 












Person 


Time 


Year 


Roger Bannister 


3:59.4 


1954 


John Landy 


3:58 


1954 


Derek Ibbotson 


3:57.2 


1957 


Herb Elliott 


3:54.5 


1958 


Peter Snell 


3:54.4 


1962 


Peter Snell 


3:54.1 


1964 


Michael Jazy 


3:53.6 


1965 


Jim Ryun 


3:51.3 


1966 


Jim Ryun 


3:51.1 


1967 


Filbert Bayi 


3:51 


1975 


John Walker 


3:49.4 


1975 


Sebastian Coe 


3:49 


1979 


Steve Ovett 


3:48.8 


1980 


Sebastian Coe 


3:48.53 


1981 


Steve Ovett 


3:48.4 


1981 


Sebastian Coe 


3:47.33 


1981 


Steve Cram 


3:46.32 


1985 


Noureddine Morceli 


3:44.39 


1993 


Hicham El Guerrouj 


3:43.13 


1999 



J > 



J* 



3. Comparing the R 2 values for the equations from parts 
1 and 2, which equation is the better fit to the data? 

4. Using the equation identified in question 3, what is the 
predicted world record for the one-mile run in the 
year 2010? 



Fast-Growing Companies 



Computer database GROWCO describes the characteristics 
of 100 companies identified by Fortune as among the 
fastest growing, source: "Fortune's 100 Fastest-Growing Compa- 
nies," Fortune, September 4, 2000, pp. 142-158. The variables 
include: 

1 . Name 

2. Earnings-per-share growth, 3-year annual rate (percent) 

3. Net income, past four quarters (millions of dollars) 

4. Revenue growth, 3-year annual rate (percent) 

5. Revenue, past four quarters (millions of dollars) 

6. Total return, 3-year annual rate (percent) 

7. Stock price change in past year (percent) 

8. Stock price (dollars) 

9. Estimated price/earnings ratio a year from now 



10. Category (1 = technology, 2 = industrial, 3 = retail, 
4 = health care, 5 = financial services, 6 = telecom- 
munications, 7 = other) 

Variables 11-16 are dummy variables representing the 
categories of variable 10. They are coded according to the 
scheme described in Section 17.4. For a technology com- 
pany, technology = 1 and the other dummy variables will 
be 0. For a financial services company, financial = 1 and 
the others will be 0. If a company is "other," all six values 
will be 0. Thus, we need only six variables to describe the 
seven possible categories. 

1. Generate a correlations matrix that includes variables 
2-9 and 11-16. (Variable 10 would be meaningless 
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here because its values are 1-7 for the 7 company 
categories.) Do the correlations "make sense" in 
terms of which correlations are positive and which 
are negative? 

Using y = stock price (variable 8) as the dependent 
variable, carry out a conventional multiple regression 
analysis using variables 2-7, 9, and 11-16. Excel 
users, see note 1, which follows. Examine the partial 
regression coefficients to see whether their signs 
match the signs of the corresponding correlation co- 
efficients associated with stock price. What percent- 
age of the variation in stock prices is explained by the 
set of predictor variables? 

Repeat step 2, but this time perform a stepwise regres- 
sion analysis. Minitab and Excel users should refer to 
the following notes. Identify the variables that were 
introduced and interpret the printout. What percent- 
age of the variation in stock prices is explained by the 
reduced set of predictor variables? 



Minitab users, note: In the solutions materials, we use 
alpha-to-enter = 0.05 and alpha-to-remove = 0.10 for 
stepwise regression analyses. This is to provide consistency 
with Excel and the Data Analysis Plus add-in used with the 
text. See Computer Solutions 17.6. 

Excel users, note 1: Excel requires that the predictor 
variables be arranged in adjacent columns when we carry 
out a multiple regression analysis. The same is true for all 
variables for which a correlation matrix is being generated. 
Depending on the predictor variables being used in a given 
analysis, you may need to rearrange some of the columns in 
the database. If you save the revised database, it is advis- 
able to change the name of the file when doing so. 

Excel users, note 2: The estimated price/earnings ratio 
a year from now was not available for four of the compa- 
nies, so these are "missing data." If you are using the Data 
Analysis Plus add-in for the stepwise regression, you may 
wish to either omit this variable or omit these four compa- 
nies from the analysis. 




Westmore MBA Program 



The MBA program at Westmore University has under- 
gone several dramatic changes over the past five years. 
During this time, the goal of the business school was to 
recruit as many students as possible into the MBA pro- 
gram in order to build up their student base and credit 
hour production. A massive campaign was launched five 
years ago in order to attract more applicants to the pro- 
gram. Special brochures containing information about 
the program were printed and mailed to prospective stu- 
dents as well as to other colleges and universities that 
were likely to have undergraduate students who might be 
interested in coming to Westmore. Mailings were also 
sent to students who indicated an interest in Westmore 
on their GMAT exam. (The GMAT exam is a national 
standardized test used by most business schools in mak- 
ing admissions decisions for applicants to their graduate 
programs.) Representatives from the Westmore School of 
Business began attending regional "MBA fairs," where 
MBA program representatives meet with prospective 
MBA students and share information. 

In the beginning, the number of students applying to 
the Westmore MBA program was small, but eventually the 
advertising campaign began to work and the number of 
qualified applicants each year increased to the target value 
of 150 initially set by the dean of the business school and 
the director of the MBA program. The yield — the percent- 
age of admitted applicants who actually enroll and attend 
Westmore — is typically around 70%. Admitted students 




who do not enroll either attend other MBA programs or 
accept job offers. The accompanying table shows the ad- 
missions and enrollment figures for the five years of the 
plan to build the MBA student base at Westmore. 



Year 


1 


2 


3 


4 


5 


Admissions 


86 


108 


134 


141 


154 


Enrollment 


54 


77 


91 


96 


106 



Wayne McDonald, the director of the program, is cur- 
rently putting the second phase of the plan into action. He 
knows that in order for the MBA program at Westmore to 
attain national recognition, administrators must become 
more selective in the admissions process. The number of 
applicants is now large enough to do this without falling 
below a critical mass of 60 enrolled students each year. 

The major issue facing Wayne and the MBA pro- 
gram is how to go about selecting the best students for 
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the program. Wayne recently met with the MBA Admis- 
sions Committee, which consists of himself and two fac- 
ulty members, Dr. Susan Thompson, who is a finance 
professor, and Dr. Hector Gonzalez, who is a marketing 
professor. 

Wayne: "Thanks for coming to the meeting today. As 
you both know, our recruiting effort over the past 
five years has been extremely successful. We were 
able to exceed our original enrollment goal last year. 
While many of our students have been outstanding 
and have given our program visibility in the business 
community, we have had a number of weak perform- 
ers. Some professors have had to water down their 
courses to keep these people afloat in the program. If 
we are to have a nationally recognized, quality MBA 
program, we must become stricter in our admission 
policies. Fortunately, we are now at the point where 
we can be much more selective and still have our 
minimum critical mass of 60 enrolled students each 
year. " 

Susan: "Wayne is right. Our current admission standards 
require a minimum score of 400 on the GMAT and a 
minimum undergraduate grade point average of 2.0. 
Obviously, this is not much of a hurdle. Personally, 
I would like to see the minimum requirements set at 
580 on the GMAT and 2.75 for the undergraduate 
grade point average." 

Wayne: "Well, raising the minimums is one way of going 
about it, but there are many other factors that deter- 
mine the degree of success a student has in our MBA 
program. We should consider including these factors 
in our decision making process." 

Hector: "Too bad we don't know in advance which stu- 
dents are going to excel. Wayne, do you know what 
other schools are doing?" 

Wayne: "From conferences that I've attended, I have 
found that many MBA programs put a lot of empha- 
sis on the GMAT score and the undergraduate grade 
point average of the student. While some schools set a 
minimum entrance requirement on each of these crite- 
ria as we do currently, other schools combine these 
measures into a single overall score. For instance, 
there is a 'formula score' that many schools use which 
multiplies the undergraduate GPA by 200 and adds 
the result to the GMAT score. If the formula score 
is above a certain figure, say, 1000, then the student is 
considered to be admittable." 

Susan: "But there are so many other factors to consider. 
Surely we don't want our admissions decisions to be 
based solely on a formula. There are many students 
who attend colleges with high grade inflation. Those 
applicants would have an unfair advantage over appli- 
cants from stronger schools with regard to undergrad- 
uate grade point average." 

Hector: "Yes, I agree. There are also studies that have 
indicated the GMAT is not a strong predictor of 



success in graduate school for many reasons, includ- 
ing cultural bias." 

Wayne: "I am not advocating that we go to a strictly 
mathematical basis for making our decisions. How- 
ever, higher minimum standards than we currently 
have or some sort of formula involving GMAT and 
undergraduate grade point average might be a useful 
screening device for sifting through applicants." 

Susan: "I'm not opposed to your suggestion. Such an 
approach could be used to identify those students 
with high potential for succeeding in our program. In 
a sense, many of these decisions could be automated." 

Wayne: "That would certainly be a great timesaver. Our 
admissions committee would only have to meet to dis- 
cuss those applicants with other strong characteristics 
or extenuating circumstances." 

Susan: "Excellent idea! Now, if we go with raising the 
minimum requirements for GMAT and undergraduate 
GPA, how much should we raise them? Or if we go 
with a combined score approach, what formula 
should we use and how should we set its cutoff 
values?" 

Wayne: "We could go with your earlier suggestion of a 
580/2.75 minimum requirement or with the formula 
score I described earlier. I could talk with directors of 
other MBA programs to get some feel for how they 
set their cutoff criteria. After we gain some experience 
with future admissions, we could adjust the cutoff 
criteria." 

Hector: "Why wait until then? We have five years worth 
of data already! We should be able to develop our 
own criteria based on our own past experience." 

Susan: "We might even want to consider developing our 
own formula." 

Wayne: "Great ideas! That's why I like working with the 
two of you on this committee. However, I would limit 
the data to the last two years because of several 
changes we made in the program a few years back. 
The data for the last two years are more reflective of 
our current program." 

Hector: "In looking at these data, how are we going to 
measure the degree to which a student is successful in 
our program? Whether they graduate or not?" 

Wayne: "Fortunately or unfortunately, depending on 
how you look at it, practically all of our students have 
eventually graduated. One of the things we are trying 
to accomplish is to make our program more rigorous 
and demanding, to raise the level of quality. If this is 
going to happen, we have to be more selective with the 
students we admit." 

Susan: "Why not consider the grade point average of the 
student at the end of the program?" 

Wayne: "The major problem there is that the students in 
our program do not take the same set of courses in 
their second year because they select different areas of 
concentration. Some go into marketing, some into 
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finance, others into either accounting or management. 
There is a real lack of comparability in those final 
GPA figures. But what we might do is look at the first- 
year GPA in the MBA program. The courses taken by 
students in the first year are essentially the same be- 
cause they are required core courses. It is not until 
their second year that they began taking elective 
courses in the different concentration areas. What 
first-year MBA grade point average would the two of 
you, as faculty members, define as indicating a suc- 
cessful first year?" 
Hector: "Given the breadth of those first-year core 
courses, their level of difficulty, and our mild degree of 
grade inflation, I would say that any of our students in 
the past two years with at least a 3.2 average would be 
considered successful in the first year. Would you 
agree, Susan?" 

Susan: "I believe most of the faculty would go along with 
that." 

Wayne: "Don't set your goals too high! Remember, we 
need at least 60 students per year to even have a pro- 
gram. We probably need to look at the data to see what's 
possible." 

Hector: "When can we get access to the past data? I re- 
ally would like to get started." 

Wayne: "I'll have one of the staff members write a 
database program to pull up the relevant information 
on all students who have completed the first year of 
the program in the past two years. I'll get the data to 
you as soon as I can. Let's plan to meet again in two 
weeks to see what we have discovered by then. I look 
forward to hearing your ideas." 

Assignment 

Wayne McDonald's assistant gathered the data for the past 
two years of experience with the Westmore MBA program 
and the information is stored in the file WESTMORE on 
your text CD. The Data Description section provides a 
partial listing of the data along with definitions of the vari- 
ables. Using this data set and other information given in 
the case, help Wayne McDonald and the MBA Admissions 
Committee in their quest to develop better admissions 



guidelines for the Westmore MBA program. The case 
questions will assist you in your analysis of the data. Use 
important details from your analysis to support your 
recommendations . 

Data Description 

The data for the Westmore MBA admissions case is con- 
tained in the file WESTMORE on the text CD. The file con- 
tains data for the 202 students who completed their first 
year in the MBA program over the past two years. A par- 
tial listing of the data is shown below. 

The variables are defined as follows: 

ID_No: Student identification number. 
GPA_Yrl: Grade point average for the first year of 
courses in the Westmore MBA program. 
GMAT: Score on the GMAT test. 
UG_GPA: Undergraduate grade point average. 

Undergraduate Major is a qualitative measure broken 
down into two variables: 

UG_Bus = 1 for undergraduate business major, other- 
wise = 0. 

UG_Tech = 1 for undergrad science, engineering, or 
other technical field, otherwise = 0. 
(Note: A student with an undergraduate 
degree in a nonbusiness and nontechni- 
cal field is the "baseline" and will have 
both UG_Bus = 0 and UG_Tech = 0.) 

Undergraduate School Rating is a qualitative measure 
broken down into four variables: 

UG_lst5th = 1 if student is from the top 20% of un- 
dergraduate schools, otherwise = 0. 
UG_2nd5th = 1 if student is from the second 20% of 

undergraduate schools, otherwise = 0. 
UG_3rd5th = 1 if student is from the third 20% of 

undergraduate schools, otherwise = 0. 
UG_4th5th = 1 if student is from the fourth 20% of 

undergraduate schools, otherwise = 0. 

[continued) 















UG_ 


UG_ 


UG_ 


UG_ 






ID_ 


GPA_ 




UG_ 


UG_ 


UG_ 


1st 


2nd 


3rd 


4th 




us_ 


No 


Yr1 


GMAT 


GPA 


Bus 


Tech 


5th 


5th 


5th 


5th 


Age 


Cit 


4001 


3.201 


540 


2.974 


1 


0 


0 


0 


1 


0 


25 




4002 


2.964 


540 


2.529 


1 


0 


0 


1 


0 


0 


23 




4003 


3.745 


510 


3.727 


0 


0 


0 


1 


0 


0 


25 




4004 


3.290 


570 


2.905 


1 


0 


0 


0 


1 


0 


23 




4005 


3.028 


640 


2.641 


1 


0 


1 


0 


0 


0 


23 
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(Note: A student with a degree from a school 
in the bottom fifth of undergraduate schools 
is the "baseline" and will have a 0 value for 
all four of the variables shown above.) 

Age: Age of the student, in years. 

US_Cit: 1, if student is a U.S. citizen, 0 if otherwise. 

Case Questions 

1. The committee wishes to use first-year MBA grade 
point average as its measure of the quality of a student. 
As a first step in assisting committee members, con- 
struct a correlation matrix and briefly discuss the infor- 
mation it contains that could be of help to the commit- 
tee members. If they were to choose only one variable 
in predicting the first-year MBA GPA, which variable 
would you recommend? Do you think it would be a 
good idea to evaluate applicants on the basis of only 
this one variable? 

2. Because the committee is very interested in using pre- 
dicted first-year MBA grade point average in measur- 



ing the quality of applicants to its program, assist 
the members further by carrying out and interpreting 
the results of an appropriate stepwise regression of the 
data. (Note to Minitab users: For consistency with 
Excel results, you may wish to select the alpha-to- 
enter and alpha-to-remove values listed in the Minitab 
portion of Computer Solutions 17.6.) Discuss the re- 
sults, including making and supporting any recom- 
mendations that you think the committee would find 
useful. 

3. Based on your analysis in Case Question 2, and even 
considering the results from a complete regression that 
uses all of the possible predictive variables, does there 
seem to be very much variation in the first-year MBA 
grade point average that is not explained by the inde- 
pendent variables provided in the data? If so, provide 
the committee with some examples of other quantita- 
tive or nonquantitative applicant variables that you 
think could help them in evaluating the potential for 
academic success of persons applying for admission to 
the Westmore MBA program. 




LB 



Easton Realty Company (B) 

When we visited Easton Realty Company (A), in Chapter 
16, Sam Easton was concerned about two former clients 
who complained that Easton Realty had underpriced their 
houses in order to get a quick sale and collect their realty fee. 
In defending his company against these charges, Mr. Easton 
obtained a set of residential sales data on recent home sales 
in the Dallas-Fort Worth metroplex area. The data included 
the sale month, the sale price, the size of the home in square 
feet, the number of bedrooms, the age of the house, whether 
the house was located in Dallas versus somewhere else in the 
Dallas-Fort Worth metroplex, and whether Easton was the 
real estate company that handled the sale. In this chapter, 
you will be asked to help Mr. Easton gain a better under- 
standing of what determines the selling price of a house, and 
you will be using the same variables that were introduced 
earlier, in Easton Realty (A). The data are in file EASTON on 
your text CD. 

Case Questions 

1. As a first step in helping Sam Easton better understand 
how the sales price of a residential property is deter- 



2. 




mined, construct a correlation matrix for the variables 
PRICE, SQFEET, BEDROOMS, AGE, DALLAS, and 
EASTON. Examine the matrix and advise Mr. Easton 
as to which variable seems to be the most important in 
explaining the variation in residential housing prices. 
You can expect Mr. Easton to be a little skeptical, since 
he realizes that the sales price of a house depends on 
many other factors besides the one you have identified. 
Provide further assistance to Sam Easton by carrying 
out and interpreting the results of an appropriate 
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stepwise regression designed to indicate which of the 
predictive variables mentioned in Case Question 1 are 
most important in determining the sales price of a 
house. (Note to Minitab users: For consistency with 
Excel results, you may wish to select the alpha-to- 
enter and alpha-to-remove values listed in the 
Minitab portion of Computer Solutions 17.6.) Briefly 
discuss the results, including making and supporting 
any recommendations you believe could benefit Mr. 
Easton. 

3. Based on your analysis in Case Question 2, and even 
considering the results from a complete regression that 



uses all of the possible predictive variables identified in 
Case Question 1, does there seem to be very much 
variation in sales price that is not explained by the in- 
dependent variables provided in the data? If so, what 
other quantitative or nonquantitative variables associ- 
ated with a house might cause its sales price to be es- 
pecially high or low compared to the estimated value 
based on models such as the ones you applied here and 
in Case Question 2? 



Time Series-Based Forecasting and the Zundapp 

During the late 1950s, some very unusual, miniature-size economy cars were 
imported to the United States. One of these was the German Zundapp, a 9.5-foot, 
14-horsepower four-seater with two doors: one at the front of the car, the other at 
the rear. The model designation was Janus, for the two-faced Roman god who 
guarded the gates with the help of peripheral vision that athletes and city pedestrians 
could only dream of. The Zundapp's driver and front passenger faced forward, while 
the two rear passengers faced (where else?) ... to the rear. 

Being a rear-seat passenger in the Zundapp was a lot like engaging in the time 
series-based forecasting approaches discussed in this chapter. The only thing you 
could see was where you'd been, and you often had little idea where you were 
going. 

Imagine sitting in the rear of the Zundapp and 
having to give driving instructions to a blindfolded ,-,„riaDP JanUS 

driver. Giving such directions would be no The 195 s 

problem as long as no bridges were out, no busy 
intersections loomed ahead, and no freeway ramps 
were coming up. In other words, you could do a 
good job if the future were not much different from 
the past. 

While forecasting isn't bothered by fallen bridges, 
intersections, and freeway ramps, unpredictable 
events such as technological advances and changes in 
either the economy or consumer demands can play 
havoc with the accuracy of a forecast. The moral of the 
story: when engaged in time series-based forecasting, 
remember the Zundapp. 
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Describe the trend, cyclical, seasonal, and irregular components of the classical 
time series model. 

Fit a linear or a polynomial trend equation to a time series. 

Smooth a time series with the centered moving average and exponential 
smoothing techniques. 

Determine seasonal indexes and use them to compensate for the seasonal ef- 
fects in a time series. 

Use the trend extrapolation and the exponential smoothing forecast methods 
to estimate a future value. 

Use the mean absolute deviation [MAD) and mean squared error (MSE) criteria 
to compare how well fitted equations or curves fit a time series. 

Use the Durbin-Watson test to determine whether regression residuals are 
autocorrelated. 

Use index numbers to compare business or economic measurements from one 
period to the next. 



18.1 INTRODUCTION 



When data are a sequence of observations over regular intervals of time, they are 
referred to as a time series. A time series may consist of weekly production out- 
put, monthly sales, annual rainfall, or any other variable for which the value is 
observed or reported at regular intervals. 

Time series are examined for two reasons: (1) to understand the past and 
(2) to predict the future. By analyzing a time series, we can identify patterns and 
tendencies that help explain variation in past sales, shipments, rainfall, or any 
other variable of interest. Likewise, this understanding contributes to our ability 
to forecast future values of the variable. 



1 8.2 TIME SERIES 



A time series can be visually portrayed as a graph in which the variable of inter- 
est (e.g., sales, costs, shipments) is the vertical (y) axis and time is the horizontal 
(x) axis. An example is shown in Figure 18.1, where U.S. hybrid vehicle sales are 
plotted against time for the years 2000-2005. 

Components of a Time Series 

Analysis of a time series involves identifying the components that have led to the 
fluctuations in the data. In what is known as the "classical" approach to time 
series, these components are as follows: 

Trend (T) This is an overall upward or downward tendency. As Figure 18.1 
shows, U.S. sales of hybrid vehicles tended to increase during the 2000-2005 pe- 
riod. To the extent that the trend component is present, a regression line fitted to 
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FIGURE 18.1 
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Excel plot with 
superimposed quadratic 
trendline for U.S. hybrid 
vehicle sales from 2000 
through 2005. It was 
generated using the 
procedures in Computer 
Solutions 1 8.1 , shown 
later in this section. The 
years are coded so 
that 2000 = 1 . Dafa 
source: HybridCars.com, 
August 25, 2006. 



the points on the time series plot will have either a positive or negative slope. The 
line fitted in Figure 18.1 has a positive slope. 

CYCLICAL (C) These are fluctuations that repeat over time, with the period being 
more than a year from one peak to the next. The sunspot and business cycles are 
examples of this type of fluctuation. 

SEASONAL (S) These are also periodic fluctuations, but they repeat over a period 
of one year or less. For example, retail sales of bathing suits tend to be higher during 
the spring months and lower during the winter. Likewise, department stores typically 
do a great deal of their business during the Christmas period, vacation travel peaks 
during the summer months when children are out of school, and grocery stores may 
experience greater sales every other week, when employees at a local plant are paid. 
IRREGULAR (I) This component represents random, or "noise" fluctuations that 
are the result of chance events, such as work stoppages, oil embargoes, equipment 
malfunction, or other happenings that either favorably or unfavorably influence the 
value of the variable of interest. Random variation can make it difficult to identify 
the effect of the other components. 



In the classical time series model, the preceding components may be combined 
in various ways. In our discussion, we will use the multiplicative approach, de- 
scribed below and illustrated in Figure 18.2 (page 688). 



Classical time series model: 




y = T • C • S • I where y = 


observed value of the time series variable 


T = 


trend component 


C = 


cyclical component 


s = 


seasonal component 


I = 


irregular component 



The model assumes that any observed value of y is the result of influences ex- 
erted by the four components, and that these influences are multiplicative. The 
observed value of y is assumed to be the trend value, T, adjusted upward 
or downward by the combined influence of the cyclical, seasonal, and irregular 
components. 
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FIGURE 18.2 



The components of the 
classical multiplicative time 
series model 
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Example: If trend = 340 units, cyclical = 120%, seasonal = 85%, and irregular = 1 10%, 
the observed value will be 340 • 1.20 • 0.85 • 1.10 = 381.5 units. 



Several alternative models exist, including an additive model, with 
y = T+ C + S + I. It can also be assumed that the value of y depends on the 
value of y for the preceding period. In other words, y t (the observed value for time 
period t) may be expressed as a function of T, C, S, I, and y t -\ (the value for 
the preceding time period). Also, the multiplicative and additive models may be 
combined, as in y = (T + C) • S • I. 

In general, the most important component of most time series is the trend, 
which can be examined (1) by using regression techniques to fit a trend line to the 
data, or (2) by using the smoothing techniques described in the next section to 
moderate the peaks and valleys within the series. 

In fitting a trend line with regression techniques, the line may be linear (e.g., 
y = b 0 + b^x) or nonlinear (e.g.,y = b 0 + b^x + b 2 x 2 ), where y is the trend line 
estimate for y and x = time. The nonlinear example is a quadratic equation of the 
type we discussed in Chapter 17. 

Fitting a Linear Trend Equation 



In this method, a least-squares linear equation is fitted to the set of observed data 
points. The approach is the same as in Chapter 15 and can be summarized as follows: 



The linear trend equation: 




y = b 0 + b^x 


where y = the trend line estimate of y 


x = time period 



In determining this and other trend equations, we will code the time periods so 
that the first one has a value of 1, the second has a value of 2, and so on — this is 
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the same approach taken by Excel and Minitab. Also, if you are not using the com- 
puter, this offers the advantage of having easier calculator computations because 
the x values have fewer digits than the four-digit years of the original data. 

Using this coding technique and the procedures in Computer Solutions 18.1, we 
could have obtained the least-squares regression line (y = —52.114 + 34.219x). 

The slope of the equation is positive because vehicle sales tended to increase from 
2000 to 2005. The slope, b 1 = +34.219, indicates that sales tended to increase 



computer solutions 1 8.1 


9* 











Fitting a Linear or Quadratic Trend Equation 
EXCEL 

Fitting a Linear or Quadratic Trend Equation with Excel (see Figure 18.1) 



1. The data in Figure 18.1 are also in file CXHYBRID.XLS, with year codes in A2:A7, sales data (thousands of vehicles) in 
B2:B7, and labels in A1 and B1 . Click Tools. Click Data Analysis. Select Regression. Click OK. 

2. Enter B1:B7 into the Input Y Range box. Enter A1:A7 into the Input X Range box. Select Labels. Enter J1 into the 
Output Range box. Select Line Fit Plots. Click OK. 

3. To show the trend line: Right-click on one of the data points. Click Add Trendline. Within the Type menu, select 
either Linear or Polynomial (of order 2). We selected the latter. Within the Options menu, select Display equation. 
Click OK. Following minor formatting adjustments, the results appear as the quadratic trend equation shown in 
Figure 18.1. 

MINITAB 

Fitting a Linear or Quadratic Trend Equation with Minitab 



Trend Analysis Plot for HybridsSold 

Quadratic Trend Model 
Yt = 54.0681 - 45.4175*t + 11.3767*t**2 




t 1 1 1 1 r 

1 2 3 4 5 6 

Index 



Using file CXHYBRID.MTW, with the year codes in C1 and the vehicle sales data in C2: Click Stat. Select Time Series. Click 
Trend Analysis. Enter C2 into the Variable box. Within the Model Type menu, select either Linear or Quadratic. We 

selected the latter. Click OK. The Minitab counterpart to Figure 18.1 appears as shown here. 
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by 34.219 thousand vehicles per year during the period. For any given year, we 
can substitute the appropriate value of x into the trend line equation to find the 
estimated value of y. For example, 2009 is coded as x = 10, and the trend esti- 
mate for that year is y = -52.114 + 34.219(10), or 290.08 thousand vehicles. 



Fitting a Quadratic Trend Equation 

Another possibility for fitting an equation to a time series is to use a quadratic 
model, as shown in Figure 18.1. This involves an equation estimating y as a func- 
tion of x, but the method treats x and x 2 as two independent variables instead of 
just one. The quadratic trend equation relies on the multiple regression techniques 
of Chapter 16 and is described as follows: 

The quadratic trend equation: 

y = b Q + b x x + b 2 x 2 where y = the trend line estimate of y 

x = time period 

(x and x 2 are treated as separate variables.) 

The quadratic trend equation includes a constant component (b 0 ), a linear 
component {b^x), and a nonlinear component (b 2 x 2 ). As such, it is especially ap- 
propriate when an upward trend in the time series is followed by a downward 
trend, or vice versa. This type of equation can also be applied whenever the time 
series either increases or decreases at an increasing rate. 

Computer Solutions 18.1 shows the procedures and results when Excel and 
Minitab are used in fitting a quadratic trend equation to the hybrid vehicle sales 
data for 2000 (coded as 1) through 2005 (coded as 6). Using the number of 
decimal places shown in Figure 18.1, the quadratic trend equation is 

y = 54.068 - 45.418% + 11.377% 2 

As with the linear trend equation, the estimate for a given year is made using 
the x value corresponding to the code for that year. For 2009, coded as x = 10, 
the estimated number of hybrid vehicles sold in the United States during that year 
would bey = 54.068 - 45.418(10) + 11.377(10) 2 = 737.59 thousand vehicles. 

Note that the quadratic trend line seems to fit the data much more closely 
than a linear trend line would have. In Section 18.6, we will examine two meth- 
ods for determining which type of trend equation best fits a set of time series data. 



01 

exercises 



18.1 What are the four components of a time series, and 
to what kind of variation in the series does each refer? 

18.2 For each of the following, indicate which time series 
component is represented. Explain your reasoning. 

a. The proprietor of a movie theater finds Saturday 
is the most popular day of the week for movie 
attendance. 

b. Political advertising in the United States peaks every 
4 years. 




; 5^ ' I* 



c. Sales of a nonaspirin pain reliever decline after the 
product receives negative publicity in a medical jour- 
nal article. 

18.3 The estimated trend value of community water con- 
sumption is 400,000 gallons for a given month. Assuming 
that the cyclical component is 120% of normal, with the 
seasonal and irregular components 70% and 110% of 
normal, respectively, use the multiplicative time series 
model in determining the quantity of water consumed 
during the month. 
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18.4 The trend equationy = 1200 + 3Sx has been fitted 
to a time series for industry worker days lost due to job- 
related injuries. If x = 1 for 1991, estimate the number 
of worker days lost during 2008. 

18.5 Fit a linear trend equation to the following data 
describing average hourly earnings of U.S. production 
workers. What is the trend estimate for 2010? source: 
World Almanac and Book of Facts 2006, p. 135. 

Year: 2001 2002 2003 2004 

Average hourly 

earnings: $14.53 $14.95 $15.35 $15.67 

18.6 Fit a linear trend equation to these data showing the 
value of shipments of food products in billions of dollars. 
What is the trend estimate for 2012? source: Bureau of the 
Census, Statistical Abstract of the United States 2006, p. 655. 

Year: 2001 2002 2003 2004 

Shipments: $451.4 $450.2 $461.1 $487.4 

18.7 U.S. cellular phone subscribership has been reported 
as shown below for the 1993-2004 period, source: World 
Almanac and Book of Facts 2006, p. 380. 





Subscribers 




Subscribers 


Year 


(Millions) 


Year 


(Millions) 


1993 


16.0 


1999 


86.0 


1994 


24.1 


2000 


109.5 


1995 


33.8 


2001 


128.4 


1996 


44.0 


2002 


140.8 


1997 


55.3 


2003 


158.7 


1998 


69.2 


2004 


182.1 



a. Fit a linear trend equation to the time series. Using 
this equation, determine the trend estimate for 
2010. 

b. Fit a quadratic equation to the time series, then use 
the equation to determine the trend estimate for 
2010. 

c. Construct a graph of the time series along with the 
equations fitted in parts (a) and (b). Which equation 
appears to better fit the data? 

18.8 From 1997 through 2004, the value of U.S. exports 
to Mexico was as shown below. Data are in billions of 

dollars, source: World Almanac and Book of Facts 2006, p. 109. 



Year: 


1997 


1998 


1999 


2000 


Exports: 


$71.4 


$78.8 


$86.9 


$111.3 


Year: 


2001 


2002 


2003 


2004 


Exports: 


$101.3 


$97.5 


$97.4 


$110.8 



a. Fit a linear trend equation to the time series. Using 
this equation, determine the trend estimate for 
2012. 

b. Fit a quadratic equation to the time series, then use 
the equation to determine the trend estimate for 
2012. 

c. Construct a graph of the time series along with the 
equations fitted in parts (a) and (b). Which equation 
appears to better fit the data? 
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While the preceding techniques fit an actual equation to the time series, the meth- 
ods covered here smooth out short-term fluctuations in the data. These techniques 
function much like the shock absorbers on an automobile, dampening the sudden 
upward and downward "jolts" that occur over the series. 

The Moving Average 

The moving average replaces the original time series with another series, each 
point of which is the center of and the average of N points from the original se- 
ries. For this reason, this technique is also known as the centered moving average. 
The purpose of the moving average is to take away the short-term seasonal and 
irregular variation, leaving behind a combined trend and cyclical movement. 
The technique will be explained through analysis of a time series for a state's 
electricity purchases. 

We will examine the state's electricity purchases during 1990-2004, and there 
will be a lot of fluctuations from one year to the next. To smooth these fluctuations, 
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we'll use a three-year moving average. Using several of the earlier data points as 
an example: 

Electricity Purchases Three-Year Centered Three-Year 
Year (thousand kwh) Moving Total Moving Average 



1990 815 

1991 826 

1992 745" 

1993 776 

1994 838_ 



■2386 795.3 (or 2386/3) 

-2347 782.3 
■ 2359 786.3 



The first point in the centered moving average is (815 + 826 + 745)/3, or 
795.3. It corresponds to 1991 and is the average of the three years for which 1991 
is at the center. The basis for the average now "moves," dropping the 1990 value 
and picking up the 1993 value. This second point includes purchases for 1991 
through 1993, is centered at 1992, and has a value of 2347/3, or 782.3. The 
moving average can be viewed as the average weight of the occupants of a lifeboat 
that can hold only three people at a time. 

Computer Solutions 18.2 shows the raw data, the procedures, and the results 
when Excel and Minitab are used in generating a three-year centered moving av- 
erage of the electricity purchase data. Note that the Excel procedure is applicable 
only when the centered moving average is based on an odd number of periods; 



computer solutions 1 8.2 




Centered Moving Average for Smoothing a Time Series 

The Excel procedure applies when the centered moving average is based on an odd number of time periods. 
Computer Solutions 18.3 is applicable for Excel when the centered moving average is based on an even num- 
ber of time periods. The Minitab procedure is applicable regardless of the number of time periods on which 
the centered moving average is based. 
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Excel Three-Period Centered Moving Average 

1. The data in Table 18.1 are also in file CX18EPWR.XLS, with years in A2:A16, electricity purchase data in B2:B16, and 
labels in A1 and B1 . Click Tools. Click Data Analysis. Select Moving Average. Click OK. 

2. Enter B1:B16 into the Input Range box. Enter 3 into the Interval box. Select Labels in First Row. Enter C1 into the Out- 
put Range box. (Do not select "Chart Output," because we will use the Chart Wizard to generate the plot.) Click OK. 

3. In column C, select the first two cells with "#NA" in them. Right-click and clear the contents of these cells. The first numer- 
ical value in column C should be C3, the centered moving average corresponding to 1991 . Enter the label 3-yrMA into C1. 

4. We can then use the Chart Wizard, as in Chapter 2, to plot the original data, along with the three-year centered mov- 
ing average. Click on A1 and drag to C16 to select cells A1:C16. Click on the ChartWizard toolbar button (that's the 
one with the vertical bar graphs). Select XY (Scatter). Select Scatter with data points connected by lines. Click 
Next. The rest of the procedure is comparable to that in Computer Solutions 2.5 and, with minor formatting adjust- 
ments, leads to the plot shown above. 

MINITAB 

Minitab Three-Period Centered Moving Average 



Moving Average Plot for Electric 




i i i 1 i i 1 1 i 1 1 1 i 1 1 — 

1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 
Time 



1. Using file CX18EPWR.MTW, with years in C1 and electricity purchase data in C2: Click Stat. Select Time Series. Click 
Moving Average. 

2. Enter C2 into the Variable box. Enter 3 into the MA length box. Select Center the moving averages. 

3. Click Graphs. Select Plot smoothed vs. actual. Click OK. Click OK. The plot appears as shown here. 



Computer Solutions 18.3 describes how to use Excel when the basis is an even 
number of periods. The same Minitab procedure is used regardless of the number 
of periods on which the moving average is based. 

Constructing a Centered Moving Average 
with an Even Number of Periods 

In the preceding example, the centered moving average had an odd number of 
periods as the base. With an even number of periods as the base, the method 
changes slightly so that each moving average point will correspond to an actual 
time period from the original series. The revised procedure is demonstrated in 
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Excel Centered Moving Average Based on Even Number of Periods 

This Excel procedure is for centered moving averages based on an even number of periods. 

EXCEL 




19 



D 



4-yrMA 




1990 1992 1994 1996 1998 2000 2002 2004 



Excel Four-Period Centered Moving Average 



1. 



6. 



7. 



This procedure corresponds to the hand calculations in Table 18.1 . Use file CX18EPWR.XLS, with years in A2:A16, electric- 
ity purchase data in B2:B16, and labels in A1 and B1 . Click Tools. Click Data Analysis. Select Moving Average. Click OK. 
Enter B1:B16 into the Input Range box. Enter 4 into the Interval box. Select Labels in First Row. Enter C1 into the 
Output Range box. (Do not select "Chart Output.") Click OK. 

In column C, select the cells with "#NA" in them. Right-click and clear the contents of these cells. Cells C4:C15 will 
contain smoothed values, but they are not centered. 

Next we do a two-period moving average to center the smoothed values: Click Tools. Click Data Analysis. Select 
Moving Average. Click OK. 

Enter C4:C15 into the Input Range box. Enter 2 into the Interval box. (Do not select "Labels in First Row" or Chart 
Output.") Enter D1 into the Output Range box. Click OK. 

Enter the label 4-yrMA into D1 . Select cells D2:D1 2 and move them downward so the first value corresponds to 1 992. 
The results, shown here, correspond to the calculations demonstrated in Table 18.1 . 

For the plot shown here, temporarily interchange columns C and D, so the variables Year, Electric, and 4-yrMA will be 
in adjacent columns to satisfy ChartWizard requirements. 



Table 18.1. In proceeding from column (1), each of the four-year moving totals in 
column (2) falls between two of the actual time periods. In turn, each average in 
column (3) falls between two figures from column (2). The result of these consec- 
utive "in-between" placements is a moving average in column (4) that corre- 
sponds exactly to a time period of the original series. 

Computer Solutions 18.3 shows the yearly data, the procedure and the results 
when we use a four-period centered moving average to smooth the electricity purchases 
data. As noted previously, Excel requires a different method when we use an even num- 
ber of periods in the moving average, while the Minitab procedure does not change. 

Selecting the Number of Periods for 
a Centered Moving Average 

As the cumbersome operations in Table 18.1 suggest, if you're not using a 
computer, it's desirable to select an odd number of periods unless there is some 



Chapter 18: Models for Time Series and Forecasting 



695 



Year 



1990 
1991 
1992 
1993 
1994 
1995 
1996 



(D (2) (3) (4) 

Average of 

Electricity Four- Year the Four- Year Four- Year Centered 

Purchases Moving Total Moving Totals Moving Average 



815" 

826 

745 

776. 

838 

837 

831 



3162 
3185 
3196 
3282 
3364 
3422 



-►3173. 5^ -f- 4- 
3190.5 
3239.0 
3323.0 
3393.0 



-793.38 
797.63 
809.75 
830.75 
848.25 



TABLE 18.1 



When a centered moving 
average is based on an 
even number of periods, 
the procedure is as shown 
for the example years 
shown here. Otherwise, 
the moving averages will 
not correspond exactly to 
actual time periods from 
the original series. 



compelling reason to do otherwise. Such a reason could arise in the case of quar- 
terly (N = 4) or monthly (N = 12) data where seasonal effects are strong. For ex- 
ample, with a 12-month base, each point in the moving average will include one 
January, one February, one March, and so on. In the case of retailers who experi- 
ence the annual Christmas shopping frenzy, this seasonal "blip" will be dampened 
so that the overall trend can be better observed. In general, a moving average with 
a base equal to the duration of a strong cyclical or seasonal pattern will be highly 
useful in dampening fluctuations caused by the pattern. 

Selecting the number of periods (N) in the base of a centered moving average 
is a subjective matter. A larger base provides greater dampening, but is less re- 
sponsive to directional changes. In addition, an extremely large base will make 
less efficient use of the data because a greater number of the original values at the 
beginning and end of the series will not be represented by moving average points. 
In the Excel portion of Computer Solutions 18.2, there are 15 original data val- 
ues, but the three-year moving average has just 13 points. You may wish to verify 
that a nine-year centered moving average would include just 7 points. 

Exponential Smoothing 

The fluctuations in a time series can also be dampened by a popular and relatively 
simple technique known as exponential smoothing. In this approach, each 
"smoothed" value is a weighted average of current and past values observed in 
the series. It can be summarized as follows: 



The exponential smoothing of a time series: 




E f = a ■ y + (1 — a) ■ E f _ 1 where E t = 


the exponentially smoothed value 


for time period t 




the exponentially smoothed value 




for the previous time period 


y t = 


the actual y value in the time series 




for time period t 


a = 


the smoothing constant, and 




0 < a < 1 
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Each exponentially smoothed (E) value is a weighted average of (1) the actual 
y value for that time period and (2) the exponentially smoothed value calculated 
for the previous time period. The larger the smoothing constant (a), the more im- 
portance is given to the actual y value for the time period. 

To demonstrate how time series fluctuations can be dampened by exponen- 
tial smoothing, we'll apply the technique to the electricity purchases that were 
the topic of the preceding section. The procedure is as follows: 

1. Select the value of the smoothing constant, a. For this example, we'll arbitrar- 
ily select a = 0.3. 

2. Identify an initial value for the smoothed curve. Since each smoothed value 
will be (0.3 • the actual value) + ([1 — 0.3] • [the previous smoothed value]), 
we're stuck at the beginning unless we have a smoothed value with which to 
start the process. It's common practice to "prime the pump" by assuming that 

= Jv so this will be our approach. As shown below, the smoothed value 
for time period 1 (1990) is 815 thousand kilowatt-hours. This initial value 
will become less important as its effect is "diluted" in subsequent calculations. 

3. The calculations through 1993 are shown below. 



For a = 0.3, 
Electricity Exponentially 
Year Purchases Smoothed Purchases 



1990 


815 


815 


•< 


1991 


826 


818.3 


= 0.3(826) + 0.7(815.0) 


1992 


745 


796.3 


= 0.3(745) + 0.7(818.3) 


1993 


776 r 


790.2 


= 0.3(776) + 0.7(796.3) 



The fitted value for 1993 is 

0.3(the actual purchases for 1993) + 0.7(the smoothed purchases for 1992). 



The results of the remaining calculations are shown in the Excel portion of 
Computer Solutions 18.4, along with a set of exponentially smoothed values 
based on a larger smoothing constant, a = 0.6. Notice that the curve with the 
smaller smoothing constant (a = 0.3) more strongly dampens the fluctuations in 
the original series. However, it does so at the cost of lagging further behind when 
the series increases, decreases, or changes direction. In the Excel plots of Com- 
puter Solutions 18.4, both exponentially smoothed curves lag behind actual sales 
during the growth years from 1996 to 2004, much more so than either of the 
moving averages shown in Computer Solutions 18.2. 

In the Excel portion of Computer Solutions 18.4, we must specify a "damp- 
ing factor," calculated as (1 — a), then AutoFill to get the final smoothed value in 
the series — in this case, for 2004. Although Excel offers a plot along with the 
smoothed series, it assumes you are forecasting instead of just smoothing, and its 
plot automatically shifts the smoothed series one time period forward, leaving out 
the final smoothed value. The application of exponential smoothing to forecast- 
ing is covered in Section 18.5. 

Minitab users can proceed relatively swiftly through the procedure described 
in Computer Solutions 18.4. However, to match the calculations in this section, 
be sure to specify that the first period be used in determining the initial smoothed 
value. 



For 1990, Fis set as 
E = y= 81 5 so the 
smoothed curve will 
have a starting point 
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Exponentially Smoothing a Time Series 
EXCEL 




D 



Alpha0.3 



Alpha0.6 



1991 



1992 



815.0 



796.3 



815.0 



821.6 



775.6 



959.9 




700 



1990 1992 1994 1996 1998 2000 2002 2004 



Excel Exponential Smoothing with Smoothing Constant, a = 0.3 

1. Using file CX18EPWR.XLS, with years in A2:A16, electricity purchase data in B2:B16, and labels in A1 and B1: Click 
Tools. Click Data Analysis. Select Exponential Smoothing. Click OK. 

2. Enter B1:B16 into the Input Range box. Enter 0.7 into the Damping factor box. (Note: Excel's "damping factor" cor- 
responds to 1 - a.) Select Labels. Enter C1 into the Output Range box. (Do not select "Chart Output," because we 
will use the Chart Wizard to generate the plot.) Click OK. 

3. C1 will have "#NA" in it. Right-click and clear the contents of this cell. The first numerical value in column C should 
be C2. Enter the Alpha0.3 label into C1. Select C15 and, clicking at the lower right corner to produce the AutoFill " + " 
sign, AutoFill down to C16. 

4. (Optional) We can also generate exponentially smoothed values corresponding to a = 0.6: In step 2, enter 0.4 (i.e., 
1 - a) into the Damping factor box, and enter D1 into the Output Range box. In step 3, select and clear the "#NA" 
from D1, then enter the Alpha0.6 label into D1. Select D15 and, clicking at the lower right corner to produce the 
AutoFill " + " sign, AutoFill down to D16. 

5. Use ChartWizard, as in Computer Solutions 18.2, to plot the original series along with the two exponentially 
smoothed series generated here. 



MINITAB 

Minitab Exponential Smoothing with Smoothing Constant, a = 0.3 

1. Using file CX18EPWR.MTW, with years in C1 and electricity purchase data in C2: Click Stat. Select Time Series. Click 
Single Exp Smoothing. 

2. Enter C2 into the Variable box. Select Use: and enter the smoothing constant 0.3 into the box. Click Graphs. Se- 
lect Plot smoothed versus actual. Click OK. Select Options and enter 1 into the Set initial smoothed value box. 
Click OK. Click OK. The plot appears as shown here. 



[continued) 
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Selecting the value for the smoothing constant (a) is subjective and depends on 
the extent to which we wish to reduce the fluctuations in the data. At the extremes, 
a value of a = 0 yields a horizontal line with no useful information, while a = 1 re- 
sults in an exponentially smoothed curve identical to the original time series. Natu- 
rally, neither extreme would serve the purpose of satisfactorily smoothing the series. 




18.9 The following data show residential and commercial 
natural gas consumption (quadrillion Btu) from 1985 
through 2000. source: Bureau of the Census, Statistical Abstract of 
the United States 2002, p. 565. 



Year 


Consumption 


Year 


Consumption 


1985 


27.6 


1993 


30.6 


1986 


26.9 


1994 


30.8 


1987 


27.6 


1995 


31.6 


1988 


28.9 


1996 


33.0 


1989 


29.4 


1997 


33.0 


1990 


29.2 


1998 


32.8 


1991 


30.1 


1999 


35.8 


1992 


29.6 


2000 


37.4 



Construct a graph of the time series, then 
superimpose a three-year centered moving average 
over the original series. 

Superimpose a five-year centered moving average over 
the original series. Why is the moving average 
"smoother" when N = 5? 



18.10 For the data in Exercise 18.9, determine the 
centered moving average for N = 2; for N = 4. 

18.11 According to the U.S. Energy Information Admin- 
istration, a heating degree-day is a relative measurement 
of outdoor temperature that indicates how far the 
average daily temperature is below 65° Fahrenheit. The 
higher the number of heating degree-days, the greater the 
national demand for heating energy from sources like 
natural gas and electricity. The following monthly data 
show the number of heating degree-days from 1992 
through 1995. source: U.S. Energy Information Administration, 
Annual Energy Review 1995, p. 19. 





J 


F 


M 


A 


M 


J 


1992 


852 


644 


603 


345 


152 


46 


1993 


860 


827 


864 


368 


128 


38 


1994 


1031 


813 


594 


293 


174 


21 


1995 


826 


767 


543 


366 


168 


46 



(continued) 
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■ 

j 


A 
M 


c 

3 




M 


n 
u 


1992 


14 


24 


74 


301 


564 


822 


1993 


1 1 


9 


89 


302 


580 


824 


1994 


6 


16 


65 


268 


479 


723 


1995 


13 


1 1 


97 


239 


598 


862 



a. Construct a graph of the time series. Does the overall 
trend appear to be upward or downward? 

b. Determine the three-month and five-month centered 
moving average curves and superimpose each on the 
original time series. 

1 8. 1 2 For the data of Exercise 18.11: 

a. Obtain the centered moving average for N = 4; for 
N= 12. 

b. When N = 12 months, the centered moving average 
is relatively smooth compared to when N = 4. Aside 
from the fact that it includes a greater number of 
months at a time, is there any other reason why 

the moving average with N = 12 should be so 
smooth? 

18.13 The following data describe the monthly volume 
for an urban courier service. Data are in thousands of 
pieces handled. 





J 


F 


M 


A 


M 


J 


2003 


4.5 


4.7 


4.7 


5.0 


5.0 


4.4 


2004 


5.4 


4.9 


5.3 


5.6 


5.4 


5.4 


2005 


5.8 


5.8 


6.6 


6.5 


6.2 


6.7 




J 


A 


S 


O 


N 


D 


2003 


4.7 


4.9 


4.6 


5.4 


4.8 


5.2 


2004 


5.5 


5.4 


5.7 


6.3 


5.3 


6.3 


2005 


6.7 


6.7 


6.9 


7.4 


6.6 


7.4 



a. Construct a graph of the time series. Does the overall 
trend appear to be upward or downward? 

b. Determine the three-month and five-month centered 
moving average curves and superimpose each on the 
original time series. 

18.14 For the data of Exercise 18.13: 

a. Obtain the centered moving average for N = 4; for 
N= 12. 

b. When N = 12 months, the centered moving average is 
relatively smooth compared to N = 4. Aside from the 
fact that it includes a greater number of months at a 
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time, is there any other reason why the moving 
average with N = 12 should be so smooth? 

18.15 The following data describe U.S. manufacturers' 
shipments of general aviation jet aircraft from 1979 
through 1998: source: General Aviation Manufacturers Association, 
1 999 Statistical Databook, p. 6. 



Year 


Shipments 


Year 


Shipments 


1979 


282 


1989 


157 


1980 


326 


1990 


168 


1981 


389 


1991 


186 


1982 


259 


1992 


171 


1983 


142 


1993 


198 


1984 


169 


1994 


222 


1985 


145 


1995 


246 


1986 


122 


1996 


241 


1987 


122 


1997 


348 


1988 


157 


1998 


415 



a. Construct a graph of the time series. Does the overall 
trend appear to be upward or downward? 

b. Determine the three-year and five-year centered mov- 
ing average curves and superimpose each on the 
original time series. 

18.16 For the data of Exercise 18.11, fit an exponentially 
smoothed curve with smoothing constant a = 0.4; with 
smoothing constant a = 0.7. Which constant provides more 
"smoothing" and why? Which curve is quicker to "catch 
up" whenever the original series changes direction and why? 

18.17 For the data of Exercise 18.13, fit an exponentially 
smoothed curve with smoothing constant a = 0.4; with 
smoothing constant a = 0.7. Which constant provides 
more "smoothing" and why? Which curve is quicker to 
"catch up" whenever the original series changes direction 
and why? 

18.18 For the data of Exercise 18.15, determine the 
centered moving average for N = 2; for N = 4. 

18.19 For the data of Exercise 18.15: 

a. Fit an exponentially smoothed curve with smoothing 
constant a = 0.6. 

b. For these data, describe the appearance of the expo- 
nentially smoothed curve when the smoothing 
constant is a = 0.0; when a = 1.0. Why is it not use- 
ful to assign a either of these extreme values? 



SEASONAL INDEXES 1 8.4 



Depending on the nature of the time series variable, strong seasonal patterns may 
be present for certain months or quarters. The seasonal index is a factor that 
inflates or deflates a trend value to compensate for the effect of a pattern that 
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repeats over a period of one year or less. Besides identifying the relative influence 
of various months or quarters, seasonal indexes are useful in the forecasting tech- 
niques of the next section. 

To demonstrate the calculation of seasonal indexes, we will use a time series 
that has some very strong seasonal patterns. Shown in Figure 18.3, this is the 
quarterly sales of natural gas to customers in a northern city from 2000 through 
2004. As the time series suggests, home heating and other natural gas demands 
are especially heavy during the first quarter of each year and tend to be relatively 
light during the third quarter. 

The Ratio to Moving Average Method 

The effect of seasonal fluctuations is quantified by using a technique called the 
ratio to moving average method. This method is explained in the context of the 
underlying data for Figure 18.3, with the following steps: 

1. Construct a centered moving average of the time series. This is shown in 
columns (l)-(4) of Table 18.2; note that the procedure is similar to the moving 
average calculated earlier, in Table 18.1. Since data are quarterly, the moving 
average is based on four time periods. Had the data consisted of months, the 
moving average would have been based on 12 periods. 

2. Divide the original time series values by the corresponding centered moving 
average. This yields a series of percentages of the moving average. The result 
of this operation is column (5) of Table 18.2. For example, the actual sales in 
the first quarter of 2001 were 3781 thousand Btu, which is 155.365% of the 
centered moving average (2433.625) for that quarter. 

3. For each quarter (or month), identify the mean percentage from those ob- 
tained in step 2. This is done to reduce the effect of the irregular component 
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(1) 


(2) 


(3) 


(4) 


(5) 


TABLE 18.2 1 






Centered 


Sales as a 


Thp fir^t ^t^inp nf 








Four-Quarter 


Average of 


Four-Quarter 


Percentage of 


calculations in determining 


Year 


Quarter 


Sales 


Moving Total Moving Totals 


Moving Average 


Moving Average 


the seasonal indexes for the 
quarterly data of Figure 


2000 


| 


3661 










18.3. The remaining 












calculations and seasonal 
















indexes are shown in 




|| 


1 907 










Table 18.3. 








9,843 












III 


1 500 




9,903.0 


2475.750 


60 588% 










9,963 












IV 


2775 




9,884.0 


2471 000 


1 1 2.303 










9,805 










2001 


I 


3781 




9,734.5 


2433.625 


1 55.365 










9,664 












II 


1 749 




9,633.0 


2408 250 


72.625 










9,602 












III 


1 359 




9,541.5 


2385.375 


56.972 










9,481 












IV 


271 3 




9,541.0 


2385.250 


1 1 3.741 










9,601 










2002 


I 


3660 




9,589.0 


2397.250 


1 52.675 










9,577 












II 


1 869 




9,741.5 


2435.375 


76.744 










9,906 












III 


1 335 




10,146.5 


2536.625 


52.629 










10,387 












IV 


3042 




10,448.0 


261 2 000 


1 1 6.462 










10,509 










2003 


I 


4141 




10,532.5 


2633.1 25 


1 57.266 










10,556 












II 


1 991 




10,498.0 


2624 500 


75.862 










1 0,440 












III 


1 382 




10,446.5 


261 1 .625 


52.91 7 










10,453 












IV 


2926 




10,284.0 


2571 000 


1 1 3.808 










10,115 










2004 


I 


4154 




10,020.5 


2505.125 


165.820 










9,926 












II 


1653 




9,755.0 


2438.750 


67.781 










9,584 












III 


1193 














IV 


2584 













of the time series, and the results are shown in the first box of Table 18.3 
(page 702). Other possibilities include using either the median or a trimmed 
mean that is calculated after the highest and lowest values are dropped. 
4. If the average value of the means is not 100, multiply each one by the same 
adjustment factor so that the average of 100 is obtained. In other words, for 
quarterly data, the means should total 400. If this is not the case, inflate or 
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TABLE 18.3 



The seasonal indexes for 
quarterly sales of natural 
gas to a northern city, 
based on the preliminary 
calculations of Table 18.2 



Sales as a % of the moving average: 










Quarter 






I 


II 


III 


IV 


2000 






60.588 


112.303 




2001 


155.365 


72.625 


56.972 


113.741 




2002 


152.675 


76.744 


52.629 


116.462 




2003 


157.266 


75.862 


52.917 


113.808 




2004 


165.820 


67.781 








Mean 


157.782 


73.253 


55.777 


114.079 



The sum of the means is 400.891 
Multiplying each mean by the adjustment factor, 
400 

the seasonal indexes are 



400.891 





157.43 


II: 


73.09 


III: 


55.65 


IV: 


113.83 



deflate the means accordingly. In Table 18.3, this is accomplished by multiply- 
ing each of the means by 400/400.891. 
5. Following the adjustment (if necessary) in step 4, the resulting values are the 
seasonal indexes. For our example, these are shown in Table 18.3. The larger 
the seasonal index, the more influence that quarter has in terms of natural gas 
sales. As expected, the first quarter, and its heating requirements, has resulted 
in the highest seasonal index, 157.43. This means that 157.43% of the aver- 
age quarterly gas purchases occur during the first quarter. Likewise, the third 
quarter accounts for only 55.65% of the average quarterly purchases. 

Deseasonalizing the Time Series 

Having determined the seasonal indexes, we can go back and deseasonalize the 
original time series. In this process, we remove the seasonal influences and gener- 
ate a time series that is said to be seasonally adjusted. 

From the standpoint of the classical time series model, the original and the 
deseasonalized time series can be summarized as follows: 

1. Original time series Trend (T), cyclical (C), seasonal (S), irregular (I) compo- 
nents, or T ■ C • S ■ I. 

2. Deseasonalized time series Only trend, cyclical, and irregular components, 
or T • C • I. 

For any time period, the deseasonalized value can also be expressed as 
(T • C • S ■ I)/S, and dividing by S removes the seasonal influence from the 
series. In the denominator of the formula below, this seasonal influence (S) is 
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Year 


Quarter 


(1) 

Actual Sales 


(2) 

(Seasonal lndex)/100 


(3) 

Deseasonalizet 


2000 


I 


3661 


1.5743 


2325.5 




II 


1907 


0.7309 


2609.1 




III 


1500 


0.5565 


2695.4 




IV 


2775 


1.1383 


2437.8 


2001 


I 


3781 


1.5743 


2401.7 




II 


1749 


0.7309 


2392.9 




III 


1359 


0.5565 


2442.0 




IV 


2713 


1.1383 


2383.4 


2002 


I 


3660 


1.5743 


2324.8 




II 


1869 


0.7309 


2557.1 




III 


1335 


0.5565 


2398.9 




IV 


3042 


1.1383 


2672.4 


2003 


I 


4141 


1.5743 


2630.4 




II 


1991 


0.7309 


2724.0 




III 


1382 


0.5565 


2483.4 




IV 


2926 


1.1383 


2570.5 


2004 


I 


4154 


1.5743 


2638.6 




II 


1653 


0.7309 


2261.6 




III 


1193 


0.5565 


2143.8 




IV 


2584 


1.1383 


2270.1 



TABLE 18.4 



The deseasonalized time se- 
ries for the natural gas sales 
of Table 18.2 



represented as a proportion, or the seasonal index divided by 100: 

Deseasonalized time series: 

For each time period, the deseasonalized value is 

Value in the original time series 



D 



(Seasonal index for that period)/100 



The deseasonalized time series for the natural gas data is determined by the 
preceding formula and shown in Table 18.4. Throughout the series, each first 
quarter's sales are divided by 1.5743, each second quarter's sales by 0.7309, and 
so on. The deseasonalized sales in column (3) are the actual sales in column (1) di- 
vided by the (seasonal index)/100 in column (2). For example, the deseasonalized 
sales for the first quarter of 2004 are 

D = Actual sales _ 4154 _ 
~ (Seasonal index)/100 ~ 157.43/100 

The Excel portion of Computer Solutions 18.5 shows this and the other desea- 
sonalized quarterly sales superimposed on the original time series of Figure 18.3. 
The end result of deseasonalizing the time series is a line that gives us a clearer 
view of the trend and cyclical components of the series. 

Excel does not have a module dedicated to seasonal indexes, but the disk supplied 
with the text has Excel worksheet templates for determining seasonal indexes and 
deseasonalized values for quarterly and monthly series, and the Data Analysis Plus 
add-in can also be applied. Minitab does have a dedicated module but, as with Data 
Analysis Plus, the methodology differs from the technique described in this section 
and leads to seasonal index values that are slightly different from those obtained here. 
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computer solutions 1 8.5 



Determining Seasonal Indexes 

EXCEL, USING WORKSHEET TEMPLATE 




29 



30 



31 



32 



35 



3L 



q3_ 



q4_ 



157.4314 



55.6528 



113.8254 



W X Y Z 



AA 



AB 



AC 



AD 



AE 



AF 



4500 1 




Quarter 



Deseasoned 



Seasonal Indexes for Quarters 

1. Open file CX18GAS.XLS, with quarters in B2:B21, gas sales data in C2:C21, and labels in B1 and C1. Simultaneously 
open Excel worksheet template TMQUARTS.XLS, on the disk that accompanies the text. 

2. Working in TMQUARTS.XLS: Click on the row button for row 14, drag downward to select rows 14 through 37, 
then right-click and delete rows 14 through 37. There are now 20 rows in which to paste the 20 quarters of gas 
sales data. 

3. Working in CX18GAS.XLS, click on C2 and drag to C21 to select cells C2:C21 . Click Edit. Click Copy. 

4. Working in TMQUARTS.XLS, click to select C6. Click Edit. Click Paste. The sales data are now in cells C6:C25. Click on 
cell D8 and, using the AutoFill " + " at the lower right corner, AutoFill downward over all existing values in column D. 
In column B, ensure that the quarters are numbered in sequence, from 1 to 20. 

5. Within TMQUARTS.XLS, the seasonal indexes are shown in column V. Scroll downward in column R to view plots for 
the original time series and the deseasonalized time series. With slight formatting modifications, the deseasonalized 
plot appears as shown here. It has been moved next to the seasonal indexes for purposes of this Computer Solutions 
description. 

Notes 

1 . In general, the specific rows for the locations of the indexes and plots will vary, depending on how many years' worth 
of quarterly data are being examined. 

2. For monthly data, use Excel worksheet template TMMONTHS.XLS. The procedure will be similar to the one described 
here. 



EXCEL, USING DATA ANALYSIS PLUS 

Seasonal Indexes for Quarters 

1 . Open file CX18GAS.XLS. The quarterly data are in C2:C21 . Enter the label Quarter into cell D1. Enter the sequence of 
digits 1 through 4 into D2:D5. Continue repeating sequences of 1 through 4 into column D until you've put the se- 
quence of digits 1 through 4 into cells D18:D21 . Note that the repeating sequence of quarter identifications should be 
in the column immediately to the right of the column containing the data. If there is no blank column to the right, you 
will need to insert one. 

2. Click Tools. Click Data Analysis Plus. Select Seasonal Indexes. Click OK. 

3. Enter C1:D21 into the Input Range box and select Labels. The seasonal indexes are shown in the printout. 
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A 


B 


1 


Seasonal Indexes 


o 






3 


Season 


Index 


4 


1 


1 .5633 


5 


2 


0.741 1 


6 


3 


0.5492 


7 


4 


1.1463 



Notes 

1 . The Data Analysis Plus methodology differs slightly from the one described in the text and used in the Excel worksheet 
template method described previously, and it will result in slightly different seasonal index values. 

2. For monthly data, use the same procedure shown here, but repeat the sequence of digits 1 through 12 in the column 
immediately to the right of the column containing the data. 

MINI TAB 

Minitab Seasonal Indexes, Quarters 

1. Using file CX18GAS.MTW, with quarters in C1 and gas sales data in C2: Click Stat. Select Time Series. Click 
Decomposition. 

2. Enter C2 into the Variable box. Enter 4 into the Seasonal length box. Under Model Type, select Multiplicative. 
Under Model Components, select Trend plus seasonal. Click Options and enter 1 into the First obs is in seasonal 

period: box. Click OK. Click OK. The summary printout shown here is among the standard printout items, and a vari- 
ety of tables and plots can be obtained as well. 

Time Series Decomposition for Sales 

Multiplicative Model 

Data Sales 
Length 20 
NMissing 0 

Fitted Trend Equation 
Yt = 2513.81 - 4. 50850- t 

Seasonal Indices 
Period Index 

1 1.56598 

2 0.74378 

3 0.55044 

4 1.13980 

Notes 

1 . The Minitab methodology differs from the one described in the text and used in the Excel method above, and will re- 
sult in slightly different seasonal index values. 

2. For monthly data, use the same procedure, but in step 2 enter 12 into the Seasonal length box. 
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18.20 In determining seasonal indexes for the months of 
the year, a statistician finds the total of the "unadjusted" 
seasonal indexes is 1180 instead of 12 • 100, or 1200. 
What correction must be made to arrive at the adjusted 
seasonal indexes? 

18.21 The seasonal indexes for a convention center's 
bookings are 83, 120, 112, and 85 for quarters 1-4, 
respectively. What percentage of the center's annual 
bookings tend to occur during the second quarter? 

18.22 International Dairy Queen, Inc., a firm whose busi- 
ness is highly seasonal, has reported the following quar- 
terly net income figures for 1991 through 1995. source: 
International Dairy Queen, Inc., 1995 Annual Report, p. 31. 

Net Income (Millions) 



I II III IV 



1991 


$4,374 


8.227 


10.185 


5.135 


1992 


4.406 


8.674 


10.536 


5.479 


1993 


4.548 


8.727 


10.677 


5.936 


1994 


4.614 


9.260 


1 1.164 


6.383 


1995 


4.910 


9.848 


1 1.747 


6.712 



a. Construct the four-quarter centered moving average 
for these data and determine the percentages of the 
moving average for the quarters. 

b. Determine the seasonal indexes for the quarters. 

c. Use the seasonal indexes to deseasonalize the original 
time series. 

18.23 A major amusement park has the following num- 
ber of visitors each quarter from 2001 through 2005: 



Number of Visitors (Thousands) 





1 


II 


III 


IV 


2001 


155 


231 


270 


105 


2002 


182 


255 


315 


294 


2003 


160 


250 


280 


297 


2004 


210 


310 


365 


335 


2005 


225 


325 


384 


386 



a. Construct the four-quarter centered moving average 
for these data and determine the percentages of the 
moving average for the quarters. 

b. Determine the seasonal indexes for the quarters. 

c. Use the seasonal indexes to deseasonalize the original 
time series. 

18.24 Given the degree-day data of Exercise 18.11, 

(a) determine the monthly seasonal indexes for January 
through December, then (b) use these indexes to 
deseasonalize the original time series. 

18.25 Given the data of Exercise 18.13, (a) determine the 
monthly seasonal indexes for January through December, 
then (b) use these indexes to deseasonalize the original 
time series. 

18.26 A mail-order firm has found the following seasonal 
indexes for the number of calls received each month. 
Below the seasonal indexes is the actual number of calls 
(thousands) per month during 2005. 

a. Using the seasonal indexes, deseasonalize the data for 
2005. 

b. The president of the firm is quite pleased that the num- 
ber of calls went up each month during the May 
through December portion of 2005. Based on the 
results obtained in part (a), is the firm's performance 
really on the increase? 



Month: J FMAMJJASOND 

Seasonal 

index: 110 105 90 60 70 75 85 95 100 105 140 165 
Calls in 

2005: 7.2 5.9 5.5 3.2 4.0 4.1 5.3 5.4 5.7 6.1 7.9 8.2 



1 8.5 FORECASTING 



In forecasting, we use data from the past in predicting the future value of the vari- 
able of interest, such as appliance sales, natural gas consumption, or the thickness of 
the ozone layer. The time series and seasonal index methods discussed earlier are 
often employed for this purpose. Using past data to predict future data is very much 
like facing backward in a car, then trying to figure out where you are going by 



Chapter 18: Models for Time Series and Forecasting 



707 



looking at where you've been. There actually was such a car in the 1950s, and the 
analogy between it and time series-based forecasting is discussed in the opening 
vignette to this chapter. 

Forecasts Using the Trend Equation 

Once we've fitted a trend equation to the data, a forecast can be made by simply 
substituting a future value for the time period variable. The procedure is the same 
whether the fitted trend equation is linear or nonlinear. 

For example, in Figure 18.1, we fitted a quadratic trend equation to hybrid 
vehicle sales in the United States from 2000 through 2005. Using this equation to 
make a forecast for 2009: 

The fitted linear trend equation is 

y = 54.068 - 45.418* + 11.377* 2 

where y = the trend line estimate of hybrid vehicle sales (thousands) 
x = year code, with 2000 coded as 1. 

The forecast of hybrid vehicle sales for 2009 would be 
y = 54.068 - 45.418(10) + 11.377(10) 2 , or 737.588 thousand vehicles 

Forecasting with Exponential Smoothing 

Besides smoothing a time series, exponential smoothing can also be used in mak- 
ing a forecast for one time period into the future. The approach is similar to the 
smoothing method of Section 18.3, except that F (for forecast value) replaces E 
(exponentially smoothed value) for each time period, and the time period (t) 
"shifts" one unit into the future: 

Forecasting with exponential smoothing: 

forecast for time period t + 1 
actual y value for time period t 
the value that had been fore- 
cast for time period t 
the smoothing constant, and 
0 < a < 1 

As in our previous application of exponential smoothing, there is a smoothing 
constant, 0 ^ a < 1. If our forecast involves annual sales, the forecast for next 
year is a weighted average of the actual sales this year and the sales level that had 
been forecast for this year. If we choose a = 1, we are essentially saying "next 
year will be the same as this year." 

To illustrate the application of exponential smoothing to forecasting, we will 
use the electric power purchases data. These data included purchases from 1990 
through 2004, and exponential smoothing will give us a forecast for 2005. For 
our example, we will use a = 0.6 as the smoothing constant. 

As shown below, exponential smoothing requires that we assume an initial 
forecast value in order to get things started. For this reason, the forecast for 



t+i 



y t + d 



where F 



t+i 

y t 
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1990 is assumed to have been 815, the same as the actual purchases for that year. 
Proceeding toward 2005, the influence of this choice will decrease, since its effect 
is diluted as more recent forecast values are introduced. For each year, the forecast 
is a weighted average of the actual and the forecasted level of purchases for the pre- 
ceding year, and the weighting is according to the smoothing constant, a = 0.6: 



y = Actual F = Purchases forecast for the year. 
Year purchases with smoothing constant a = 0.6 



1990 
1991 
1992 
1993 



815 
826 
745 
776 



815.0 
815.0 
821.6 
775.6 



0.6(815) + 0.4(815.0) 
0.6(826) + 0.4(815.0) 
0.6(745) + 0.4(821.6) 



To get the process 
started, the forecast 
for 1990 is assumed 
to have been 81 5. 



2003 
2004 
2005 



977 
1008 



959.9 
970.2 
992.9 

A 



0.6( 973) + 0.4(940.2) 
0.6( 977) + 0.4(959.9) 
0.6(1008) + 0.4(970.2) 



The forecast for 2005 is a weighted average 
of the actual and the forecasted values for 2004. 



The Excel portion of Computer Solutions 18.6 lists these and the other 
forecasted values for electricity purchases. In addition to the forecast and plot 
shown in Computer Solutions 18.6, Minitab can also generate a complete 



computer solutions 18.G 



Forecasting with Exponential Smoothing 
EXCEL 
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C 
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2 
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815 
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3 


1991 
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4 
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5 


1993 
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6 


1994 
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775.9 




c 
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900 








7 
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CO 
IB 
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8 


1996 
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827.5 
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< Electric 






9 


1997 


858 


829.6 
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10 


1998 


896 


846.6 






800 












11 


1999 


926 


876.3 




o 


750 
700 








12 


2000 


946 


906.1 




"5 

0) 








13 


2001 


947 


930.0 




LU 








14 


2002 


973 


940.2 








1990 1992 1994 1996 1998 2000 2002 2004 






15 


2003 


977 


959.9 










Year 






16 


2004 


1008 


970.2 














17 


2005 




992.9 






















III 



Excel Forecasting with Exponential Smoothing, a = 0.6 



1. 



2. 



Using file CX18EPWR.XLS, with years in A2:A16, electricity purchase data in B2:B16, and labels in A1 and B1: Click 
Tools. Click Data Analysis. Select Exponential Smoothing. Click OK. 

Enter B1:B16 into the Input Range box. Enter 0.4 into the Damping factor box. (Note: Excel's "damping factor" cor- 
responds to 1 - a.) Select Labels. Enter C2 into the Output Range box. (Do not select "Chart Output," because we 
will use the ChartWizard to generate the plot.) Click OK. 
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3. Select C2 and enter 815 (this replaces the "#NA" and provides a forecast for the first time period.) Select C16 and, 
clicking at the lower right corner to produce the AutoFill " + " sign, AutoFill down to C17. Enter 2005 into A17. Enter 
the Forecast label into C1. 

4. Use ChartWizard, as in Computer Solutions 1 8.2, to plot the original series along with the forecast values generated here. 



MINITAB 

Minitab Exponential Smoothing with Smoothing Constant, a = 0.6 



Single Exponential Smoothing Plot for Electric 




Variable 
— •— Actual 

— — Forecasts 
A 9S.0% PI 



Smoothing Constant 
Alpha 0.6 



Accuracy Measures 
MAPE 3.43 
MAD M.M 
MSO 1388.74 



1. Using file CX18EPWR.MTW, with years in C1 and electricity purchase data in C2: Click Stat. Select Time Series. Click 
Single Exp Smoothing. 

2. Enter C2 into the Variable box. Select Use: and enter the smoothing constant 0.6 into the box. Select Generate fore- 
casts and enter 1 into the Number of forecasts box. (Just leave the "Starting from origin" box blank and Minitab 
will make a forecast for the one period immediately following the last data value.) 

3. Click Graphs. Select Plot predicted vs. actual. Click OK. Click Options and enter 1 into the Use average of first_ 
observations box. Click OK. Click OK. The plot appears as shown here, with the forecast displayed as a single point 
beyond the data series. 



table that includes time series values, smoothed values, forecast values, and 
residuals. 

When exponential smoothing is used for forecasting, the forecast will tend to 
be too low for time series that are increasing (positive trend) and too high for time 
series that are decreasing (negative trend). This is because the forecast for a given 
period is always the weighted average of two values associated with the immedi- 
ately preceding period. To correct this problem, the upward or downward trend of 
the time series can be introduced as an additional variable. Further details of this 
method, called the Holt-Winters forecasting model, can be found in more ad- 
vanced texts on forecasting. 
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Seasonal Indexes in Forecasting 

Once we have arrived at a forecast for a given year, we can use seasonal indexes 
to come up with a forecast for each month or quarter of the year. In doing so, we 
divide up the total forecast according to the relative sizes of the seasonal indexes. 
The procedure is as follows: 

Dividing an annual forecast into seasonal components: 

Forecast for the year 

Forecast for a season = — : • [(Seasonal index)/100] 

Number of seasons 

For the natural gas data, suppose that one of the forecasting methods of the 
preceding section had come up with a forecast of 9200 thousand Btu for 2010. 
Using the seasonal indexes determined earlier, the 9200 total would be divided up 
into the four quarters as follows: 

1. If the forecast for the entire year = 9200 thousand Btu, the average quarter 
will be 9200/4, or 2300. 

2. "Inflate" or "deflate" quarters according to their seasonal indexes: 

Quarter: I II III IV 

(Seasonal index)/100: 1.5743 0.7309 0.5565 1.1383 

Quarter 



I 2300(1 .5743) = 3620.89 thousand Btu 

II 2300(0.7309) = 1681.07 

III 2300(0.5565) = 1279.95 

IV 2300(1.1383) = 2618.09 



9200.00 



3. If necessary, adjust the quarterly forecasts so that their sum will be equal to 
the total forecast for the year. 

The sum of these quarterly forecasts (9200.00) is equal to the 9200.00 fore- 
casted for the entire year, so no further adjustment is necessary. (As an example, 
if the quarterly forecasts had totaled 9198.00, we would have had to multiply 
each of the quarterly estimates in step 2 by 9200/9198.00 so that the total for the 
year would be 9200.00.) 



1SL 



18.27 The linear trend equation y = 120 + 4.8x has 

been developed, where y = estimated sales (thousands of 
dollars) and x = 1 for the year 1994. What level of sales 
would be forecast for 2008; for 2010? 

18.28 An appliance repair shop owner has fitted the 
quadratic trend equation y = 90 + 0.9x + 3x 2 to a time 
series of annual repair orders, with y = the number of 
repair orders and x = 1 for 2000. Forecast the number of 
repair orders for 2008; for 2010. 



18.29 Based on annual truck rentals over the years, a 
rental firm has developed the quadratic trend equation 

y = 450 + 20x + 7.2x 2 , with x = 1 for 2000. Forecast y 
for 2008; for 2010. 

18.30 When exponential smoothing is used in fitting a 
curve to a time series, the approach is slightly different 
from its application to forecasting. Compare the 
appropriate formulas and point out how they differ. 
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18.31 The U.S. trade deficit with Canada (billions of 
dollars) from 1997 through 2004 is reported as shown in 
the table. Using exponential smoothing and the smoothing 
constant a = 0.7, what deficit would have been forecast 
for 2005? source: World Almanac and Book of Facts 2006, p. 1 09. 

Year: 1997 1998 1999 2000 2001 2002 2003 2004 
Deficit: $15.5 16.6 32.1 51.9 52.8 48.2 51.7 66.5 

18.32 For the data of Exercise 18.7, use the weighting 
constant a = 0.5 and exponential smoothing to 
determine the forecast for 2005. 

18.33 For the data of Exercise 18.9, use the weighting 
constant a = 0.6 and exponential smoothing to 
determine the forecast for 2001. 



18.34 If a time series is such that sales are consistently 
increasing from one year to the next, will exponential 
smoothing tend to produce forecasts that are (a) over 
or (b) under the sales that are actually realized for the 
forecast period? Why? 

18.35 A firm predicts it will use 560,000 gallons of 
water next year. If the seasonal index of water consump- 
tion is 135 for the second quarter of the year, what is 
the forecast for water consumption during that quarter? 

18.36 The quarterly seasonal indexes for a firm's elec- 
tricity consumption are 115, 92, 81, and 112 for quarters 
I-IV. It has been forecast that electricity consumption 
will be 850,000 kilowatt-hours during 2008. Forecast 
electricity consumption for each quarter of 2008. 



EVALUATING ALTERNATIVE MODELS: MAD AND MSE 



18.6 



When two or more models are fitted to the same time series, it's useful to have one 
or more criteria on which to compare them. In this section, we'll discuss two such 
criteria. 

According to the mean absolute deviation (MAD) criterion, the best-fit model is 
the one having the lowest mean value for \y — y~ \ . The direction of an error, or de- 
viation, is not considered, only its absolute value. The MAD criterion is as follows: 



The mean absolute deviation (MAD) criterion: 

From a given set of models or estimation equations fit to the same time series 
data, the model or equation that best fits the time series is the one with the 
lowest value of 



MAD 



\y t - y t \ 



where y t = an observed value of y 

y t = the value of y that is predicted using 

the estimation equation or model 
n = the number of time periods 



Another criterion is mean squared error (MSE), in which the best-fit model or 
equation is the one having the lowest mean value for (y t — y t ) 2 . This is analogous 
to the least-squares criterion used in determining the regression equations of 
Chapters 15-17. The MSE approach is described below: 



The mean squared error (MSE) criterion: 

From a given set of models or estimation equations fit to the same time series 
data, the model or equation that best fits the time series is the one with the 
lowest value of 



MSE 



where y t = an observed value of y 

y t = the value of y that is predicted using the 

estimation equation or model 
n = the number of time periods 
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Compared to the MAD criterion, the MSE approach places a greater penalty 
on estimates for which there is a large error. This is because MSE is the mean of 
the squared errors instead of simply the mean of the absolute values. Because 
of this, the MSE criterion is to be preferred whenever the cost of an error in esti- 
mation or forecasting increases in more than a direct proportion to the amount 
of the error. 

Table 18.5 shows the application of the MAD and MSE criteria to the linear 
and quadratic estimation equations that were developed for the hybrid vehicle 
sales data in Section 18.2 of this chapter. 



TABLE 18.5 



MAD and MSE measures of 
fit to the hybrid vehicle 
sales data of Section 18.2 



A. Fit for the Linear Estimation Equation, y = -52.114 + 34.219x 



Year 


Year 
Code, Xj 


Actual 
Sales, y t 


Estimated 
Sales, y t 


Error, (y t - y t ) 


Absolute 
Error, 

\y t -9 t \ 


Squared 
Error, 

(y t -y ( ) 2 


2000 


1 




9.350 


-17.895 


27.245 


27.245 


742.290 


2001 


2 




20.287 


16.324 


3.963 


3.963 


15.705 


2002 


3 




35.000 


50.543 


-15.543 


15.543 


241.585 


2003 


4 




47.530 


84.762 


-37.232 


37.232 


1386.222 


2004 


5 




88.000 


118.981 


-30.981 


30.981 


959.822 


2005 


6 


205.750 


153.200 


52.550 


52.550 


2761.503 














167.514 




6107.127 













MAD 



2|y f - 



167.514 



27.919 



1 


f 


MSE = z<y t -y t ) 2 = 6107.127 = 1017 g54 

n 6 



B. Fit for the Quadratic Estimation Equation, 
y = 54.068 - 45.41 8x + 11.377x 2 



Year 


Year 
Code, Xj 


Actual 
Sales, y t 


Estimated 
Sales, y t 


Error, (y t - y t ) 


Absolute 
Error, 

|y t -y ( l 


Squared 
Error, 

(y ( -y ( ) 2 


2000 


1 




9.350 


20.027 


-10.677 




10.677 


113.998 


2001 


2 




20.287 


8.740 


11.547 




11.547 


133.333 


2002 


3 




35.000 


20.207 


14.793 




14.793 


218.833 


2003 


4 


47.530 


54.428 


-6.898 




6.898 


47.582 


2004 


5 




88.000 


1 1 1 .403 


-23.403 




23.403 


547.700 


2005 


6 


205.750 


191.132 


14.618 




14.618 


213.686 
















81.936 




1275.133 













MAD 



2|y f -y t | 81.936 



13.656 





1 


„„.- 2(y f - y ) 2 275.133 

MSE = - ^— = = 212.522 

n 6 
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Regardless of the criterion, the quadratic equation is a better fit to these data. 
This is not surprising, since the quadratic equation has the advantage of including 
both a linear and a nonlinear term. 



exercises 




18.37 How do the MAD and MSE criteria differ in their 
approach to evaluating the fit of an estimation equation 
to a time series? 

18.38 The following data are the wellhead prices for 
domestically produced natural gas, in dollars per 
thousand cubic feet, from 1987 through 1994. Given 
these data and the trend equations shown here, use the 
MAD criterion to determine which equation is the better 
fit. Repeat the evaluation, using the MSE criterion, source: 
U.S. Department of Commerce, U.S. Industrial Outlook 1994, pp. 3^4. 



Year 


x = 
Year Code 


y = 

Price 




1987 


1 


$1.67 


Linear: 


1988 


2 


1.69 


y = 1.474 + 0.0769X 


1989 


3 


1.69 




1990 


4 


1.71 


Quadratic: 


1991 


5 


1.64 


y = 1.844 - 0.145x 


1992 


6 


1.80 


+ 0.0246x 2 


1993 


7 


2.09 




1994 


8 


2.27 





18.39 The following data are the number of U.S. motor- 
cycle thefts for 1992 through 1995. Given these data and 



the trend equations, use the MAD criterion to determine 
which equation is the better fit. Repeat the evaluation, 
using the MSE criterion, source: Earle Eldridge, "Thieves Hog 
Wild over Motorcycles," USA Today, July 19, 1996, p. 1B. 



x = y = 
Year Year Code Thefts 



1992 
1993 
1994 
1995 



6319 
7237 
7902 
9224 



Linear: 

y = 5325.5 + 

Quadratic: 
y = 5830.5 + 
+ 101x 2 



938x 



43 3x 



18.40 Using the MAD criterion, determine which one of 
the equations developed in Exercise 18.7 is the better fit 
to the data in that exercise. Repeat the evaluation, using 
the MSE criterion. 

18.41 Using the MAD criterion, determine which one of 
the equations developed in Exercise 18.8 is the better fit 
to the data in that exercise. Repeat the evaluation, using 
the MSE criterion. 



AUTOCORRELATION, THE DURBIN-WATSON TEST, 1 8.7 

AND AUTOREGRESSIVE FORECASTING 



Autocorrelation and the Durbin-Watson Test 

As we've discussed in earlier chapters, one of the assumptions required by the 
regression model is that the residuals (errors) be independent of each other. 
When this assumption is not met, the condition known as autocorrelation (or 
serial correlation) is present. Regression models based on time series data are es- 
pecially susceptible to this condition because: (1) key independent (x) variables 
that have not been included in the model are likely to be related to time, and 
(2) the omission of one or more of these independent variables will tend to yield 
residuals (errors) that are also related to time. 



714 



Part 5: Regression, Model Building, and Time Series 



Autocorrelation will tend to cause the calculated standard error of estimate 
(s ) to underestimate the true value (aj. The partial regression coefficients (b v b 2 , 
and so on) will also have standard errors that are understated, and the t values in 
their respective tests of significance will tend to be exaggerated. As a result, con- 
fidence and prediction intervals will be too narrow, and the respective indepen- 
dent variables will be exaggerated in their significance. 

A common method of testing for autocorrelation is the Durbin-Watson test. 
Our discussion of this test will be in terms of its ability to detect the presence of 
first-order autocorrelation. This is the correlation between adjacent residuals or 
errors — that is, between e t and e t _ v (By extension, second-order autocorrelation 
would be the correlation between e t and e t _ 2 . When we refer to autocorrelation 
in the remainder of this discussion, we will be referring to first-order autocorre- 
lation.) The Durbin-Watson test treats the population of residuals as shown 
below. 

Durbin-Watson test, description of residual (error) for time period t: 

s t = ps t _ j + z t where e t = residual associated with time period t 

s t _ 1 = residual associated with time period t — 1 
p = the population correlation coefficient 
between values of s t and s t _ l5 with 
-1 < p < +1 
z t = a normally distributed random variable 
with p: = 0 

If p is zero, there is no autocorrelation and e t = z t — that is, s t is nothing more 
than random variation with a mean of zero. Likewise, a p that is greater than 0 
reflects positive autocorrelation and a p that is less than zero reflects negative au- 
tocorrelation. Examples of positive and negative autocorrelation are shown in 
Figure 18.4. The Durbin-Watson test is carried out as shown below. 

Durbin-Watson test for autocorrelation: 
• Null and alternative hypotheses 

The test is 



for positive for negative 

autocorrelation autocorrelation a two-tail test 



H 0 : 


p < 0 


H Q : 


p > 0 


H 0 : 


P 


= 0 


H,: 


p > 0 


H t : 


p < 0 


H V . 


P 


* 0 



• Test statistic 

SK-Vl) 2 
d = : where e t 

t = n t 



n 



= observed residual (error) 
associated with time period t 

= observed residual (error) 
associated with time period 
t - 1 

= number of observations, 
with n > 15 
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• Critical values and decision zones Look up the appropriate d L and d v val- 
ues in the Durbin-Watson tables in Appendix A. The actual critical values 
and decision zones are based on (1) the type of test (positive, negative, or 
two-tail), (2) a = the level of significance for the test, and (3) k = the 
number of independent variables in the regression model. The decision 
zones and appropriate conclusions are shown in Figure 18.5. 



FIGURE 18.4 



Positive autocorrelation: Negative autocorrelation: 

Residuals tend to be followed Residuals tend to change signs 

by residuals with the same sign. from one period to the next. 




0 4 8 12 16 20 0 4 8 12 16 

Time period, t Time period, t 



Examples of positive and negative autocorrelation of residuals 



FIGURE 18.5 



| Testing for positive autocorrelation 



There is 



1 11C1C IS 

positive 
autocorrelation 




The test is 
inconclusive 




There is no evidence of autocorrelation 








1 


0 a 


L 


d 


u 


2 


4 



Testing for negative autocorrelation 



There is no evidence of autocorrelation 



The test is 
inconclusive 



There is 
negative 
autocorrelation 



4-rfr, 



4-4 



| Two-tail test for autocorrelation 



There is 
positive 
autocorrelation 


The test is 
inconclusive 


There is no evidence 
of autocorrelation 


The test is 
inconclusive 


There is 
negative 
autocorrelation 






1 '- 






0 a 


L d 


V 2 4- 


d v 4- 


d L A 



When proceeding with a Durbin-Watson test for autocorrelation, the calculated d statistic is compared to one of three sets of critical 
values and decision zones shown. 
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Appendix A provides three tables for the Durbin-Watson test. They corre- 
spond to the 0.05, 0.025, and 0.01 levels of significance for a one-tail (positive or 
negative) test, and to the 0.10, 0.05, and 0.02 levels for a two-tail test. 

The Durbin-Watson test statistic (d) will have a value somewhere between 0 
and 4, and a value of 2 indicates that the residuals are not autocorrelated. Al- 
though it's best to compare the calculated d with the appropriate critical values 
and decision zones described in Figure 18.5, there are some general guidelines for 
interpreting the calculated value of the test statistic, d: 

Value of d Interpretation 

Close to 0.0 Positive autocorrelation: A positive residual in one period tends 
to be followed by a positive residual in the next period, and a 
negative residual tends to be followed by a negative residual. 
See Figure 1 8.4 for an example. 

Approx. 2.0 Autocorrelation is weak or absent. 

Close to 4.0 Negative autocorrelation: Residuals tend to change signs from 
one period to the next, as shown in Figure 1 8.4. 



example 



Durbin-Watson Test 

Table 18.6 shows how the Durbin-Watson test could be applied to a time series con- 
sisting of deseasonalized quarterly shipment data. Most computer statistical pack- 
ages can spare you the trouble of calculating the Durbin-Watson d statistic, but the 
details are in Table 18.6 just to demonstrate how it's done. The calculated value 
(d = 1.90) is comfortably within the "there is no evidence of autocorrelation" de- 
cision zone, and we conclude that autocorrelation is not present in this series. 



When autocorrelation is present, we should consider the possibility that one 
or more important variables may have been omitted from the regression model. 
The identification and inclusion of such variables can reduce the presence of au- 
tocorrelation and enhance the quality of confidence and prediction interval esti- 
mates and significance tests for the regression model. Transformations performed 
on the dependent or the independent variables can also be useful in this regard. 

Computer Solutions 18.7 shows the procedures and results when we use Excel 
and Minitab in obtaining the Durbin-Watson statistic. In each case, we have per- 
formed the regression analysis and generated the residuals. With Excel, we must then 
specify the cells in which the residuals are located. With Minitab, we need only select 
the Durbin-Watson option when we are carrying out the regression analysis proce- 
dure. Although Excel and Minitab can provide us with the Durbin-Watson statistic, 
we must still refer to the printed tables in reaching a conclusion. 



The Autoregressive Forecasting Model 

When a time series exhibits autocorrelation, not only are the residuals correlated 
with each other, but the time series values themselves are correlated with each 
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For 16 quarters that have been deseasonalized. 

The least-squares estimation equation is y = 34.0 + 0.7f. 



p = 0 (there is no autocorrelation) 
p + 0 (there is an autocorrelation) 



For a two-tail test at the 0.05 level. With n = 1 6 observations and k = 1 
independent variable in the regression equation, reference to the 
Durbin-Watson tables in Appendix A gives d L =0.98 and d u = 1 .24. 



t 


yt 


yt 




el 


e. i e. — 
C t-1 t 


e, a 




1 


33.7 


34.7 


-1.0 


1.00 








2 


32.7 


35.4 


-2.7 


7.29 


-1.0 


1.7 


2.89 


3 


40.2 


36.1 


4.1 


16.81 


-2.7 


6.8 


46.24 


4 


42.5 


36.8 


5.7 


32.49 


4.1 


1.6 


2.56 


5 


36.6 


37.5 


-0.9 


0.81 


5.7 


6.6 


43.56 


6 


36.4 


38.2 


-1.8 


3.24 


-0.9 


0.9 


0.81 


7 


38.9 


38.9 


0.0 


0.00 


-1.8 


1.8 


3.24 


8 


37.6 


39.6 


-2.0 


4.00 


0.0 


2.0 


4.00 


9 


36.3 


40.3 


-4.0 


16.00 


-2.0 


2.0 


4.00 


10 


38.2 


41.0 


-2.8 


7.84 


-4.0 


1.2 


1.44 


11 


48.2 


41.7 


6.5 


42.25 


-2.8 


9.3 


86.49 


12 


41.0 


42.4 


-1.4 


1.96 


6.5 


7.9 


62.41 


13 


40.9 


43.1 


-2.2 


4.84 


-1.4 


0.8 


0.64 


14 


44.3 


43.8 


0.5 


0.25 


-2.2 


2.7 


7.29 


15 


43.8 


44.5 


-0.7 


0.49 


0.5 


1.2 


1.44 


16 


48.0 


45.2 


2.8 


7.84 


-0.7 


3.5 


12.25 










147.11 






279.26 


The calculated Durbin-Watson statistic is d = 


279.26/147.11 


= 1.90. 





TABLE 18.6 



In this application of the 
Durbin-Watson test the 
calculated d (1 .90) falls into 
the "there is no evidence of 
autocorrelation" decision 
zone of the two-tail test. 
The calculated d is available 
from most computer 
statistical packages, and 
Computer Solutions 18.7 
shows how we can obtain 
it with Excel and Minitab. 



There is 
positive 



There is 
negative 



auto- 
correlation 


The test is 
inconclusive 


There is no evidence 
of autocorrelation 


The test is 
inconclusive 


auto- 
correlation 


-< ► 


-< ► 




>- 


>- 



0.98 



1.24 



.90 



2.76 

4- d, 



u 



3.02 
4-d 



other. Although this can undermine the independent-residuals assumption of the con- 
ventional regression approach to time series, it provides an excellent opportunity for 
what is known as the autoregressive forecasting model. 

In autoregressive forecasting, the dependent variable (y) is treated the same way it 
is in other regression models within this chapter. However, each independent variable 
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computer solutions 1 8. 



Durbin-Watson Test for Autocorrelation 
EXCEL 





A 


B 


C 


1 


Durbin-Watson Statistic 


2 








3 


d = 1 .8983 







Excel Durbin-Watson Statistic 

1. The first two columns of Table 18.6 are listed in columns A and B of CX18SHIP.XLS, with t values in A2:A17, y 
values in B2:B1 7, and labels in A1 and B1 . First, we run the regression analysis: Click Tools. Click Data Analysis. 
Select Regression. Click OK. 

2. Enter B1:B17 into the Input Y Range box. Enter A1:A17 into the Input X Range box. Select Labels. Enter C1 into 
the Output Range box. Check the Residuals box. Click OK. 

3. The output lists the residuals, in cells E25:E40. Click Tools. Click Data Analysis Plus. Click Durbin-Watson Statis- 
tic. Click OK. Do not select "Labels." Enter E25:E40 into the Input Range box. Click OK. On a separate sheet, the 
Durbin-Watson statistic is reported as d = 1 .8983. 



MINITAB 

Minitab Durbin-Watson Statistic 
Regression Analysis: y versus t 

The regression equation is 
y = 34.0 + 0.700 t 

Predictor Coef SE Coef T 
Constant 34.008 1.700 20.01 
t 0.6999 0.1758 3.98 
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P 

0.000 
0.001 



Durbin-Watson statistic = 1.89832 

Using file CX18SHIP.MTW, with t values in C1 and /values in C2, we run the regression analysis and specify that the out- 
put include the Durbin-Watson statistic: Click Stat. Select Regression. Click Regression. Enter C2 into the Response 
box. Enter C1 into the Predictors box. Click Options. Select Durbin-Watson statistic. Click OK. Click OK. The partial 
printout shown here includes the Durbin-Watson statistic, reported as d = 1 .89832. 



represents previous values of the dependent variable. We will examine two levels 
of the autoregressive forecasting model: 

1. First-order autoregressive model The forecast for a given time period is 
based on the value observed in the preceding time period: 

% = + ^lVt-l w ^ tn Vt-i = t ^ ie va l ue °f V m period t — 1 
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2. Second-order autoregressive model The forecast for a given time period is 
based on the values observed in the two previous time periods: 

% = ^0 + ^lVt-l + ^i^t-i w ' tn Vt-l = ^ e value of y in period t — 1 

y t _ 2 = the value of y in period t — 2 

Table 18.7 shows annual billings (millions of dollars) for new U.S. manufac- 
tured general aviation aircraft shipped from 1986 (coded as t = 1) through 2005 



Year 


Year Code (t) 
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mi o 

I y I o 


i yy i 


D 
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"7 
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1 o4U 
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1 993 


8 


2144 


1 840 


1 968 


1994 


9 


2357 


2144 


I o^f u 


1995 


10 


2842 


2357 


2144 


1996 


11 


3048 


2842 


2357 


1997 


12 


4580 


3048 


2842 


1998 


13 


5761 


4580 


3048 


1999 


14 


7843 


5761 


4580 


2000 


15 


8558 


7843 


5761 


2001 


16 


8641 


8558 


7843 


2002 


17 


7719 


8641 


8558 


2003 


18 


6434 


7719 


8641 


2004 


19 


6816 


6434 


7719 


2005 


20 


8667 


6816 


6434 














r 


1 


1 



TABLE 18.7 



Annual billings (millions of 
dollars) for new U.S. manu- 
factured general aviation 
aircraft shipped from 1986 
through 2005. In these 
autoregressive forecasting 
models, the estimated value 
for a given year is based 
either on the value 
observed in the preceding 
year (first-order model) or 
on the values in the two 
preceding years (second- 
order model). The underly- 
ing data are in file CX18AIR. 



First-order autoregressive model: 
y t = 443.9 + 0.987y f _ 1 



Second-order autoregressive model: 
y t = 414.7 + 1.50y t _ 1 - 0.54y f _ 2 



Data source: General Aviation Manufacturers Association, 2005 Statistical Databook, p. 13. 
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(coded as t = 20). If we were to fit linear and quadratic trend equations to the 
time series, the results would be as follows: 

Linear: The equation is y t = —178.7 + 434. 04t and the Durbin- 
Watson statistic = 0.47 (strong positive autocorrelation). 

Quadratic: The equation is y t = 557.3 + 233. 3? + 9.558i 2 and the Durbin- 
Watson statistic = 0.50 (strong positive autocorrelation). 

In applying either of these estimation equations, we find very significant posi- 
tive autocorrelation between consecutive values of the aircraft shipment billing 
data. A year in which billings exceed the model's estimate tends to be followed by 
another year in which billings are higher than estimated. Similarly, if the billings 
for a year are lower than the model has estimated, the billings for the next year 
also tend to be lower than estimated. 

For each of the years in the time series, Table 18.7 shows the year code (t), the 
billing amount (y t ) for the year, the billing amount for the preceding year (y f _ 1 ), 
and the billing amount for the period 2 years earlier (y f _ 2 ). Notice how the billings 
in the fourth column are shifted downward one year from those in the third col- 
umn, and how those in the fifth column are shifted downward two years from 
those in the third column. We simply apply either simple or multiple regression in 
obtaining the first-order and second-order autoregressive equations in the lower 
portion of the table. 

The two equations at the bottom of Table 18.7 are the results when we apply 
the first-order and second-order autoregressive forecasting models to the aircraft 
billing data in the table. Using the mean absolute deviation criterion presented in 
Section 18.6, we find that MAD = $603.1 million for the first-order model and 
$575.0 million for the second-order model, indicating that the second-order model 
is a slightly better fit to the series. Given the available data and the autoregressive 
forecasting equations, the billings forecast for 2006 could be made as shown 
below: 

Forecast for 2006, using the first-order autoregressive model: 
y t = 443.9 + 0.987^_ 1 

and 

y 2006 = 443.9 + 0.98 7y 2005 = 443.9 + 0.987(8667) = $8998.2 million 

Forecast for 2006, using the second-order autoregressive model: 
y t = 414.7 + l.50y t _ l - 0.54y f _ 2 

and 

5*2006 = 414 - 7 + l-50y 2 oo5 - 0.54y 2004 = 414.7 + 1.50(8667) - 0.54(6816) 
= $9734.6 million 

Because its MAD value is the lesser of the two, the forecast with the second- 
order autoregressive model would seem to be the preferred one. However, this 
does not mean we should automatically use a second-order autoregressive model 
instead of a first-order model. In fitting the second-order model, we must discard 
the initial two years in the series instead of just one. Referring to Table 18.7, note 
that years 1986 and 1987 would be considered as "missing data" in this model 
because no billing values were provided for years prior to 1986. Discarding the 
first two periods in a time series might not be acceptable if the series already 
happens to contain only a small number of periods. 
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Computer Solutions 18.8 describes the Excel and Minitab procedures for ob- 
taining the first-order and second-order autoregressive equations shown at the 
bottom of Table 18.7. The Minitab procedure involves a special "Lag" module 
that makes it convenient to shift the data and create the y t _ 1 and y t _ 2 series that 
are lagged by one and two years, respectively. 




Autoregressive Forecasting 
EXCEL 

Excel Autoregressive Forecasting [equations at the bottom of Table 18.7] 



1. In file CX18AIR.XLS, the billings time series in Table 18.7 is in C2:C21, with the label in C1 . Select cells C2:C20 and 
copy them to cells D3:D21 — this will be the series of (f - 1) values. Select cells C2:C19 and copy them to E4:E21 — 
this will be the series of (r - 2) values. 

2. The first-order autoregressive equation is obtained by performing a simple regression in which the Y input range is 
C3:C21 and the X input range is D3:D21 When performing the regression, do not select "Labels." To obtain the resid- 
uals and predicted values, select Residuals. 

3. The second-order autoregressive equation is obtained by performing a regression analysis in which the Y input range 
is C4:C21 and the X input range is D4:E21. Do not select "Labels." To get the residuals and predicted values, select 
Residuals. 

MINITAB 

Minitab Autoregressive Forecasting [equations at the bottom of Table 18.7]. 

1 . Using file CX18AIR.MTW, with y values in C3: Click Stat. Select Time Series. Click Lag. Enter C3 into the Series 
box. Enter C4 into the Store lags in box. Enter 1 into the Lag box. Click OK. The (t — 1) series will be stored in C4. 

2. Repeat step 1 , but enter C5 into the Store lags in box and enter 2 into the Lag box. The (f - 2) series will be stored 
in C5. 

3. To obtain the first-order autoregressive equation Click Stat. Select Regression. Click Regression. Enter C3 into the 
Response box. Enter C4 into the Predictors box. Click OK. 

4. To obtain the second-order autoregressive equation, carry out step 3, but enter C4 C5 into the Predictors box. 





exercises 













18.42 When autocorrelation of the residuals is present, positive autocorrelation of the residuals at the 0.05 level 
what effect can this have on interval estimation and sig- of significance. Comment on the appropriateness of 
nificance tests regarding the regression model involved? using this model for this set of data. 



18.43 What is the Durbin- Watson test for autocorrelation, 



and how can it be useful in evaluating the relevance of a 


f 


y t 


t 


y t 


t 


y t 


given regression model that has been fitted to a set of 




97.0 




125.5 




150.6 


time series data? 
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1 1 




109.7 




132.9 




157.8 




2 


7 


12 


18.44 The least-squares regression equation y = 


3 


110.3 


8 


139.7 


13 


158.3 


98.1 + 5. Of has been fitted to the shipment data given 


4 


121.5 


9 


145.6 


14 


166.9 


here. Calculate the Durbin-Watson d statistic and test for 
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130.4 


10 


149.4 


15 


175.8 
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18.45 The least-squares regression equation y = 15.2 + 
10. 7t has been fitted to the shipment data in the table. 
Calculate the Durbin-Watson d statistic and test for posi- 
tive autocorrelation of the residuals at the 0.05 level of 
significance. Based on your conclusion, comment on the 
appropriateness of using this model for this set of data. 



t 


y t 


t 




t 


y t 


1 


65.6 


1 


68.3 


13 


141.3 


2 


56.7 


8 


79.1 


14 


149.6 


3 


79.5 


9 


104.2 


15 


177.1 


4 


53.4 


10 


104.2 


16 


193.6 


5 


59.7 


1 1 


115.4 


17 


238.9 


6 


71.1 


12 


86.6 


18 


259.2 



18.46 A regression model has 3 independent variables and 
20 observations, and the calculated Durbin-Watson d statis- 
tic is 0.91. What, if any, conclusion will be reached in test- 
ing for autocorrelation of the residuals with a: 

a. two-tail test at the 0.02 level of significance? 

b. one-tail test for positive autocorrelation at the 
0.05 level? 

18.47 A regression model has 2 independent variables 
and 16 observations, and the calculated Durbin-Watson d 
statistic is 2.78. What, if any, conclusion will be 
reached in testing for autocorrelation of the residuals 
with a: 

a. two-tail test at the 0.05 level of significance? 

b. test for negative autocorrelation at the 0.01 level? 

18.48 Analysis of a time series consisting of weekly visits 
(in thousands) to a corporate website has led to the fol- 
lowing autoregressive forecasting equation: y t = 34.50 + 
0.58y f _j — 0.72vj_ 2 - If there were 260 thousand visitors 
this week and 245 thousand last week, how many visitors 
would the forecast predict for next week? 

18.49 For the shipment data listed in Exercise 18.45, 
construct a first-order autoregressive forecasting equation 
and make a forecast for period 19. 



18.50 According to the Investment Company Institute, 
total industry net assets (billions of dollars) for equity 
funds were as shown here for years 1981 (coded as t = 1) 
through 1998. Construct a second-order autoregressive 
forecasting equation and determine what the forecast 
would have been for 1999. source: Investment Company Insti- 
tute, Mutual Fund Fact Book 1999, p. 67. 
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15 
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83.1 


10 


245.6 


16 


1726.1 


5 


1 16.9 


1 1 


41 1.2 


17 


2368.0 


6 


161.4 


12 


522.1 


18 


2978.2 



18.51 Over the past 20 years, inventory carrying costs for 
a large tire manufacturing facility have been as follows. 
Data are in thousands of dollars. Construct a first-order 
autoregressive forecasting equation for these data, calcu- 
late the mean absolute deviation (MAD) for the fit of the 
forecast values to the actual values, then use the equation 
to forecast inventory carrying costs for time period 21. 
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/t 
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y t 
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2828.5 


2 
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2734.8 


3 
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13 
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3437.2 


17 


2815.0 


8 


3203.2 
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3206.2 
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2938.1 


10 


2963.1 


20 


2918.6 



18.52 Repeat Exercise 18.51, but for a second-order 
autoregressive forecasting equation. Compare the MAD 
values for forecasts generated by the two equations and 
indicate which one is the better fit to these data. 



1 8.8 INDEX NUMBERS 



In Section 18.4, we discussed seasonal indexes, measures intended to reflect the 
extent to which various parts of the year experience higher or lower levels of pro- 
duction, demand, or other kinds of economic activity. This section introduces an- 
other concept that includes the word "index," but this is where the similarity 
ends. Before introducing the notion of index numbers, let's consider a variety of 
settings in which they could be useful: 

• A family of four has a household income of $50,000, up 15% from three 
years ago, but they still have a lot of trouble making ends meet. Is the family's 
purchasing power stronger or weaker than it was three years ago? 
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• Wishing to compare the "livability" of various metropolitan areas, a 
researcher collects data on crime, suicide, alcoholism, and unemployment 
rates in each. 

• Critics point out that the $8000 per semester tuition at a state university has 
escalated the cost of a college education beyond the reach of the typical fam- 
ily. University administrators, pointing to inflation, respond that a college ed- 
ucation is more accessible than ever before. 

For applications like these, it is useful to convert information to the form of 
an index number. An index number is a percentage that expresses a measurement 
in a given period in terms of the corresponding measurement for a selected base 
period. The measurement in the base period is defined as 100, and the measure- 
ment in a given period will be greater than or less than 100, depending on 
whether the measurement was greater than or less than that of the base period. 
For example: 

In 1990 a company's inventory 

costs were $3.6 million. — > I 1990 = 100 for the 1990 base period 

In 2001 the company's inventory 

/. 5.9 \ 

costs were $5.9 million. — > I 7001 = 163.9 I i.e., • 1001 

In other words, if inventory costs in 1990 are defined as 100, 2001 costs can 
be expressed as 163.9, or 63.9% higher. Like other index numbers, this one is 
unitless. Note that both the numerator and the denominator are described in 
millions of dollars, and these cancel out during the division. 

The base period for an index number should be a relatively typical period 
during which extremely unusual phenomena have not occurred. For example, it 
wouldn't be desirable to construct an index of oil prices using a base year during 
which there was either a severe oil glut or a shortage. 

Index numbers are used in measuring changes over time for such diverse enti- 
ties as stock values, prices, wages, production, and output. Some of them even 
combine many different measurements into a single value. If you happen to live in 
one of the colder regions of the nation, you're familiar with "wind chill," the trans- 
lation of outside temperature and wind velocity into a single number representing 
what would be an equally uncomfortable temperature if the wind were calm. This 
particular indicator, based on Antarctic experiments of the 1930s, is not derived by 
the methods through which index numbers are usually determined, but it's yet an- 
other example of providing information by condensing two or more numbers into 
just one. 

Many popular statistical indexes are in use, and you may already be familiar 
with such indexes as the Dow Jones Industrial Average, the Standard & Poor's 
Index of 500 Stocks, and the Consumer Price Index. The latter can be used in de- 
termining such things as whether the household earning $50,000 has more pur- 
chasing power than three years ago. 

The Consumer Price Index 

The Consumer Price Index (CPI) describes the change in prices from one time pe- 
riod to another for a fixed "market basket" of goods and services. It is generated 
monthly by the U.S. Bureau of Labor Statistics and is probably the index with 
which both businesspersons and the general public are most familiar. 
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Derivation of the CPI 

The CPI is based on periodic consumer expenditure surveys. At this writing, the 
base prices are for the 1982-1984 period. The change to the 1982-1984 base pe- 
riod was made on the basis of data from the 1982-1984 Consumer Expenditure 
Survey. From 1971 until 1987, the base period for the index had been 1967. 

The "market basket" for the CPI includes items that reflect the price of bread, 
physicians' and dentists' fees, alcoholic and nonalcoholic beverages, clothing, 
transportation, shelter, and other goods and services purchased for day-to-day liv- 
ing. Every month, prices are obtained from 57,000 housing units and 19,000 es- 
tablishments representing 85 areas across the United States. 1 The resulting data 
are used in generating price indexes for specific regions and cities as well as for 
various categories of products and services. 

Applications 

Economic Indicator By reflecting changes in prices, the Consumer Price Index 
is the primary measurement of inflation in the United States and of the perceived 
success or failure of governmental policies to control inflation. Political candi- 
dates are eager to point out increases in the CPI when they wish to gain office and 
equally quick to call attention to any slowing in the growth of the CPI whenever 
they have been in office and are defending past policies. 

Escalator A common application of the Consumer Price Index is in the auto- 
matic adjustment of contracted wages or pension benefits through what are 
known as escalator clauses or cost-of-living adjustments (COL As). As prices go 
up, the wages or benefits are increased, either (1) in proportion to the change in 
the CPI, or (2) according to a formula that has been included in the collective bar- 
gaining agreement or set forth in the relevant legislation. 

Following a 1987 work stoppage, 4600 workers at Rohr Industries' aircraft 
parts plants in California got a new three-year contract negotiated by the Machin- 
ists Union. Among other items, the contract called for hourly wage increases based 
on the Consumer Price Index. The U.S. Department of Labor summed up the con- 
tract details: 

Three-year agreement negotiated 2/87 also provided: lump sum payments equal 
to 10 percent of pay 1st year and 6 percent in the 2nd and 3rd years; current 
COLA rolled into base rates and cost-of-living adjustment formula continued to 
provide 1 cent for each 0.3-point movement in the BLS-CPI; $24 (was $20) 
monthly pension benefit rate; 100 percent (was 95) employer payment for pre- 
ferred provider option-health care; $1 million (was $250,000) lifetime major 
medical maximum; $15,000 (was $10,000) life insurance 2 

Deflator Using the Consumer Price Index, current prices and wages can be de- 
flated to real prices or real wages. Compared to their "current" counterparts, 
these let us more readily determine whether we have more earning power than in 



1 The New York Times Almanac 2006, p. 330. 

^Bureau of Labor Statistics, U.S. Department of Labor, Current Wage Developments, April 1987, 
p. 10. 
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some previous period. If a person's real income has not kept pace with real prices, 
his or her economic condition and standard of living will have decreased. Current 
(nominal) income can be converted to real (constant) income as follows: 

Converting current income to real income: 

Current income 

Real income = — — — - — : — • 100 

Consumer Price Index 



In 2000 the median U.S. family income was $50,890, compared to the 1970 
figure of $9867. With 1982-1984 = 100 as the base, the 2000 Consumer Price 
Index was 172.2, compared to 38.8 in 1970. 3 Had real income increased or 
decreased from 1970 to 2000? To get the answer, we need only compare the real 
income levels for the two years: 

• 2000 

Median family income = $50,890 and CPI (1982-1984 base) = 172.2 

$50 890 

Real income (in 1982-1984 dollars) = ' - • 100 = $29,552.85 

• 1970 

Median family income = $9867 and CPI (1982-1984 base) = 38.8 

& no CH 

Real income (in 1982-1984 dollars) = — — — • 100 = $25,430.41 

38.8 



In terms of current dollars, the 2000 median family income was 5.16 times as 
great as in 1970. The 2000 CPI, however, was just 4.44 times as large. Because the 
current income growth exceeded the CPI growth, the median family had a higher 
real income in 2000 than in 1970. Comparing real income for the two years, we 
find the $29,552.85 median family real income of 2000 was 16.2% greater than 
the $25,430.41 of 1970. Statistics In Action 18.1 provides a glimpse of some 
prices from years ago and takes a "time-traveler" approach to the CPI. 



Other Indexes 

The federal government and other institutions employ a great many special- 
purpose indexes for specific applications. The following are typical examples: 

• The Producer Price Index (PPI) Dating from 1980, this is the oldest 
continuous statistical series published by the U.S. Bureau of Labor Statis- 
tics. It reflects the prices of commodities produced or processed in the 
United States. 

• Index of Industrial Production Provided by the Board of Governors of 
the Federal Reserve System since the 1920s, this index measures changes 
in the physical volume or quantity of output of manufacturing, mining, 
and electric and gas utilities. 



^The New York Times Almanac 2003, pp. 321, 323. 



statistics i 




3? *^ iL-Lf 




The CPI Time Machine 

How would it feel to be able to buy first-class postage for 3 
cents, a quart of milk for 15 cents, a new car for $1649, and a 
new house for $1 2,638? Seems like a pretty good deal, at least 
until you consider that your salary is a mere $2359 — per year. 

It was 1946. Timex watches and Tide detergent were 
brand new, less than 1 % of households had a TV, and the 
hit song of the year was "Zip-A-Dee-Doo-Dah." The milk, 
car, house, and salary figures above were the averages for 
that year. The minimum wage was 40 cents an hour, but 
would go up to 65 cents later in the year. 

Technology hasn't yet made it possible for us to time- 
travel, but — using the Consumer Price Index (CPI) discussed 



in this section — we can translate those long-ago prices and 
salaries into their modern-day equivalents. For example, 
that $1649 car would be about $15,974 in terms of 2004 
dollars, and the 1 5-cent quart of milk would have been the 
equivalent of $1 .45 in 2004. The multiplication factor we're 
using is the ratio of the Consumer Price Indexes for 2004 
and 1946: 188.9 and 19.5, respectively. 



Source: "A Snapshot of What Life Was Like in '46," USA Today, February 12, 
1996, p. 6D; New York Times Almanac 2006, p. 330. 



• Employment Cost Index (ECI) Compiled by the U.S. Bureau of Labor 
Statistics, the ECI is a quarterly measure of the total cost of employing 
labor. It includes wages and salaries, plus employers' costs for employee 
benefits such as paid leave, severance pay, health and life insurance, pen- 
sion and saving plans, Social Security, and unemployment insurance. 

• Standard & Poor's Composite Index of Stock Prices From Standard & 
Poor's Corporation, the index reflects the composite price of 500 common 
stocks. 

• New York Stock Exchange (NYSE) Composite Index Compiled by the 
New York Stock Exchange, this index measures changes in the aggregate 
value of all stocks listed on the NYSE. 



Shifting the Base of an Index 

Changing the base of an index can be desirable for two purposes: 

1. To better reflect the mix of public tastes or economic activities from which the 
index is compiled In this application, the change to a more recent base year 
involves a reweighting of the components comprising the index. 

2. To convert the index to the same base period as another series so that the two 
may be more easily compared For example, Table 18.8 lists 1996 through 
2001 values for two indexes: (a) the Engineering News-Record construction 
price index and (b) the Producer Price Index for insulation materials. The for- 
mer has 1996 as its base; the latter, 1982. 

The two indexes will be more comparable if they have a common base year, 
and the 1996 base year of the construction price index has been arbitrarily se- 
lected for this purpose. As shown in column (3) of Table 18.8, the shift is made by 
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d) 


(2) 




(3) 




Engineering News-Record 


Producer Price Index, Insulation 




Construction Price Index 




Materials 


Year 


(1996 = 100) 


(1982 = 100) 


(1996 = 100) 


1996 


100.0 


118.E 




100.0 


1998 


105.3 


119.7 


100.7 


ZUUU 


II U. / 


128.6 


108.2 


ZUU I 


1 1 Z.o 


127.5 


p *-107.2 








107.2 = 


127.5 „„„ 

00 

118.9 















Data source: Bureau of the Census, U.S. Department of Commerce, Statistical Abstract of the United States 2002, 
pp. 584, 585. 



simply dividing each index number by the index for the new base year, then mul- 
tiplying the quotient by 100. Following the conversion, it is apparent that the con- 
struction prices increased 12.8% from 1996 to 2001, while the price of insulation 
materials went up by 7.2% during the same period. 



TABLE 18.8 



As reported, these indexes 
have base years of 1996 
and 1982, respectively. By 
shifting the base year for 
insulation materials as 
shown here, the two series 
will be more comparable, 
with each having 1996 as 
its base year. 



exercise! 





4*, , V 



18.53 What is an index number, and what is its value for 
the base period? 

18.54 What is the Consumer Price Index and what does 
it reflect? 

18.55 The Consumer Price Index and average annual 
wages in selected industries were as shown in the 
table, source: Bureau of the Census, Statistical Abstract of the United 
States 1999, pp. 451, 607. 

Average Wages (S/Hour) 

CPI 

Retail (1982-84 
Manufacturing Mining Services Trade =100) 



1980 
1985 
2001 



$7.27 
9.54 
14.84 



$9.17 
11.98 
17.65 



$5.85 
7.90 
14.61 



$4.88 82.4 
5.94 107.6 
9.82 177.1 



From 1980 to 2001, the average wages went up for 
workers in all four industries. However, which wage- 



earning groups fared best and worst in terms of their per- 
centage change in spending power during this period? 

18.56 An executive's salary increased from $95,000 to 
$110,000 between 2000 and 2004, but the Consumer 
Price Index went from 172.2 to 188.9 during this time 
span. In terms of real income, what was the percentage 
increase or decrease in the executive's real earnings? 

18.57 Use the Consumer Price Index to convert the 2000 
salary of the executive in Exercise 18.56 to its 2004 
equivalent. 

18.58 The Producer Price Index values for plumbing 
fixtures and brass fittings from 1996 through 2001 are 
shown below. Convert the index numbers so that the base 
period will be 1996 instead of 1982. source: Bureau of the 
Census, Statistical Abstract of the United States 1999, p. 584. 



1996 
1997 
1998 



171.1 
174.5 
175.1 



1999 
2000 
2001 



176.7 
180.4 
180.7 
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• Time series model and its components 

The analysis of time series data is useful in identifying past patterns and tenden- 
cies as well as helping to forecast future values. In the classical time series model, 
the time series is made up of trend (T), cyclical (C), seasonal (5), and irregular (I) 
components. Although other possibilities exist, a popular approach to combining 
these components is the multiplicative approach, or y = T • C • S • I. The model 
assumes that any observed value of y is the result of influences exerted by the four 
components. 

• Trend equations and smoothing techniques 

Trend equations can be fitted to a time series, and two approaches are 
examined: (1) the linear trend equation, y = b Q + b^x; and (2) the quadratic 
trend equation, y = b Q + b t x + b 2 x 2 . The time series can also be smoothed by 
using a centered moving average or by exponential smoothing. When moving av- 
erages are used, the procedure will differ slightly, depending on whether the num- 
ber of periods in the basis of the average is odd or even. 

• Seasonal indexes 

Depending on the nature of the time series, strong seasonal patterns may be pre- 
sent for certain months or quarters, and seasonal indexes can be calculated in 
order to inflate or deflate a trend value to compensate for seasonal effects. Sea- 
sonal indexes can be determined through the ratio to moving average method, 
and a time series can be deseasonalized by using the seasonal indexes for the re- 
spective months or quarters of the year. 

• Seasonal indexes and forecasting 

In forecasting based on time series information, data from the past are used in pre- 
dicting the future value of the variable of interest. Two approaches to time series fore- 
casting are the extrapolation of a trend equation fitted to the series and the use of ex- 
ponential smoothing. Once a forecast has been made for a given year, seasonal 
indexes can be used in determining the forecast for each month or quarter of the year. 

• Testing for fit and autocorrelation 

When two or more models are fitted to the same time series, it's useful to have one 
or more criteria on which to compare the closeness with which they fit the data. 
Two such criteria are the mean absolute deviation (MAD) and mean squared 
error (MSE). The Durbin-Watson test can be used to determine whether the resid- 
uals in a regression model are autocorrelated. 

• Autoregressive forecasting 

When values in a time series are highly correlated with each other, autoregressive 
forecasting models can be useful. In autoregressive forecasting, each independent 
variable represents the value of the dependent variable for a previous time period. 
First-order and second-order autoregressive forecasting models are discussed. 

• Index numbers 

An index number is a percentage that compares a measurement to its value 
for a selected base period. The Consumer Price Index describes the change in 
prices from one period to the next for a fixed "market basket" of goods and ser- 
vices. The CPI is used as a measurement of inflation, as a "cost of living" adjust- 
ment in labor contracts, and as a means of deflating current prices or wages to 
"real" prices or wages. 
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Classical Time Series Model 

y = T ■ C ■ S ■ I 



Linear Trend Equation 

y = b 0 + b^x 



where y = observed value of the time series variable 
T = trend component 
C = cyclical component 
S = seasonal component 
I = irregular component 



where y = the trend line estimate of y 
x = time period 



Quadratic Trend Equation 

y = b 0 + b]X + b 2 x 2 where y = the trend line estimate of y 

x = time period 
(x and x 2 are treated as separate variables.) 

Exponential Smoothing of a Time Series 

E t = a ■ y f + (1 - a) ■ E f _ 1 



where E f = the exponentially smoothed value 
for the time period t 
the exponentially smoothed value 
for the previous time period 
the actual y value in the time series for 
time period t 
a = the smoothing constant, and 
0 < a < 1 



J f - 1 

y, 



Deseasonalized Time Series 

For each time period, the deseasonalized value is 

Value in the original time series 



D 



(Seasonal index for that period)/100 



Forecasting with Exponential Smoothing 

F t +1 = oi • y t + (1 — a) • F t where F t 



+ 1 = forecast for time period t + 1 
y t = actual y value for time period t 
F t = the value that had been 

forecast for time period t 
a = the smoothing constant, and 
0 < a < 1 



Dividing an Annual Forecast into Seasonal Components 

Forecast for the yesr - 

Forecast for a season = — : • [(Seasonal index)/100] 

Number of seasons 

Mean Absolute Deviation (MAD) Criterion for Evaluating Fit 

2 1 y t - % | 



MAD = 



where y t = an observed value of y 

y t = the value of y that is predicted using the 

estimation equation or model 
n = the number of time periods 
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Mean Squared Error {MSE) Criterion for Evaluating Fit 

^(y t -y t ) 2 

MSE = where y t = an observed value of y 

y t = the value of y that is predicted using the 

estimation equation or model 
n = the number of time periods 

Durbin-Watson Test, Description of Residual (Error) for Time Period t 

£ t = ps t _ 1 + z t where e t = residual associated with time period t 

E t _ j = residual associated with time period t — 1 
p = the population correlation coefficient between 

values of s t and s t _ p with — 1 < p < +1 
z t = a normally distributed random variable 
with p, = 0 



Durbin-Watson Test for Autocorrelation 

• Null and alternative hypotheses 

The test is 



for positive 
autocorrelation 



for negative 
autocorrelation 



a two-tail test 





p < 0 


H Q : 


p > 0 


H Q : 


P 


= 0 




p > 0 


H,i 


p < 0 


Hv 


P 


* 0 



• Test statistic 



d = 



t=2 



t = n 

2-? 



c «- 1 



where e f = observed residual (error) 

associated with time period f 
observed residual (error) 
associated with time period t — 1 
h = number of observations, 
with n > 15 

• Critical values and decision zones Look up the appropriate d L and d v values in the 
Durbin-Watson tables in Appendix A. The actual critical values and decision zones 
are based on (1) the type of test (positive, negative, or two-tail), (2) a = the level of 
significance for the test, and (3) k = the number of independent variables in the re- 
gression model. The decision zones and appropriate conclusions are shown in Figure 
18.5. Appendix A provides three tables for the Durbin-Watson test. They correspond 
to the 0.05, 0.025, and 0.01 levels of significance for a one-tail (positive or negative) 
test, and to the 0.10, 0.05, and 0.02 levels for a two-tail test. 

Autoregressive Forecasting Model 

• First-order autoregressive model The forecast for a given time period is based on the 
value observed in the preceding time period: 

% = + ^lVt-i with y t _ j = the value of y in period t — 1 

• Second-order autoregressive model The forecast for a given time period is based on 
the values observed in the two previous time periods: 

y t = b 0 + b 1 y t _ 1 + b 1 y t _ 1 with y t _ 1 = the value of y in period t — 1 

y t -j_ = the value of y in period t — 2 



Converting Current Income to Real Income 

_ , Current income 

Real Income 



Consumer Price Index 



100 
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18.59 Fit a linear trend equation to the following data 
describing the number of active U.S. military personnel 
from 1994 through 2000. What is the trend estimate for 
2010? source: Time Almanac 2001, p. 370. 



Year 



Military Personnel 
(millions) 



1994 
1995 
1996 
1997 
1998 
1999 
2000 



1.61 
1.52 
1.47 
1.44 
1.41 
1.38 
1.37 



18.60 Given the time series in Exercise 18.59, and using 
exponential smoothing with a = 0.7, what would have 
been the forecast for 2001? 

18.61 SUBWAY has grown to include more than 20,000 
restaurants worldwide since they opened their first one in 
1965. The accompanying data show the number of 
SUBWAY restaurants worldwide from 1985 through 
2002. Fit a quadratic equation to these data and estimate 
the number of SUBWAY restaurants worldwide in 2010. 
source: SUBWAY.com, July 3, 2003. 





SUBWAY 




SUBWAY 


Year 


Restaurants 


Year 


Restaurants 


1985 


596 


1994 


9893 


1986 


991 


1995 


11,420 


1987 


1810 


1996 


12,516 


1988 


2888 


1997 


13,066 


1989 


4071 


1998 


13,600 


1990 


5144 


1999 


14,162 


1991 


6187 


2000 


14,662 


1992 


7327 


2001 


15,762 


1993 


8450 


2002 


1 7,482 



18.62 Given the time series in Exercise 18.61, and using 
exponential smoothing with a = 0.8, what would have 
been the forecast for 2003? 

18.63 From 1990 through 1996, average monthly cable 
TV bills were as shown below, source: Richard Latture and Marcia 
Staimer, "Cable Bills on the Rise," USA Today, December 9, 1996, p. 1B. 



Year 



Average Bil 



1990 
1991 
1992 
1993 
1994 
1995 
1996 



$17.60 
19.20 
20.62 
22.49 
21.60 
23.07 
25.50 



a. Fit a linear trend equation to the time series. Using this 
equation, determine the trend estimate for 2010. 

b. Fit a quadratic equation to the time series, then use the 
equation to determine the trend estimate for 2010. 

c. Which of the preceding equations is the better fit to the 
time series if the MAD criterion is used? If the MSE 
criterion is used? 

18.64 The annual number of Memorial Day travelers 
from 1992 through 1999 is shown here, source: Sam Ward, 
"Memorial Day Travelers, " USA Today, May 12, 2000, p. 1D. 



Year 


Travelers (millions) 


1992 


27.0 


1993 


28.6 


1994 


29.2 


1995 


30.0 


1996 


30.3 


1997 


31.8 


1998 


32.0 


1999 


33.4 



a. Fit a linear trend equation to the time series. Using this 
equation, determine the trend estimate for 2010. 

b. Fit a quadratic equation to the time series, then use the 
equation to determine the trend estimate for 2010. 

c. Which of the preceding equations is the better fit to the 
time series if the MAD criterion is used? If the MSE 
criterion is used? 

18.65 The following data show U.S. retail sales of canoes 
from 1980 through 1997, with data in thousands of boats. 
source: National Marine Manufacturers Association, Boating 1995; and Bu- 
reau of the Census, Statistical Abstract of the United States 1999, p. 269. 



Year 


Sales 


Year 


Sales 


1980 


105 


1989 


80 


1981 


126 


1990 


75 


1982 


101 


1991 


72 


1983 


107 


1992 


78 


1984 


103 


1993 


90 


1985 


79 


1994 


100 


1986 


80 


1995 


98 


1987 


85 


1996 


93 


1988 


90 


1997 


104 



b. 



d. 



Construct a graph of the time series. Does the overall 
trend appear to be upward or downward? 
Construct a three-year centered moving average for 
this series. 

Using the constant a = 0.3, fit an exponentially 

smoothed curve to the original time series. 

Repeat part (c) using a = 0.7. How has the new value 

for the constant affected the smoothness of the fitted 

curve? 
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18.66 Given the time series in Exercise 18.65, and using 
exponential smoothing with a = 0.5, what would have 
been the forecast for 1998? 

18.67 The following data are quarterly sales for the 

J. C. Penney Company for 2000 through 2002. Determine 
the quarterly seasonal indexes, then use these indexes to 
deseasonalize the original data, source: J.C. Penney, Inc., 2001 
Annual Report, p. 31, 2002 Annual Report, p. 39. 



Gross Sales (Billions) 





1 


II 


III 


IV 


2000 


$7.53 


7.21 


7.54 


9.57 


2001 


7.52 


7.21 


7.73 


9.54 


2002 


7.73 


7.20 


7.87 


9.55 



18.68 Construct three-period and four-period centered 
moving averages for the quarterly J. C. Penney sales in 
Exercise 18.67. Are there any other retailing companies 
for whom a four-period moving average might have such 
a great effect on smoothing quarterly sales data? If so, 
give a real or hypothetical example of such a company. 

18.69 The following data describe quarterly residential 
natural gas sales from 1991 through 1994. Determine the 
quarterly seasonal indexes, then use these indexes to de- 
seasonalize the original data, source: American Gas Association, 
1995 Gas Facts, p. 67. 



Residential Gas Sales (Trillions of BTU) 



I II III IV 


1991 


2048 


757 


398 


1342 


1992 


2032 


818 


409 


1435 


1993 


2276 


931 


436 


1454 


1994 


2426 


788 


395 


1363 



18.70 Construct four-period and five-period centered 
moving averages for the quarterly residential gas sales in 
Exercise 18.69. Are there any other energy industries for 
whom a four-period moving average might have such a 
great effect on smoothing quarterly consumption data? If 
so, give a real or hypothetical example of such an industry. 

18.71 The Walt Disney Company has reported the 
following quarterly revenues (in billions of dollars) for its 
Parks & Resorts operating segment for fiscal years 1998 
through 2002. Determine the quarterly seasonal indexes, 
then use these indexes to deseasonalize the original data. 
source: The Walt Disney Company, Fact Book, editions 1998-2002. 

Quarter 



I II III IV 



1998 


$1,260 


1.244 


1.504 


1.524 


1999 


1.434 


1.408 


1.711 


1.553 


2000 


1.577 


1.571 


1.940 


1.715 


2001 


1.729 


1.650 


1.946 


1.684 


2002 


1.433 


1.525 


1.847 


1.660 
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18.72 Construct four-period and five-period centered mov- 
ing averages for the quarterly revenues in Exercise 18.71. 
Are there any other entertainment or recreation companies 
for whom a four-period moving average might have such a 
great effect on smoothing quarterly revenue data? If so, 
give a real or hypothetical example of such a company. 

18.73 The actual number of help-wanted ads in a local 
newspaper was 1682, 1963, 2451, and 3205 for quarters 
I through IV, respectively, of the preceding year. The cor- 
responding deseasonalized values are 2320, 2362, 2205, 
and 2414, respectively. Based on this information, what 
seasonal index is associated with each quarter? 

18.74 The trend equation y = 2050 + 12x has been de- 
veloped for the number of vehicles using the ferry service 
at a national seashore each year. (The years have been 
coded so that 1993 corresponds to x = 1.) If the seasonal 
indexes for quarters I through IV are 55.0, 130.0, 145.0, 
and 70.0, respectively, estimate the usage volume for each 
quarter of 2010. 

18.75 The trend equation y = 150 + 12x + O.Olx 2 has 
been developed for annual unit sales of a filtering system 
component used by municipal water authorities. The 
years have been coded so that 1996 corresponds to 

x = 1. If the seasonal indexes for quarters I through IV 
are 85.0, 110.0, 125.0, and 80.0, respectively, estimate 
the unit sales for each quarter of 2010. 

18.76 The National Association of Railroad Passengers 
has reported the following data describing U.S. commuter 
train ridership. source: Time Almanac 2003, p. 222. 

Year 

1995 1996 1997 1998 1999 2000 
Riders 42.2 45.9 48.5 54.0 58.3 61.6 million 
For the ridership data shown above: 

a. Fit a linear trend equation to the time series. Using this 
equation, determine the trend estimate for 2010. 

b. Fit a quadratic equation to the time series, then use the 
equation to determine the trend estimate for 2010. 

c. Which of the preceding equations is the better fit to the 
time series if the MAD criterion is used? If the MSE 
criterion is used? 

18.77 Ford Motor Company reports the average 
hourly labor cost per hour (earnings plus benefits) for 
1993-2002. Fit a quadratic trend equation to the data, fit 
a linear trend equation to the data, and then use the MSE 
criterion to determine which equation is the better fit. 
source: Ford Motor Company, 2002 Annual Report, p. 91 . 



Year 


Labor Cost 


Year 


Labor Cost 


1993 


$39.06/hour 


1998 


45.72 


1994 


40.94 


1999 


47.37 


1995 


40.45 


2000 


48.44 


1996 


41.77 


2001 


47.73 


1997 


43.55 


2002 


52.65 
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18.78 A Caribbean cruise line booked the number of pas- 
sengers (in thousands) described below for each quarter 
from 2001 through 2005. 





1 


II 


III 


IV 


2001 


27 


23 


21 


24 


2002 


32 


26 


18 


29 


2003 


28 


25 


16 


30 


2004 


37 


31 


21 


34 


2005 


38 


33 


23 


39 



a. Combining the quarterly bookings for each year, fit a 
linear trend equation and forecast the number of 
bookings for 2009. 

b. Determine the seasonal indexes and break the 2009 
forecast in part (a) into its quarterly components. 

c. Combining the quarterly bookings for each year, use 
exponential smoothing, with a = 0.3, to forecast the 
number of bookings for 2006. 

d. Use the seasonal indexes to break the 2006 forecast in 
part (c) into its quarterly components. 

18.79 Use the seasonal indexes obtained in Exercise 
18.78 to generate a deseasonalized series of quarterly 
bookings for those data. 

18.80 Monthly loan applications for a local bank from 
2002 through 2005 were as shown in the table. 





J 


F 


M 


A 


M 


J 


2002 


30 


28 


32 


33 


32 


32 


2003 


22 


26 


30 


36 


35 


40 


2004 


19 


22 


30 


31 


31 


36 


2005 


29 


28 


42 


47 


49 


53 




J 


A 


S 


O 


N 


D 


2002 


39 


44 


48 


47 


33 


28 


2003 


41 


36 


32 


34 


23 


19 


2004 


39 


37 


36 


36 


32 


29 


2005 


51 


55 


50 


46 


36 


33 



a. Combining the monthly applications for each year, fit 
a linear trend equation and forecast the number of ap- 
plications for 2008. 

b. Determine the seasonal indexes and break the 2008 
forecast in part (a) into its monthly components. 

c. Combining monthly applications for each year, use ex- 
ponential smoothing and a = 0.4 to obtain a forecast 
for 2006. 

d. Use the seasonal indexes to break the 2006 forecast in 
part (c) into its monthly components. 

18.81 Use the seasonal indexes obtained in Exercise 
18.80 to generate a deseasonalized series of monthly loan 
applications for the bank. 



18.82 Apply the Durbin- Watson test and the 0.05 level of 
significance in examining the data series in Exercise 18.65 
for positive autocorrelation. Based on your conclusion, 
comment on the appropriateness of the linear regression 
model for these data. 

18.83 Apply the Durbin- Watson test and the 0.05 level of 
significance in examining the data series in Exercise 18.61 
for positive autocorrelation. Based on your conclusion, 
comment on the appropriateness of the linear regression 
model for these data. 

18.84 The Producer Price Index for finished consumer 
goods from 1980 through 2004 is in file XR18084, with 
1982 = 100 and 1980 coded as t = \. Construct a first- 
order autoregressive forecasting equation for these data, 
calculate the mean absolute deviation (MAD) for the fit of 
the forecast values to the actual values, then use the equa- 
tion to forecast the PPI for 2005. source: Bureau of the Census, 
Statistical Abstract of the United States 2006, p. 489. 

18.85 The Consumer Price Index from 1961 through 
2004 is in file XR18085, with 1982-1984 = 100 and 1961 
coded as t = 1. Construct a first-order autoregressive 
forecasting equation for these data, calculate the mean 
absolute deviation (MAD) for the fit of the forecast 
values to the actual values, then use the equation to fore- 
cast the CPI for 2005. source: The New York Times Almanac 
2006, p. 330; and Bureau of the Census, Statistical Abstract of the United 
States 1999, p. 495. 

18.86 Repeat Exercise 18.85, but for a second-order 
autoregressive forecasting equation. Compare the MAD 
values for forecasts generated by the two equations and 
indicate which one is the better fit to these data. 

18.87 From 1996 to 2004, the median weekly earnings 
of workers represented by unions went from $610 to 
$776, and the Consumer Price Index (1982-1984 = 100) 
went from 156.9 to 188.9. What was the percentage in- 
crease or decrease in real wages for these workers? 
source: World Almanac and Book of Facts 2006, pp. 80, 135. 

18.88 Convert each of the following indexes so that the 
new base period will be 1997. 





A 


B 


1994 


101.3 


100.0 


1995 


100.0 


102.9 


1996 


103.4 


104.7 


1997 


105.6 


103.2 


1998 


107.9 


107.5 


1999 


1 10.3 


126.4 


2000 


1 16.2 


122.1 


2001 


125.3 


129.0 




(1995 = 100) 


(1994 = 100) 



734 



Part 5: Regression, Model Building, and Time Series 




Thorndike Sports Equipment (Video Unit Five) 



Negotiations are taking place for the next labor agreement 
between Thorndike Sports Equipment and its manufactur- 
ing employees. The company and the union have agreed to 
consider linking hourly wages to the predicted Consumer 
Price Index (CPI) for each of the four years of the new con- 
tract. The latest proposal would specify an hourly wage 
that is 10% of the estimated CPI (e.g., if the estimated CPI 
for a given year were 172.5, the hourly wage for that con- 
tract year would be $17.25). 

Table 18.9 shows the CPI for 1961 through 2004, 
along with the year and the year code. The data are also in 
file THORN18. The base period for the CPI figures is 
1982-1984, for which the average is 100. Assuming that 
the new contract will be in force from 2008 through 2011, 
use a computer statistical package to help determine the 
hourly wage that would be paid in each year of the con- 
tract if the estimated CPI is an extrapolation based on the 
following: 



1. A simple linear estimation equation (y = b Q + b^x) 
fitted to the 1961-2004 CPI data. 

2. A quadratic estimation equation (y = b Q + 
b x x + b 2 x 2 ) fitted to the 1961-2004 CPI data. 

Several additional questions regarding the estimation 
equations and the contract negotiations: 

3. Which equation would each negotiating party tend to 
prefer as the basis of the hourly wages to be paid dur- 
ing the contract years? 

4. Using the mean absolute deviation (MAD) criterion, 
which equation is the better fit to the CPI time series 
data? 

5. Assuming (a) the hourly wages are linked to the equa- 
tion identified in the preceding question, (b) the esti- 
mated CPI for 2011 actually occurs, and (c) the aver- 
age Thorndike worker made $12.40 per hour in 1987, 
will his or her real earnings increase or decrease from 
1987 to 2011? Explain. 



TABLE 18.9 



Values of the Consumer 
Price Index (CPI) for 1961 
through 2004. The base pe- 
riod is 1982-1984 = 100. 



Year 


Year 
Code 


CPI 


Year 


Year 
Code 


CPI 


1961 


1 


29.9 


1983 


23 


99.6 


1962 


2 


30.2 


1984 


24 


103.9 


1963 


3 


30.6 


1985 


25 


107.6 


1964 


4 


31.0 


1986 


26 


109.6 


1965 


5 


31.5 


1987 


27 


113.6 


1966 


6 


32.4 


1988 


28 


118.3 


1967 


7 


33.4 


1989 


29 


124.0 


1968 


8 


34.8 


1990 


30 


130.7 


1969 


9 


36.7 


1991 


31 


136.2 


1970 


10 


38.8 


1992 


32 


140.3 


1971 


11 


40.5 


1993 


33 


144.5 


1972 


12 


41.8 


1994 


34 


148.2 


1973 


13 


44.4 


1995 


35 


152.4 


1974 


14 


49.3 


1996 


36 


156.9 


1975 


15 


53.8 


1997 


37 


160.5 


1976 


16 


56.9 


1998 


38 


163.0 


1977 


17 


60.6 


1999 


39 


166.6 


1978 


18 


65.2 


2000 


40 


172.2 


1979 


19 


72.6 


2001 


41 


177.1 


1980 


20 


82.4 


2002 


42 


179.9 


1981 


21 


90.9 


2003 


43 


184.0 


1982 


22 


96.5 


2004 


44 


188.9 



Sources: Bureau of the Census, U.S. Department of Commerce, Statistical Abstract of the 
United States 1999, p. 495; and New York Times Almanac 2006, p. 330. 




Lawsuits— Take the Cash or Try For the Curtain? 

After Jill became pregnant, her company decided it no longer needed her services. 
Upon being terminated, Jill accused the company of bias and filed a wrongful-firing 
suit against it. The company is now trying to settle out of court and has offered Jill 
$63,000. 

Should Jill accept the settlement offer, or might it be more attractive to go to 
court and take her chances with the trial? According to a Jury Verdict Research study, 
65% of pregnancy bias cases are won by the plaintiff, and the median award for win- 
ners is $87,500. 

Essentially, Jill's alternatives can be viewed as (a) the sure $63,000 versus (b) the 
65% chance at $87,500 (which includes a 35% chance at $0). 



Based on the expected payoff decision crite 



ad, take it. 



rion discussed in this chapter, Jill 
should take the money and run. 
After reading that section, you can 
verify our advice to Jill for yourself. By 
the way, Jill should be willing to pay up 
to $22,050 for a perfect forecast as to 
what the jury would decide if she were 
to take the case to trial. That's known as 
the expected value of perfect information, 
and it's in this chapter, too. 



Source: The win percentage and median award are from 
Anne R. Carey and Marcey E. Mullins, "Cost of Bias Suits 
over Firing," USA Today, June 18, 1996, p. IB. 
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Express a decision situation in terms of decision alternatives, states of nature, 
and payoffs. 

Differentiate between non-Bayesian and Bayesian decision criteria. 

Determine the expected payoff for a decision alternative. 

Calculate and interpret the expected value of perfect information. 

Express and analyze the decision situation in terms of opportunity loss and 
expected opportunity loss. 

Apply incremental analysis to inventory-level decisions. 



19.1 INTRODUCTION 



While it's been said that nothing is certain except death and taxes, there is one more 
thing that's certain: uncertainty. A fish market owner isn't sure how many cus- 
tomers will decide to buy a flounder today, a pharmaceutical company isn't com- 
pletely certain that a new drug will be safe for every one of 20 million users, and a 
computer firm doesn't have complete certainty that a new computer in the final de- 
sign stages will still be competitive when it is finally manufactured. In each of these 
cases, a decision must be made. For the fish market owner, it's how many flounder 
to stock; for the pharmaceutical and computer firms, it's whether to proceed with 
the new product. 

Both decision making and uncertainty are also discussed in other chapters. In 
hypothesis testing, we had to decide whether to reject the null hypothesis, and un- 
certainty was reflected in the Type I and Type II errors. In the next chapter, we will 
examine how control charts can help us decide whether a process is in control, 
and we will briefly discuss how acceptance sampling can assist in deciding 
whether an entire shipment of products or components should be accepted. 

The biggest difference between these discussions and the topics of this chapter 
is that we will be concentrating here on decisions made without the benefit of 
knowledge gained from sampling. While the possibility of gathering further infor- 
mation will be considered, this will be presented in terms of, "What is the most 
we should consider paying for additional information to reduce our uncertainty 
in making the decision?" 

We will begin by structuring the decision situation and expressing it as either 
a payoff table or a decision tree diagram. The discussion then turns to two gen- 
eral approaches by which the actual decision might be reached. The chapter con- 
cludes with a discussion of incremental analysis, a technique especially useful for 
inventory decisions. 



1 9.2 STRUCTURING THE DECISION SITUATION 



Before analyzing a decision situation, we must have a structure to which its vari- 
ous components can be fitted. These components, and the structure itself, are 
described in this section. 



learning 
objectives # 

After reading this 
chapter, you should 

be able to: » 





New Technology Makes Invention Hit Bottom 



One day, while changing his baby daughter's diaper, an 
inventor was unable to find a safety pin and became con- 
cerned that it might have come loose and been swal- 
lowed by the child. Although the pin was later found, the 
experience inspired him to invent a new type of safety pin. 
His new safety pin had a secure locking mechanism, but 
could easily be installed and removed by an adult using 
just one hand. 

The product seemed a sure bet for concerned parents 
and tender babies everywhere, but its major difficulty 
turned out to be the uncontrollable variable known as 



technology. Unfortunately, the new type of safety pin 
was quickly followed by a new type of diaper, of the self- 
adhesive and disposable type. 

The appearance of this unforeseen state of nature 
quickly unfastened the safety pin's claim to fame. The 
patent for the safety pin was obtained in 1971 but ran out 
in 1988, with the primary recognition for its inventor con- 
sisting of a single television talk show appearance. 



Source: Brian O'Neill, "Disposable Diapers Wipe Out Anxious Dad's Safer 
Pin," Pittsburgh Press, March 12, 1989, p. B1 . 



Alternatives 

Decision alternatives are the choices available to the decision maker. For the 
computer firm, decision alternatives could be (1) plan to introduce the new com- 
puter, and (2) drop plans to introduce the new computer. Only one alternative can 
be selected from among those available. 

States of Nature 

After a decision alternative has been selected, the outcome is subject to chance 
and no longer under the control of the decision maker. For example, the new com- 
puter could be either competitive or noncompetitive in technology and features 
versus other brands on the market at the time of its introduction. Each of these 
possibilities is a state of nature, a future environment that has a chance of occur- 
ring. Statistics in Action 19.1 describes how the occurrence of a given state of na- 
ture spelled doom for the financial aspirations of one entrepreneur. The states of 
nature must be both mutually exclusive (only one will occur) and exhaustive (one 
of them must occur). 

Having introduced the concept of states of nature, we can now describe three 
different levels of doubt regarding the decision situation itself: 

RISK Both the possible states of nature and their exact probabilities of occur- 
rence are known. This is typical of many gambling situations, such as playing 
roulette. This is an extreme rarity in business decisions, since the exact probabilities 
are generally not known. 

UNCERTAINTY The possible states of nature are known, but their exact proba- 
bilities are unknown. Most business decision situations fall into this category, and 
the probabilities must be estimated based on past experience or executive 
judgment. 

IGNORANCE Neither the possible states of nature nor their exact probabilities 
are known. This is another rarity, as only a fugitive from Mars would have abso- 
lutely no idea what states of nature might occur after a decision has been made. 
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The Payoff Table 

The payoff table is a table in which the rows are decision alternatives, the 
columns are states of nature, and the entry at each intersection of a row and col- 
umn is a numerical payoff such as a profit or loss. The table may also include 
estimated probabilities for the states of nature. 



Example 



Payoff Table 

A German automobile manufacturer currently produces its best-selling U.S. 
model in Germany, but the relative strength of the Euro versus the U.S. dollar has 
been making the car very expensive for the market in which it competes. To en- 
sure a lower, more stable price, the company is considering the possibility of man- 
ufacturing cars for U.S. consumption at one of its South American plants. 

SOLUTION 

Table 19.1 is a payoff table that summarizes this decision situation. It includes the 
following components: 

• Alternatives (1) Continue producing cars in Germany for the U.S. market; 
or (2) manufacture cars for U.S. consumption in the South American plant. 

• States of nature Over the next two model years, the relative strength of the 
dollar versus the Euro will be (1) weak, (2) moderate, or (3) strong. 

• Probabilities Corporate economists have estimated the probabilities for the 
relative strength of the dollar as 0.3 (weak), 0.5 (moderate), and 0.2 (strong). 

• Payoffs The estimated profit for this model depends on which alternative 
is selected and which state of nature occurs. For example, if manufacturing 
is switched to the South American plant and the dollar is strong, the firm's 
profit is estimated as $16 million over the two-year period. 



The same decision situation could also be described by a decision tree, a dia- 
gram in which alternatives, states of nature and their probabilities, and payoffs are 



TABLE 19.1 



The payoff table includes 
the decision alternatives, 
the possible states of 
nature, and the payoff for 
each combination of the 
alternatives and states of 
nature. Estimated probabili- 
ties for the states of nature 
may also be included, as 
shown here. Profits are in 
millions of dollars. 



State of Nature 
Compared to the Euro, 
the strength of the U.S. dollar will be 



Alternative 


Weak 


Moderate 


Strong 




(Prob. = 0.3) 


(Prob. = 0.5) 


(Prob. = 0.2) 


German production 


$10 


$15 


$25 


South American production 


$20 


$18 


$16 



Chapter 19: Decision Theory 



739 




displayed in graphical form. Figure 19.1 is the decision tree counterpart to the pay- 
off table of Table 19.1. It provides the same information, but in a graphical form. 
The first two branches are the decision alternatives. Depending on which alterna- 
tive is selected, either node 1 or node 2 will be encountered next, and the probabil- 
ities and the states of nature are the same for each, just as in the payoff table. 

In the decision tree, each square node shows the making of a decision, and 
each circular node shows the occurrence of a state of nature. The payoff table will 
be our primary method for describing decision situations, but the decision tree 
approach will be revisited for some later applications. 



exercise 



19.1 What are the differences among risk, uncertainty, and 
ignorance? Into which category of decision situation do 
most business decisions fall? 

19.2 What is meant when we say that the states of nature 
in the payoff table must be mutually exclusive and 
exhaustive? 

19.3 The manager of a small local airport must extend a 
runway if business jets are to be accommodated. However, 



the airport is practically surrounded by a suburban hous- 
ing development, and residents are already complaining 
to the city council about the volume and noise of existing 
air traffic at the facility. Identify the decision alternatives 
and the possible states of nature that might occur after 
the decision is made. 

19.4 A management consultant, unable to locate receipts 
or records for several business trips during the past year, 
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makes some rough estimates of these expenditures and 
considers claiming them as business expenses on his 
income tax form. If he makes the claims and is not 
audited, he will owe $12,000 in taxes. If he makes the 
claims and the IRS audits his return, he will have to pay 
an extra $6,000 in taxes and penalties. If he ignores the 
trips, he will owe $14,000 in taxes. Construct a payoff 
table and a decision tree diagram for this decision 
situation. 

19.5 A trainee in a realty firm is being paid a straight 
salary, but has the opportunity to (a) continue with 
straight salary, (b) change to straight commission, or 
(c) change to a combination that includes a lower base 
salary and a commission on each sale. What states of 



nature might be useful in constructing a payoff table for 
this situation? 

19.6 Having been injured while trying to use his rotary 
lawn mower as a snowblower, a consumer has sued the 
company and is offered the chance to settle out of court. If 
the case goes to trial, the jury could award him either 
nothing or a large amount in damages. Construct a payoff 
table for this consumer, including probabilities and payoff 
amounts that you feel might be realistic. 

19.7 A Florida orange grower is considering the installa- 
tion of a heating system to protect his crop in the event of 
frost during the coming winter. What would be the rows 
and columns of a payoff table applicable to this decision? 



1 9.3 NON-BAYESIAN DECISION MAKING 



This section examines the first of two categories of decision criteria. Just as deci- 
sion rules are used in hypothesis testing and acceptance sampling, they also apply 
to the decisions in this chapter. In non-Bayesian decision making, the decision rule 
does not consider the respective probabilities of the states of nature. Three such 
decision criteria are discussed here. 



Maximin 

The maximin criterion specifies that we select the decision alternative having the 
highest minimum payoff. This is a conservative, pessimistic strategy that seems to 
assume that nature is "out to get us." In fact, the word maximin is equivalent to 
"maximize the minimum possible payoff." 

Table 19.2 applies the maximin criterion to the decision situation in Table 
19.1. For each alternative, we identify the worst-case scenario. If the firm contin- 
ues to make the cars in Germany, the worst state of nature that could occur is a 
weak dollar (profit = $10 million). If production is switched to South America, 



TABLE 19.2 



Using the maximin criterion, 
production of vehicles for 
the U.S. market would be 
switched to the South 
American plant. The max- 
imin criterion pessimistically 
assumes that the worst pos- 
sible state of nature will 
occur after the decision has 
been made. 



Alternative 



German production 

South American 
production 



State of Nature 
Compared to the Euro, 
the strength of the U.S. dollar will be 

Weak Moderate Strong 
(Prob. = 0.3) (Prob. = 0.5) (Prob. = 0.2) 



$10 


$15 


$25 


$20 


$18 


$16 



/linimum 
payoff 



>10 million 
>16 million - 



>16 million is the highest minimum payoff. 
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Alternative 



German production 

South American 
production 



State of Nature 
Compared to the Euro, 
the strength of the U.S. dollar will be 

Weak Moderate Strong 
(Prob. = 0.3) (Prob. = 0.5) (Prob. = 0.2) 



$10 


$15 


$25 


$20 


$18 


$16 



Maximum 
payoff 



$25 million 
$20 million 



$25 million is the highest maximum payoff. 



TABLE 19.3 



Using the maximax 
criterion, production of 
vehicles for the U.S. market 
would remain in Germany. 
This criterion is optimistic, 
since it assumes that the 
best possible state of nature 
will occur after the decision 
has been made. 



the worst possible scenario is a strong dollar (profit = $16 million). Since $16 
million is preferable to $10 million, the maximin criterion would have the firm 
switching production to South America. 

Maximax 

The maximax criterion specifies that we select the decision alternative having the 
highest maximum payoff. This is an optimistic strategy, making the assumption 
that nature will be generous to us. The word maximax is equivalent to "maximize 
the maximum possible payoff." 

Table 19.3 applies the maximax criterion to the decision situation in Table 19.1. 
For each alternative, the best-case scenario is identified. If the firm continues German 
production for the United States, the best state of nature that could occur is a strong 
dollar (profit = $25 million). If production is switched to South America, the best 
possible scenario is a weak dollar (profit = $20 million). Because $25 million is bet- 
ter than $20 million, the maximax criterion would specify that German production 
continue. 



Minimax Regret 

The minimax regret criterion is the equivalent of being paranoid about "Monday- 
morning quarterbacks" who, with the benefit of hindsight, come along later and 
tell you what you should have done. In using minimax regret, we must first con- 
struct a regret table. This is a table in which, for each state of nature, the entries 
are regrets, the differences between each payoff and the best possible payoff if 
that state of nature were to occur. The regret table is the same as the opportunity 
loss table, to be discussed in Section 19.5. 

Table 19.4 (page 742) is the regret table for the manufacturing decision de- 
scribed in Table 1 9 . 1 . If the firm knew for sure that the dollar was going to be mod- 
erately strong, it would switch production to the South American plant and realize 
a profit of $18 million. However, if this state of nature were to occur and the firm 
had continued German production, profits would only be $15 million. The entry 
in the first row, second column of the regret table is thus ($18 million — $15 million), 
or $3 million. Using the same logic, there will be zero regret if the dollar is 
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TABLE 19.4 



The regret table for the 
manufacturing plant deci- 
sion of Table 19.1. Using 
the minimax regret decision 
criterion, production will be 
switched to South America. 



Alternative 



State of Nature 
Compared to the Euro, 
the strength of the U.S. dollar will be 

Weak Moderate Strong 
(Prob. = 0.3) (Prob. = 0.5) (Prob. = 0.2) 



Maximum 
regret 



German production 

South American 
production 



$10 




$3 


$0 


$0 




$0 


$9 



>10 million 



9 million 



If the dollar is moderately strong, profit 
from German production ($15 million) 
will be $3 million less than what could 
have been made with South American 
production ($18 million). 

If the firm makes the cars in Germany 
and this state of nature occurs, the 
amount of the regret will be $3 million. 



$9 million is the lowest maximum regret. 



moderately strong and production is switched to the South American plant — for 
this state of nature, switching production out of Germany would be the better 
choice. 

The minimax regret criterion specifies selection of the alternative with the 
lowest maximum regret. In Table 19.4, the regret could be as high as $10 million 
if production stays in Germany, while the maximum regret would be just $9 mil- 
lion if the cars are made in South America. If the minimax regret criterion is used, 
the firm will move production to the South American plant. 




19.8 For Exercise 19.4, which decision alternative will be 
selected if the consultant uses the maximin criterion? The 
maximax criterion? The minimax regret criterion? 

19.9 Dave's local college football team is playing for the 
national championship in next month's bowl game, and 
he's thinking about having "We're number one" T-shirts 
imprinted in honor of the team. It's too late to get the 
shirts before game day, but if he orders today, he can get 
them within a few days after the game. The shirts must be 
ordered in multiples of 1000, and Dave has estimated the 



amount of profit he will make for possible outcomes of 
the game. Given the following payoff table, what will 
Dave do if he uses the maximin criterion? The maximax 
criterion? The minimax regret criterion? 

The Team 

Wins Loses Ties 

Order no shirts $0 $0 $0 

Order 1000 shirts $800 -$200 $200 

Order 2000 shirts $1600 -$400 $300 
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19.10 Three different designs are being considered for a 
new electronics product, and profits will depend on the 
combination of product design and market condition. The 
following payoff table summarizes the decision situation, 
with amounts in millions of dollars. What decision will be 
made using the maximin criterion? The maximax 
criterion? The minimax regret criterion? 



Market Condition 



I II III 


Design A 


$25 


$14 


$6 


Design B 


$15 


$16 


$18 


Design C 


$5 


$10 


$40 



19.11 For each of the following, indicate whether the be- 
havior represents the maximin criterion or the maximax 
criterion. Explain your reasoning. 

a. A trucker is behind schedule and knows he is 
approaching an area where police radar is often 
encountered, but he continues driving at 15 mph over 
the speed limit. 

b. A struggling retailer, in danger of defaulting on next 
month's payroll, hires a public relations firm to carry 
out an intensive promotional campaign for his store. 
If the campaign is not extremely successful, the retailer 
won't be able to pay the PR firm, either. 

c. The owner of a roller-skating rink buys liability 
insurance. 

19.12 When Fred takes the direct route to work, he gets 
there in 20 minutes if no train is at the railroad crossing. 



When a train is using the crossing, his travel time is 
15 minutes longer. His alternative is a route that takes 
30 minutes to travel, but has no traffic signals or other 
delays. Assuming that shorter transit times are more 
desirable, which route will Fred select if he uses the 
maximin criterion? The maximax criterion? The 
minimax regret criterion? 

19.13 An antique collector is shipping $50,000 worth of 
items from Europe to America and must decide whether to 
insure the shipment for its full value. An insurance firm will 
provide this protection for a fee of $500. What will be the 
collector's decision if she uses the maximin criterion? The 
maximax criterion? The minimax regret criterion? 

19.14 An auto manufacturer has preliminary plans to intro- 
duce a new diesel engine capable of extremely high fuel 
economy. The success of the engine depends on whether the 
U.S. Environmental Protection Agency goes ahead with 
stringent particulate emission limits that are scheduled for 
the coming model year. The payoff table, with amounts in 
millions of dollars for the corporation's profit, is shown 
below. What decision will be made using the maximin crite- 
rion? The maximax criterion? The minimax regret criterion? 

The strict emission limits are 
Enacted Not Enacted 

Continue engine 

development -$10 $30 

Halt engine 

development $15 $15 



BAYESIAN DECISION MAKING 1 9.4 



Whereas the decision criteria just discussed ignore the probabilities for the respec- 
tive states of nature, Bayesian decision making takes them into account. Specifi- 
cally, the alternative chosen is the one with the best expected payoff. 

Expected Payoff 

The expected payoff of a decision alternative is the sum of the products of the 
payoffs and the state of nature probabilities. Each decision alternative is treated 
as a discrete probability distribution with an expected value that can be calculated 
using the method introduced in Chapter 6: 

Expected payoff for a decision alternative: 

Expected payoff = 2p- • m i where p i = the probability that state of 

nature i will occur 
m i = the payoff if this alternative 
is selected and state of 
nature /' occurs 
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For the automobile production decision we've been following, the expected 
payoffs for the alternatives can be calculated as follows: 

German production: 

Expected payoff = 2p ; • m ; = 0.3(10) + 0.5(15) + 0.2(25) = $15.5 million 
South American production: 
Expected payoff = 2p ; • m i = 0.3(20) + 0.5(18) + 0.2(16) = $18.2 million 

As Table 19.5 shows, the alternative with the higher expected payoff is South 
American production, with an expected profit of $18.2 million. Although $18.2 
million is not one of the entries in the payoff table, this would be the result if 
this decision situation were to be replayed a great many times — on the average, 
the profit would be $18.2 million. Expected payoff is sometimes referred to as 
expected monetary value (EMV). 

Figure 19.2 shows the decision tree diagram corresponding to Table 19.5. In 
using the decision tree, calculations proceed from right to left (i.e., from future to 
present), and the expected payoff for each alternative is obtained as in the calcula- 
tions just presented. The "X" through the "German production" branch indicates 
that this alternative has an inferior expected payoff and would not be selected. 



The Expected Value of Perfect Information 

In decision making under uncertainty, it's often possible to invest in research that 
will give us a better idea as to which one of the states of nature will actually occur. 
For example, before deciding whether to continue making cars for the U.S. mar- 
ket in Germany, the automobile company might like to consult with outside eco- 
nomic experts and forecasters regarding the predicted strength of the U.S. dollar 
versus the Euro. 

When we are considering whether to collect additional information before 
making the decision, an important question arises: how much should we pay for 
such information? To help answer this question, we can determine the expected 



TABLE 19.5 



Using the expected payoff 
criterion, production will be 
switched to South America 
and the expected profit will 
be $18.2 million. 



Alternative 



German production 

South American 
production 



State of Nature 
Compared to the Euro, 
the strength of the U.S. dollar will be 

Weak Moderate Strong 
(Prob. = 0.3) (Prob. = 0.5) (Prob. = 0.2) 



$10 


$15 


$25 


$20 


$18 


$16 



Expected 
payoff 



S15.5 million 
;18.2 million- 



If production is switched to South America, the expected payoff is 

2p y - ■ m ( - = 0.3(20) + 0.5(18) + 0.2(16) = $18.2 million 
Using the expected payoff criterion, this alternative would be selected. 
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Decision 
alternative 



Probability of each 
state of nature 



Payoff 
(millions) 



0.3 Dollar is weak. 



Make in 
Germany 





0.2 Dollar is strong 



EMV= 0.3(10) + 0.5(15) + 0.2(25) 
15.5 million 



EMV= 0.3(20) + 0.5(18) + 0.2(16) 
$18.2 million 



0.3 Dollar is weak. 



Make in 
South America 




0.2 Dollar is strong. 



FIGURE 19.2 



The decision tree summary 
for the table and expected 
payoff calculations 
of Table 19.5. The 
"X" through the German 
production alternative 
reflects that its expected 
payoff makes it inferior to 
the other alternative. 



value of perfect information {EVPI). This is the difference between (1) the ex- 
pected payoff if we are guaranteed perfect knowledge about the future and (2) the 
expected payoff under uncertainty: 

The expected value of perfect information, EVPI: 

Expected payoff with Expected payoff with 
EVPI = p er fect information — present information 

For the auto company, the expected payoff with present information has al- 
ready been calculated as $18.2 million. As Table 19.5 showed, this is the expected 
payoff for the alternative with the greatest expected payoff — that is, switching 
production of the car to South America. 

If perfect information were available on the future relative strength of the U.S. 
dollar versus the Euro, the company could make its decision under certainty. It 
would know which state of nature was going to occur and would make the best 
choice for this state of nature. For example, if the firm had 100% confidence that 
the "weak dollar" state were going to occur, it would switch production to South 
America and have a profit of $20 million. 

In calculating the expected value of perfect information, we must first deter- 
mine the expected payoff with perfect information. The perfect information has 
not yet been received, so we operate under the assumption that it might predict 
any one of the three states of nature. The probabilities of these states of nature 
have been estimated as 0.3, 0.5, and 0.2: 

• Probability = 0.3 that perfect information will say "weak dollar. " In this case, the 
firm switches production to South America and makes a profit of $20 million. 



746 



Part 6: Special Topics 



FIGURE 19.3 



For the decision situation of 
Table 19.5, the expected 
value of perfect information 
is $1 .8 million. This is the 
upper limit the firm should 
consider spending for any 
information to reduce 
uncertainty. 



$20.0 



$18.2 



Expected payoff with perfect information 

0.3(20) + 0.5(18) + 0.2(25) = $20.0 million 

There is a 0.5 chance that the perfect information 
will say "moderately strong dollar." In this event, 
the firm will shift production to South America 
and make $18 million. 



Expected value of perfect information 

EVPI = $20.0 - $18.2 = $1.8 million 



Expected payoff with present information 

In the state of uncertainty (i.e., with present 
information), the expected payoff for the best 
alternative is $18.2 million for "switch production 
to South America." 



• Probability = 0.5 that perfect information will say "moderately strong dol- 
lar." In this case, the firm switches production to South America and makes a 
profit of $18 million. 

• Probability = 0.2 that perfect information will say "strong dollar." In this case, 
the firm continues German production and makes a profit of $25 million. 

Then the expected payoff with perfect information is 

0.3(20) + 0.5(18) + 0.2(25) = $20.0 million 

The expected payoff with present information has already been calculated as 
$18.2 million. This is the expected payoff for the alternative with the highest ex- 
pected payoff under uncertainty (i.e., switch production to South America). Now 
we can calculate the expected value of perfect information: 

EVP I = Expected payoff with _ Expected payoff with 
perfect information present information 

= 20.0 million - $18.2 million = $1.8 million 

As calculated here and shown in Figure 19.3, the expected value of perfect in- 
formation is $1.8 million. This is the upper limit the firm should consider spend- 
ing for the purpose of reducing uncertainty. 



exercises 





19.15 What is the most important difference between 
Bayesian and non-Bayesian decision criteria? 

19.16 A financial institution is evaluating the loan appli- 
cation of an entrepreneur who has come up with a revo- 



lutionary product for which he needs developmental 
funding. The entrepreneur has practically no assets for 
collateral, but offers the institution a portion of the profits 
if the product is successful. The loan officer has constructed 
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the following payoff table, with institution profits in 
thousands of dollars and the probabilities estimated for 
three states of nature. 

The product's level of success is 

Low Moderate High 
(0.1) (0.3) (0.6) 

Loan the full 



amount requested 


-$25 


$20 


$50 


Loan half the 








amount requested 


-$10 


$10 


$20 


Do not grant 








the loan 


$0 


$0 


$0 



Using the expected payoff criterion, which alternative 
will be selected and what is its expected monetary value? 
Determine the expected value of perfect information and 
interpret its meaning. 

19.17 In Exercise 19.4, if the probability that his return 
will be audited is 0.1, which alternative will the consultant 
choose if he maximizes expected monetary value? What is 
the most he should be willing to pay for more information 
regarding his likelihood of being audited? 

19.18 In Exercise 19.9, assume that the team has a 0.6 
probability of winning, a 0.3 probability of losing, and a 
0.1 probability for a tie game. Which alternative should 
Dave select if he wishes to maximize his expected payoff, 
and what is the expected value of perfect information? 

19.19 In Exercise 19.10, the probabilities for the three 
market conditions are estimated as 0.1, 0.6, and 0.3, 
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respectively. Which design should be selected in order to 
maximize the firm's expected profit? What is the most the 
firm should be willing to pay for a research study designed 
to reduce its uncertainty about market conditions? 

19.20 For the following payoff table, determine the 
expected value of perfect information. Does the result 
seem unusual? If so, why did this occur? 

State of Nature 

I (0.6) II (0.4) 

Alternative A $80 $30 

Alternative B $12 $20 

19.21 In Exercise 19.12, there is a 0.1 chance that 

a train will be using the railroad crossing. Given this in- 
formation, which route should Fred select, and what is 
the expected number of minutes his trip will require? 
What is the expected number of minutes Fred would save 
if he could predict with certainty whether a train will be 
at the crossing? 

19.22 In Exercise 19.13, if x is the probability that the 
shipment is lost, for what value of x will the alternatives 
have the same £MV? Assuming this value for x, what is 
the expected value of perfect information? 

19.23 In business research, it's extremely rare that any 
study will provide a perfect assessment of the future. 
Why then should we bother determining the expected 
value of perfect information when such information is 
practically never available? 



THE OPPORTUNITY LOSS APPROACH 19.5 



Another way to use probabilities in identifying the best alternative and finding the 
expected value of perfect information is to structure the decision in terms of op- 
portunity loss. Opportunity loss is another term for regret, discussed earlier in the 
chapter, and expected opportunity loss is calculated in a manner similar to that of 
expected payoff: 



Expected opportunity loss 


(EOL) for a decision alternative: 


EOL = ZPi • h 


where p t = the probability that state of 




nature i will occur 




l t = the opportunity loss if this 




alternative is selected and 




state of nature i occurs 
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TABLE 19.6 



The expected opportunity 
loss is the same as the 
expected value of perfect 
information. The expected 
payoff and expected oppor- 
tunity loss approaches will 
always lead to the same 
alternative being selected. 



Alternative 



German production 

South American 
production 



State of Nature 
Compared to the Euro, 
the strength of the U.S. dollar will be 

Weak Moderate Strong 
(Prob. = 0.3) (Prob. = 0.5) (Prob. = 0.2) 



Expected 
opportunity 
loss 



$10 


$3 


$0 


$0 


$0 


$9 



$4.5 million 
$1.8 million 



If production is switched to South America, the expected opportunity 
loss is 

2p ; -/ ( = 0.3(0) + 0.5(0) + 0.2(9) = $1.8 million 



The results of the opportunity loss approach will be the same as those ob- 
tained in the expected payoff method of the preceding section. In particular, 

1. the alternative having the maximum expected payoff will also have the 
minimum expected opportunity loss (EOL), and 

2. the expected opportunity loss for this alternative is the expected value of 
perfect information. 

Table 19.6 is the opportunity loss approach to the auto maker's decision situ- 
ation. Note that the table entries are the same as for the regret table of Table 19.4 — 
this is because opportunity loss and regret are the same thing. 

In Table 19.6, "South American production" has the minimum expected oppor- 
tunity loss, $1.8 million. This indicates that (1) "South American production" is the 
best alternative and (2) the absence of perfect information can be expected to cost 
the company $1.8 million. Again, for any decision situation the expected payoff 
and expected opportunity loss approaches will always lead to the same alternative 
being selected. Just as opportunity loss and regret are the same thing, expected 
opportunity loss and the expected value of perfect information are equivalent. 




19.24 In what way are the minimax regret criterion and 
the expected opportunity loss (EOL) criterion similar? In 
what ways do they differ? 

19.25 An investor intends to purchase stock in one of 
three companies. Company A is very sensitive to the over- 
all national economy, company C is relatively insensitive 
to the economic environment, and company B is about 
average. The investor has constructed the following pay- 
off table for his profit, in thousands of dollars. 



The state of the economy will be 





Weak 


Moderate 


Strong 




(0.2) 


(0.4) 


(0.4) 


Invest in 


-$40 


$30 


$60 


company A 








Invest in 


-$20 


$20 


$45 


company B 








Invest in 


$24 


$28 


$30 


company C 
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Using the opportunity loss approach, which alterna- 
tive will be selected, and what is its expected opportunity 
loss? What is the most the investor should be willing to 
pay for perfect information about the future state of the 
economy? 

19.26 In Exercise 19.10, with the probabilities for the 
three market conditions estimated as 0.1, 0.6, and 0.3, re- 
spectively, what is the expected opportunity loss associated 



with each of the three decision alternatives? In what way is 
the lowest of these expected opportunity losses related to 
the expected value of perfect information? 

19.27 In Exercise 19.12, with a probability of 0.1 that a 
train will be using the crossing, compute the expected op- 
portunity loss for each of Fred's alternatives. Explain the 
meaning of the EOL for each alternative. 



INCREMENTAL ANALYSIS AND 1 9.6 

INVENTORY DECISIONS 



In some cases, the number of decision alternatives and states of nature may be ex- 
tremely large. This can occur, for example, if a retailer can stock anywhere from 0 to 
1000 units and demand might also be anywhere from 0 to 1000. A payoff table 
would end up having 1000 rows and 1000 columns, and calculations to find the best 
decision alternative would be extremely tedious. In such applications, we can use in- 
cremental analysis, where units are sequentially considered, with each unit in the se- 
quence being stocked only if the expected payoff for stocking it exceeds the expected 
payoff for not stocking it. Incremental analysis involves the following key terms: 

MARGINAL PROFIT The profit if one more unit is stocked and sold. 
MARGINAL LOSS The cost if one more unit is stocked and not sold. 



Example 



Incremental Analysis 

A supermarket receives weekly milk shipments from its dairy products supplier. 
The weekly demand for milk can be approximated by a normal distribution hav- 
ing a mean of 2000 gallons and a standard deviation of 300 gallons. The super- 
market pays $2.10 per gallon and makes a profit of $0.50 on each gallon sold. At 
the end of the week, the supplier repurchases all unsold milk for $0.60 per gallon. 

SOLUTION 

A payoff table for this situation could have over 1000 rows and columns, with 
each one representing a single gallon of milk. Using incremental analysis, we can 
determine the alternative with the maximum expected payoff (or minimum ex- 
pected opportunity loss) much more easily. 

To show how it works, let's first consider whether the first gallon of milk 
should be stocked. If it's stocked and sold, the marginal profit is $0.50. If it's 
stocked and not sold, the marginal loss is $1.50. For a normal distribution with 
{jl = 2000 and a = 300, it's virtually certain that at least one gallon of milk will 
be sold. Table 19.7 (page 750) is a series of payoff tables for this decision setting, 
and part A shows that the expected payoff from stocking the first gallon easily 
exceeds the expected payoff if it is not stocked. 
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TABLE 19.7 



In incremental analysis, item 
/ is stocked if the expected 
payoff from stocking it ex- 
ceeds the expected payoff 
from not stocking it. 



A. Should the 1st Gallon of Milk Be Stocked? 

The week's demand will be 



Yes 



> 1 gallon <1 gallon 
(Prob. = 1.00) (Prob. = 0.00) 



Expected 
Payoff 



Stock the 1st gallon 
Don't stock it 



$0.50 
$0.00 



-$1.50 
$0.00 



$0.50 
$0.00 



B. Should the 2nd Gallon of Milk Be Stocked? 

The week's demand will be 



Yes 



> 2 gallons <2 gallons 
(Prob. = 1.00) (Prob. = 0.00) 



Stock the 2nd gallon $0.50 
Don't stock it $0.00 



-$1.50 
$0.00 



Expected 
Payoff 

$0.50 
$0.00 



C. Should the /'th Gallon of Milk Be Stocked? 



where p c 



Yes, [if] p = the probability that it will be sold is > p c 
Marginal loss $1.50 



Marginal profit + Marginal loss $0.50 + $1.50 
The week's demand will be 



0.75 



>/ gallons </ gallons 
(Prob. = p) (Prob. = 1 - p) 



Expected 
Payoff 



Stock the /th gallon $0.50 
Don't stock it $0.00 



-$1.50 $0.50p - $1.50(1 - p) 
$0.00 $0.00 



In part B of Table 19.7, we move on to consider the second gallon. As with the 
first gallon, the probability that demand will be at least two gallons is virtually 
certain. If the second gallon is stocked, the expected payoff will be $0.50 higher 
than if it is not stocked. 

Since it would be inefficient to continue evaluating one gallon at a time, let's 
introduce a time-saver. The key is the answer to the question, "What probability 
of selling one more gallon results in the expected payoff for stocking this gallon 
being at least as high as the expected payoff for not stocking it?" Given the 
marginal profit if the /th gallon is sold and the marginal loss if it is not, this prob- 
ability can be determined as follows: 

Critical probability for incremental analysis: 

Stock the /'th unit if Prob(demand & /') is at least p c , with 
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fill 

where p = the critical probability 



mp + m m p _ ^ e mar gj na | profit if the 

unit is stocked and sold 
ml = the marginal loss if the unit 
is stocked and not sold 

With marginal profit mp = $0.50 and marginal loss ml = $1.50, the critical 
probability for this decision situation will be 

ml $1.50 



Fc mp + ml $0.50 + $1.50 

Thus, as long as there is at least a 75% chance that the ith gallon will be sold, 
that gallon should be stocked. This can also be found by setting the expected pay- 
off in the first row of Table 19.7, part C, to be & 0, or 

$0.50p - $1.50(1 - p) > $0 
2.00p - 1.50 > 0 

and p must be s 0.75, the value of p c 

To find which gallon of milk corresponds to a 0.75 probability of being sold, 
we use the standard normal distribution as shown in Figure 19.4. The point at 
which the area to the right is 0.75 will be z = —0.67 standard deviation units, and 

x — ix n „ x gallons — 2000 gallons 

z = or -0.67 = — 



300 gallons 
and 

x = 1799 gallons 

The probability is 0.75 that the demand will be high enough to include the 
1799th gallon, and this is the quantity the supermarket should stock. Stocking the 
1800th gallon would not be advisable, since the probability of a demand at least 
this high would be slightly less than 0.75 and the expected payoff for stocking the 
1800th gallon would be less than the expected payoff for not stocking it. 



Demand: 
fl = 2000 gallons 



Area 


= 0.25 






v d— 300 gallons 








Area = 0.75 
1 






-0.67 


0.00 


(a units) 



1799 
T 



2000 



(Gallons demanded) 



For a left-tail area of 0.25, z must be -0.67. 
If the .vth gallon has a 75% chance of being sold, 
x is expressed by 

x - 2000 



z = x -^ or -0.67 = 
a 



300 



and x = 1799 gallons 



FIGURE 19.4 



The probability is 0.75 that 
demand will include the 
1799th gallon of milk. If the 
1800th gallon is stocked, 
there is a less than 75% 
chance it will be sold. 
Using incremental analysis, 
expected profit is maxi- 
mized if 1799 gallons are 
stocked. 
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19.28 A newsstand manager must decide how many daily 
papers to stock. Past experience has shown that demand 
can be approximated by a normal distribution with a 
mean of 400 papers and a standard deviation of 80 
papers. The manager pays 20 cents for each paper, sells it 
for 40 cents, and any papers not sold are picked up by the 
distributor, with the manager receiving 5 cents for each of 
them. If the manager wants to maximize his expected 
profit, how many newspapers should be stocked? 

19.29 The manager of a fish market pays 60 cents each 
for cod and sells them for $1.00 each. Fish left over at the 
end of the day are discarded. The daily demand can be 
approximated by a normal distribution having a mean of 
300 and a standard deviation of 50. To maximize his ex- 
pected profit, how many cod should the manager buy be- 
fore the market opens each day? 



19.30 For Exercise 19.29, how many cod should be pur- 
chased if leftover fish can be sold to a fertilizer company 
for 20 cents each? 

19.31 An athletic director must order football programs 
well in advance of game day. She pays $0.50 for each 
program, and the selling price to fans is $3.00. Programs 
not sold at the game can be sold to the college book- 
store for $0.10 each. Demand at the game is normally 
distributed, with /jl = 3000 and a = 500. To maximize 
expected payoff, how many programs should the athletic 
director order? 

19.32 Repeat Exercise 19.31, under the assumption that 
the bookstore manager has refused to purchase unsold 
programs for future games. 



19.7 SUMMARY 



• Structuring the decision situation 

The decision situation can be structured according to the decision alternatives and 
the possible states of nature. The states of nature are both exhaustive (one of them 
must occur) and mutually exclusive (only one can occur). The decision situation 
may involve risk, uncertainty, or ignorance, depending on how much is known 
about the states of nature and their exact probabilities. Most business decisions 
are made under uncertainty, where the states of nature are known, but their exact 
probabilities are not. 

The payoff table rows represent the decision alternatives, the columns show 
the states of nature, and the entries are payoffs for the various intersections of al- 
ternatives and states of nature. The decision situation can also be described by a 
decision tree, in which alternatives and states of nature are displayed in graphical 
form. Both the payoff table and the decision tree can include estimated probabil- 
ities for the states of nature. 

• Bayesian and non-Bayesian decision criteria 

In non-Bayesian decision making, there is no attempt to consider the probabilities 
of the respective states of nature. Three such decision rules are maximin, maxi- 
max, and minimax regret. Bayesian decision making takes these probabilities into 
account, and the alternative with the highest expected payoff is selected. If it is 
possible to obtain information that will reduce uncertainty, the expected value of 
perfect information (EVP I) can be calculated. This is the amount by which the ex- 
pected payoff could be improved if perfect information were available, and it is 
the most that should be paid for any information designed to reduce uncertainty. 

• Opportunity loss approach 

In the opportunity loss table, each entry is the difference between a payoff and the 
best possible payoff for that state of nature. The results of the opportunity loss 
approach are the same as for the expected payoff method, and the alternative 
with the minimum expected opportunity loss will have the maximum expected 
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payoff. Also, the minimum expected opportunity loss is equivalent to the 
expected value of perfect information. 

• Incremental stocking decisions 

For some situations, such as inventory stocking decisions, the number of alterna- 
tives and states of nature may be extremely large. In such cases, incremental anal- 
ysis can be applied. In this method, units are sequentially considered, with each 
unit in the sequence being stocked only if the expected payoff for stocking it 
exceeds the expected payoff for not stocking it. 



II 

"a.-.t* 


equation 




r 4t- 




,, ' 



Expected Payoff for a Decision Alternative 

Expected payoff = • m 



where p t = the probability that state of 
nature i will occur 
m, = the payoff if this alternative is 

selected and state of nature i occurs 



Expected Value of Perfect Information, EVPI 

_ Expected payoff with 
perfect information 



EVPI 



Expected payoff with 
present information 



Expected Opportunity Loss (EOL) for a Decision Alternative 

EOL = 2P; ' where p, = the probability that state of nature i will occur 
/ ; = the opportunity loss if this alternative is 
selected and state of nature i occurs 

Critical Probability for Incremental Analysis 

Stock the ilh unit if Prob(demand > i) is at least p c , with 

tnl 

p c = — — - where p c = the critical probability 

m ^ m mp = the marginal profit if the unit is stocked and sold 

ml = the marginal loss if the unit is stocked and not sold 




19.33 For a given perishable product, a retailer pays $5 
for each unit, then sells them for $10 each. At the end of 
the day, units not sold at the store are disposed of, and 
the retailer receives just $1 for each. Given the following 
probability distribution describing daily demand, how 
many units should be stocked? 



X 


Prob. 
(Demand = x) 


X 


Prob. 
(Demand = x) 


0 


0.01 


7 


0.18 


1 


0.02 


8 


0.14 


2 


0.04 


9 


0.09 


3 


0.07 


10 


0.05 


4 


0.09 


11 


0.02 


5 


0.12 


12 


0.01 


6 


0.15 


13 


0.01 



Em 



19.34 Dennis Dunkwright is a former NBA player and 
free spirit whose face and unusual antics are well known 
to young Americans across the land. A marketer of 
athletic shoes is considering the possibility of retaining 
Dennis to appear in a series of high-profile television ads 
over the coming fiscal year. If Dennis is associated with 
their company and stays out of legal and substance abuse 
difficulties during the year, the company estimates its 
profits will be $4.4 million for the year. However, if Den- 
nis runs afoul of the law or does anything else that might 
make the company look bad, the estimated profit would 
be just $0.5 million. If the company remains with its cur- 
rent spokesperson, who is lesser known but more stable, 
it estimates its annual profit for the year to be $1.2 
million. If the company believes there to be a 0.20 proba- 
bility that Dennis will get into trouble during the year, 
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what strategy should be selected if the company wishes 
to maximize its expected profit? What would be the 
expected value of a perfect forecast for the future behav- 
ior of their potential new spokesperson? 

19.35 In Exercise 19.34, what strategy would the 
company select if it were using the maximin criterion? 
The maximax criterion? The minimax regret criterion? 

19.36 An investor is trying to decide between two 
promising technology stocks to round out his portfolio. 
The remainder of his portfolio consists of relatively con- 
servative companies and will not enter into his decision as 
to which technology company will be selected. Disk Worth 
is a well-established company engaged in the design and 
manufacture of data storage disks, while ComTranDat is 
a start-up firm in the midst of developing what could be a 
wireless communication and data transmission system su- 
perior to those currently used or under development by 
the leaders in its industry. Depending on whether the tech- 
nology sector weakens, remains the same, or strengthens 
over the next 10 years, the investor estimates that his 
profits with DiskWorth would be $20,000, $30,000, and 
$40,000, for the respective scenarios. The corresponding 
profits for ComTranDat are estimated to be —$20,000, 
$10,000, and $90,000, respectively. If the technology sec- 
tor has a 20% probability of weakening over the next 

10 years, a 50% probability of remaining the same, and 
a 30% probability of strengthening, which stock purchase 
should the investor make if he wishes to maximize his ex- 
pected profits over the 10-year period? What is the most 



he should be willing to pay for a perfect forecast for the 
future strength of the technology sector? 

19.37 In Exercise 19.36, which company would the 
investor select if it were using the maximin criterion? The 
maximax criterion? The minimax regret criterion? 

19.38 A ski resort operator must decide before the winter 
season whether he will lease a snow-making machine. If he 
has no machine, he will make $20,000 if the winter is 
mild, $30,000 if it is typical, and $50,000 if the winter is 
severe. If he decides to lease the machine, his profits for 
these conditions will be $30,000, $35,000, and $40,000, 
respectively. The probability of a mild winter is 0.3, with a 
0.5 chance of a typical winter and a 0.2 chance of a severe 
winter. If the operator wants to maximize his expected 
profit, should he lease the machine? What is the most he 
should be willing to pay for a perfect forecast? 

19.39 In Exercise 19.38, the operator says that he has no 
idea regarding probabilities for the various weather con- 
ditions for the coming winter. What are some other crite- 
ria by which he might reach a decision, and what decision 
would be reached using each of them? 

19.40 A driver in the Indianapolis 500 auto race has the lead 
but is nearly out of fuel with eight laps remaining in the race. 
The second-place car is just 10 seconds behind and has 
plenty of fuel. The driver in the lead car elects to try for the 
finish line without making a fuel stop. Does this behavior 
seem to be a maximin choice or a maximax choice? Explain. 



integrated case 




— ' 



Thorndike Sports Equipment (Video Unit Seven) 



For several years, Thorndike Sports Equipment has been 
one of the suppliers of refreshments to the Titans of Tennis 
national tournament. Luke Thorndike doesn't really like to 
call the company's product a "refreshment," since it's been 
scientifically designed to replenish the fluids and vital body 
chemicals lost through perspiration and exercise. Accord- 
ing to the elder Thorndike, Thornado is not "just another 
sugar-fluffy soft drink. It helps athletes perform at their 
best." 

Naturally, each competing athlete is given a compli- 
mentary carton of Thornado. However, the Thorndikes 
must decide how many cans of the product to send to the 
tournament for fan consumption. The Thorndikes make a 
profit of $0.25 on each can of Thornado purchased at the 
tournament. Each can that is not purchased is distributed 



to local sports clubs in the tournament's host city. Luke 
Thorndike believes in helping those in need, but is quick to 
point out that it costs him a total of $0.80 to produce and 
distribute each of these unsold cans of Thornado. 

Luke believes in charity, but he is also rather frugal in 
making it available. He's asked Ted to figure out how many 
cans of Thornado to send to the Titans of Tennis tourna- 
ment in order to make as much profit as possible. Ted 
Thorndike, having analyzed past sales patterns for Thor- 
nado at this tournament, has found annual tournament 
sales of the product to be closely approximated by a nor- 
mal distribution with fjL = 4500 cans and cr = 300 cans. In 
order to satisfy his grandfather's dictate that profit be max- 
imized, how many cans of Thornado should Ted plan on 
shipping to the Titans of Tennis? 



"The Toyota Way" 

Not very many automobile companies have books written about them; and when 
books are written, they are often not very complimentary about either the company 
or its vehicular products. A notable and highly recommended exception is a book 
titled The Toyota Way, by Dr. Jeffrey K. Liker. Whether you read the book or simply 
Google it, I hope you will gain ideas and inspiration that will benefit your life, your 
company or organization, and your career. The same hopes are sincerely expressed for 
your reading of this chapter and the information and motivation it's intended to pro- 
vide. The spirit of this chapter and the key to the respect Toyota has earned are 
perhaps best expressed in this quote from the foreword to the book, by Gary Convis, 
Managing Officer and President of Toyota Motor Manufacturing, Kentucky: 

The Toyota Way can be briefly summarized through the two pillars that support it: 
"Continuous Improvement" and "Respect for People. " Continuous improvement, often 
called kaizen, defines Toyota's basic approach to doing business. Challenge everything. 
More important than the actual improvements that individuals contribute, the true 
value of continuous improvement is in creating an atmosphere of continuous learning 
and an environment that not only accepts, but actually embraces change. 

As you know from your personal life, getting something right the first time is a lot 
more effective, satisfying, and kind to your blood pressure than often-frustrating at- 
tempts to fix it afterward. Likewise, there are few things that we or our organization 
could not do better if we continually made even small, incremental 
improvements in the process through which we 
do them — this is the "kaizen" con- „,,iresatten tion 
cept mentioned above, and it will Quality 
be among the topics discussed 
within the chapter. If your course 
schedule is too tight for this chapter 
to be included in your syllabus, I 
encourage you to at least skim it, and 
I sincerely believe that doing so and 
embracing its spirit will help you greatly, 
both personally and professionally. 



Source: Quoted materials from Liker, Jeffrey K., 

The Toyota Way (New York: McGraw-Hill), 2004, p. xi. 
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learning 
objectives 

After reading this 
chapter, you should 
be able to: 



Discuss the concept of total quality management. 

Understand the nature and importance of the process orientation that is 
central to total quality management. 

Differentiate between defect prevention and defect detection strategies for 
the management of quality. 

Discuss the Kaizen concept of ongoing improvement. 

Explain the difference between random and assignable process variation. 

Describe, in general, how random and assignable process variation can each be 
identified and reduced. 

Explain what is meant by statistical process control and describe how it 
contributes to the quality of manufactured products. 

Construct and interpret statistical process control charts for variables and 
attributes. 



20.1 INTRODUCTION 



The quest for quality is probably more widespread and more intense than at any 
other time in our history. This pursuit is shared by producers and consumers 
alike, and is fueled by many factors, including world competition, consumer ex- 
pectations, technological advances, and evolving techniques and philosophies for 
quality measurement and enhancement. More than ever before, the successful 
enterprise must focus on the quality of its product offerings. 



What Is Quality? 

If we look up "quality" in the dictionary, we will find it defined by conceptual 
descriptors such as "excellence" or "first-class." However, for the firm offering 
products to an increasingly demanding marketplace, the most important defini- 
tion of quality is the one held by the customers the firm is trying to reach. 

Products will differ greatly in terms of the quality dimensions that are most 
important to their consumers. People who buy roofing nails will tend to be most 
interested in the performance and durability dimensions, while buyers of fashion 
attire may place more value on the aesthetic and perceived-quality dimensions. 
Regardless of the product, it is increasingly important that all activities, from 
original design, to production, to after-sales service, have quality to the consumer 
as a central focus. 



Why Is Quality Important? 

Perhaps the most obvious reason quality is important to producers is the fact that 
quality is important to consumers; a producer who has been abandoned by his or 
her customers will not stay in business very long. Quality is also important 
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because quality — and the lack of it — both cost money. For this reason, we can 
also look at the importance of quality in terms of two kinds of costs: 1 

1. Costs of preventing poor quality. These are also known as costs of confor- 
mance, and they are associated with making the product meet requirements or 
expectations. 

Some of the activities related to this objective include worker training and 
involvement, coordination with suppliers, customer surveys, and revisions to 
product design and the production process. 

2. Costs that occur as the result of poor quality. These are also known as costs 
of nonconformance, and they are associated with the failure to meet require- 
ments or expectations. 

Defective products may need to be scrapped, reworked, repaired, and 
retested, any of which can be both costly and disruptive to the production pro- 
cess. If defective products end up being purchased and used by consumers, this 
can raise warranty costs, lead to reduced goodwill and lost business, require 
image-damaging product recalls, and even result in expensive product liability 
lawsuits. 

A large and growing school of thought argues that even great amounts of 
money spent on preventing poor quality are a bargain when compared to the 
enormous visible and invisible costs that can occur when poor quality is allowed 
to occur. Today's trend is heavily toward the "stitch in time" (spending on confor- 
mance) to "save the nine" (spending as a result of nonconformance). 

Process Variation and Product Quality 

The quality of products is closely related to variation in the process that has pro- 
duced them. As in dart throwing and bowling, variation is inherent in manufac- 
turing and other processes. The process (and the resulting product) will tend to be 
at least a little different from one unit of output to the next. There are two sources 
of process variation: 

1. Random variation occurs as the result of chance. This type of variation is in- 
herent in practically any process, and random variation in the process leads to 
random variation in the product. For example, each of a truckload of window 
air conditioners may have been designed to cool at the rate of 5000 BTU/hour. 
However, some may be capable of 5100 BTU/hour while others might reach 
only 4900. The pitch of the fan blades, the friction within the electric motor, 
the fit of the insulation, and the amount of system refrigerant injected are just 
a few of the process parameters that are not perfectly identical from one assem- 
bled unit to the next. Whether we're producing air conditioners, steel rods, or 
ball bearings, the output, tensile strength, or diameter will naturally tend to ex- 
hibit at least minor variations during the production run. This type of variation 
is not generally considered to be under the control of the individual worker. 
Industry often refers to random variation as common-cause variation. 

Random variation, while inherent, can often be reduced by using more 
expensive machines or materials. As long as the amount of random variation 
is tolerable, with the end product falling within acceptable specifications, 
neither its presence nor its sources are causes of alarm. 

'Norman Gaither, Production and Operations Management 6/e. Fort Worth: The Dryden Press, 1994, 
ch. 17. 
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2. Assignable variation occurs due to identifiable causes that have changed the pro- 
cess. For example, a worker might commit an error, a tool may break, a machine 
may be "drifting" out of adjustment, operators may become fatigued toward 
the end of the work day, contact surfaces may be getting worn, or an extrusion 
machine may accumulate deposits during its period of operation. Some even feel 
it's unwise to purchase an automobile that was made on a Friday or a Monday, 
since substitute workers may be filling in for some of the more highly skilled in- 
dividuals who decided to enjoy an extended weekend. This type of variation is 
generally considered to be under the control of the individual worker. Industry 
often refers to assignable variation as special-cause variation. 

Compared to random variation, assignable variation tends to be direc- 
tional. For example, as a cutting tool wears down, the thickness of the bars, 
rods, or disks being cut will increase over time. While the exact causes of ran- 
dom variation can be very expensive to track down and reduce, assignable 
variation is both more easily identified and more efficiently corrected. 

One of the great challenges in industry is to be able to tell when process vari- 
ation is random and when it is assignable; then, if it is assignable, to do something 
about it. Assignable variation is the primary concern of much of the chapter. 




20.1 Given that a production process may experience 
both random and assignable variation, for which type 
of variation is the source generally easier to identify 
and eliminate? Why? 

20.2 Differentiate between conformance and nonconfor- 
mance in product quality. What are some of the costs 
associated with each? 



20.3 In a manufacturing process, what is random varia- 
tion? If cereal boxes are being filled by a machine, 
describe how random variation might affect the content 
weight of the individual boxes. 

20.4 Repeat Exercise 20.3, but substitute "assignable 
variation" for "random variation." 



20.2 A HISTORICAL PERSPECTIVE AND DEFECT DETECTION 



Human concern with quality is at least as old as the wheel, since the discovery of 
this useful object was surely followed by the realization that it would work better if 
it were more perfectly round. As wheels and many other products came to be fab- 
ricated, control over their quality was typically the responsibility of the individual 
craftsman, and it was not unusual for each individual item to be the subject of its 
maker's scrutiny. 

As our economy became more mass-production oriented, such individual in- 
spection became less practical, and the 1920s marked the beginning of what is 
now known as statistical quality control, in which quality is monitored on the 
basis of samples of items taken from the production line. Two techniques intro- 
duced during this era are especially noteworthy: 

1. The Statistical Process Control (SPC) Chart Developed by Walter Shewhart, 
this is a graphical procedure for determining whether a production process is 
operating properly. In general, as long as assignable variation is absent and 
random variation remains within acceptable limits, the process is deemed to be 
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stable, or in statistical control. A production process that is unstable will tend 
to generate products that are of unpredictable and/or unsatisfactory quality. 

By monitoring the stability of the process as products are being made, the 
SPC method represents a defect-prevention strategy for quality management. 
Because it is employed during the manufacture of a stream of components or 
products, this method enables midstream corrections to be made if the process 
should begin to "jump" or "drift" due to an assignable cause. Today's quality 
orientation is heavily toward defect prevention, and the SPC method enjoys 
more popularity than ever. Statistical process control is discussed further in 
Sections 20.6-20.9. 

2. Acceptance Sampling Developed largely through the efforts of Harold Dodge 
and Harry Romig, this is a set of procedures whereby, based on a random sam- 
pling of the items within, an entire production lot or shipment is (1) accepted, 
(2) rejected, or (3) subjected to further sampling before the final decision to ac- 
cept or reject. 

Acceptance sampling involves inspection of a sample of products after an 
entire production run or shipment lot has been produced. As such, it repre- 
sents a defect-detection approach to quality management. The methods of 
acceptance sampling remain useful and continue to be applied. 

Although both statistical process control and acceptance sampling make in- 
ferences on the basis of samples, there is a very important difference in their pur- 
poses: (1) SPC uses the sampled products to make inferences about the process 
from which they came, while (2) acceptance sampling uses sampled products to 
make inferences about a larger population of the products. 

Although statistical process control methods have much to contribute to defect 
prevention, they had the misfortune of appearing during an era that was dominated 
by defect detection, with its requisite array of quality inspectors in white coats whose 
responsibility was to ferret out defective products and prevent them from leaving the 
factory. For some companies, that era may have not yet ended, as they continue to 
place nearly sole reliance on weeding out defective products instead of trying to im- 
prove the design and production process that allowed the products to be defective in 
the first place. 

While well intentioned, the traditional philosophy of defect detection has gen- 
erally involved a corporate atmosphere in which quality is a separate entity within 
the firm. Not coincidentally, suppliers and workers have tended to be viewed 
as adversaries instead of partners. Those who aspire to global competitiveness 
and world-class quality now realize that quality can no longer be considered as an 
isolated function — rather, it must be integrated throughout the organizational 
structure and be made central to corporate thinking in all matters. 



exercises 




20.5 Differentiate between statistical process control and 
acceptance sampling. What is the purpose of each? 

20.6 Statistical process control and acceptance sampling 
both rely on a sampling of products to make inferences. 
In what important way do the types of inferences 
differ? 



20.7 Contrast the concepts of defect prevention and de- 
fect detection as they might apply to each of the follow- 
ing "defects" in society: 

a. automobile accidents 

b. juvenile delinquency 

c. alcoholism 
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20.3 THE EMERGENCE OF TOTAL QUALITY MANAGEMENT 



The 1950s saw the beginnings of a movement toward total quality manage- 
ment (TQM), a philosophy that (1) integrates quality into all facets of the organi- 
zation and (2) focuses on improving the quality of the product by systematically 
improving the process through which the product is designed, produced, deliv- 
ered, and supported. The conviction of its believers and the success of its adopters 
combined to intensify the momentum of the TQM movement. 

TQM Pioneers 

W. Edwards Deming is the best known and probably most influential pioneer of 
TQM. During the years immediately following "World War II, industry in the 
United States was prospering and Dr. Deming's ideas did not attract an audience 
in this country. On the other hand, Japan — with "made in Japan" perceived by 
many as "cheap," "shoddy," and "junk" — was anxious for any help it could get. 
In 1950 the Japanese invited Dr. Deming to visit their country and discuss the 
quality issue. He visited, they listened, and the quality of Japanese products was 
on its way to becoming the envy of the world. It was not until the 1980s that 
American companies began giving serious attention to Deming's ideas, and some 
observers would argue that they did so just in the nick of time. 

One of Dr. Deming's earliest contributions was the Deming cycle, also known 
as the Deming wheel and the PDCA (Plan-Do-Check -Act) cycle. Based on an ear- 
lier model called the Shewhart cycle, it stresses the importance of continuous and 
systematic quality-improvement efforts at all levels of the organization: 2 

Plan Develop a plan for a possible improvement to the product or the 
process through which the product is made. 

Do Apply the potential improvement in a small-scale test. 

Check Evaluate the change to see whether it was successful. 

Act Based on the results: (1) adopt the plan, (2) abandon the plan, or (3) use 
the information gained to revise the plan and go through the cycle 
again. 

Among the other key pioneers of the TQM movement are Joseph M. Juran, 
Kaoru Ishikawa, Phillip B. Crosby, and Armand V. Feigenbaum. 3 Like Deming, 
Juran was instrumental in helping the postwar Japanese improve their product 
quality. Juran also authored the Quality Control Handbook. Ishikawa, author of 
Guide to Quality Control, was an innovator who developed the cause-and-effect 
diagram and the quality circle, concepts that are discussed later in the chapter. 
Crosby is the author of Quality Is Free, in which he argues that poor quality is so 
expensive (see Section 20.1 and the costs of nonconformance) to the firm that 
even great amounts of money could profitably be spent on moving toward the 
goal of zero defects and doing it right the first time. Feigenbaum's book, Total 
Quality Control, stresses a concept called quality at the source, where responsibi- 
lity for quality is with the individual worker. In this concept, production workers 
can stop production whenever they spot quality problems. 

2 Henry R. Neaves, The Deming Dimension. Knoxville: SPC Press, 1990, p. 142. 

3 Norman Gaither, Production and Operations Management 6/e. Fort Worth: The Dryden Press, 1994, 

ch. 17. 
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FIGURE 20.1 



Total quality management 
stresses the importance of 
understanding and reduc- 
ing variation within the 
process by which a product 
is generated. 



Source: R. Snee, "Statistical Thinking and Its Contribution to Total Quality," American Statistician, Vol. 44, 
pp. 116-121. 



The Process Orientation 

Where traditional quality control tended to be result oriented, with primary em- 
phasis on the end product and defect detection, total quality management is pro- 
cess oriented, stressing the prevention of defects through continual attention to, 
and improvements in, the processes through which the product is designed, pro- 
duced, delivered, and supported. Figure 20.1 illustrates the emphasis that TQM 
places on understanding and reducing the sources of process variation that were 
discussed earlier. (Note: In TQM, the "satisfied customer" of Figure 20.1 is not 
necessarily the final retail customer — it could be the next workstation or proce- 
dure in the manufacturing sequence.) 

Six Sigma and Other Quality Systems 

TQM is being complemented and supplemented by other approaches to quality 
improvement. Perhaps the most popular and best known of these is the Six Sigma 
approach that was developed at Motorola in the 1980s. The name is inspired by 
the normal distribution and the incredible area beneath the curve between the 
—6a and the +6a locations. The remaining area is so miniscule that it reflects 
only 3.4 defects per million products or opportunities — the result is a process or 
environment that is practically defect-free. 

According to Greg Brue, author of Six Sigma for Managers, Six Sigma has a 
strong profitability and project/problem orientation, and involves five phases: 

(1) define the project or goal and the internal and external customers involved; 

(2) measure the current performance of the process; (3) analyze the root causes of 
the defects generated by the process; (4) improve the process so as to eliminate the 
defects; and (5) control the future performance of the process. There is also a strong 
orientation to using available data and the statistical tools for their analysis. 4 

A detailed comparison of current systems, whether the Toyota Production 
System (TPS), TQM, or Six Sigma, would lie beyond the scope of this text. Each 

4 Greg Brue, Six Sigma for Managers, McGraw-Hill, 1988, pp. 6-8. 
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system has its proponents and detractors, but they all have the same admirable 
goal of improving the processes involved in the quality of the goods and services 
we purchase, and they tend to utilize many of the same statistical tools. See the ref- 
erences below, or Google "Six Sigma versus TQM" for more information. 5 In the 
meantime, our emphasis will continue to be on the traditional TQM methods and 
on the spirit that tends to characterize the chapter-opening vignette, which features 
a corporation whose quality is among the most respected in all of industry. 



exercises 




20.8 What is the Deming PDCA cycle, and how is it 
applicable to total quality management? 

20.9 What is the basic difference between total quality 
management and the traditional approach to quality 
management? 



20.10 What is meant by the statement that total quality 
management has a "process orientation"? 

20.11 What are the major phases involved in the Six 
Sigma quality system? 



20.4 PRACTICING TOTAL QUALITY MANAGEMENT 



As both concept and practice, total quality management is rapidly evolving. As 
such, it is increasingly difficult to differentiate between ( 1 ) strategies and practices 
that comprise TQM and (2) strategies and practices that are conducive to TQM. 
Accordingly, the topics that follow may contain elements of each. 

"Kaizen" and TQM 

If total quality management had to be described with a single word, perhaps the 
best choice would be the word "improvement." There is a Japanese counterpart 
called "Kaizen." In his 1986 book, Kaizen, author Masaaki Imai further describes 
the concept that Kaizen represents: 

The essence of Kaizen is simple and straightforward: Kaizen means improvement. 
Moreover, Kaizen means ongoing improvement involving everyone, including 
both managers and workers. The Kaizen philosophy assumes that our way 
of life — be it our working life, our social life, or our home life — deserves to be 

constantly improved The message of the Kaizen strategy is that not a day 

should go by without some kind of improvement being made somewhere in the 
company. 6 

Imai further describes Kaizen as a philosophy that focuses on small, ongoing 
improvements rather than relying on large-scale (and, typically, more abrupt) in- 
novations, such as might come from advances in technology or equipment. Ac- 
cording to Imai, Western management has historically relied much more heavily 
on the latter than on the former, but that is being reversed as more companies 
move toward total quality management. 

5 In addition to the work cited in reference 4, see Jeffrey K. Liker, The Toyota Way, pp. 135, 252-253, 
and 295-297; and Mekongcapital.com, "Introduction to Six Sigma," October 15, 2004. 
6 Masaaki Imai, Kaizen. New York: McGraw-Hill, 1986, pp. 3, 6. 
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1 . "Create constancy of purpose for continual improvement of products and 
service to society 

2. "Adopt the new philosophy .... We can no longer live with commonly accepted 
levels of delays, mistakes, defective materials, and defective workmanship." 

3. "Eliminate the need for mass inspection as the way of life to achieve quality by 
building quality into the product in the first place." 

4. "End the practice of awarding business solely on the basis of price tag. Instead, 
require meaningful measures of quality along with price." 

5. "Improve constantly and forever every process for planning, production, 
and service." 

6. "Institute modern methods of training on the job for all, including management, 
to make better use of every employee." 

7. "Adopt and institute leadership aimed at helping people to do a better job." 

8. "Encourage effective two-way communication and other means to drive out 
fear throughout the organization so that everybody may work effectively and 
more productively for the company." 

9. "Break down barriers between departments and staff areas." 

10. "Eliminate the use of slogans, posters and exhortations for the work force, 
demanding Zero Defects and new levels of productivity without providing methods." 

1 1 . "Eliminate work standards that prescribe quotas for the workforce and numerical 
goals for people in management .... Substitute aids and helpful leadership in 
order to achieve continual improvement of quality and productivity. " 

12. "Remove the barriers that rob hourly workers, and people in management, of 
their right to pride of workmanship." 

13. "Institute a vigorous program of education, and encourage self-improvement for 
everyone. What an organization needs is not just good people; it needs people 
that are improving with education. Advances in competitive position will have 
their roots in knowledge." 

14. "Clearly define top management's permanent commitment to ever-improving 
quality and productivity, and their obligation to implement all of these principles." 

Source: W. Edwards Deming, in Henry R. Neaves, The Deming Dimension, Knoxville: SPC Press, 1990, 
pp. 287-413. 



TABLE 20.1 



Deming's 14 points are at 
the core of his teachings on 
quality improvement. 



Deming's 14 Points 

At the core of Dr. Deming's teachings are a set of guidelines that are known as 
"Deming's 14 points." There were fewer in the 1950s, when he began spreading 
his message, and the number gradually evolved to the present 14. Table 20.1 gives 
a condensed version of the points — a complete volume would be necessary to do 
true justice to all of them, and the interested reader may wish to refer to Henry R. 
Neaves' The Deming Dimension (1990) and William W. Scherkenbach's The 
Deming Route to Quality and Productivity (1988). Each book includes a fore- 
word by Deming and a complete chapter on each one of his 14 points. 7 

Deming was adamant that the interdependence of the 14 points is such that all 
of them must be adopted as a package if success is to be achieved. The 14 points 
have an underlying theme of cooperation, teamwork, and the belief that workers 
truly want to perform their jobs in a quality fashion. 

7 Neaves, The Deming Dimension, and William W. Scherkenbach, The Deming Route to Quality and 
Productivity. Washington: CEEPress Books, 1988. 
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The Quality Audit 

It is not unusual today for a company to carry out a quality audit of prospective 
suppliers in addition to evaluating its own quality efforts. According to Richard J. 
Schonberger, author of World Class Manufacturing, the most important element in 
an audit of suppliers is process capability — that is, the prospective supplier must 
provide evidence that it can meet the required product specifications. Evidence to 
this effect sometimes takes the form of studies involving statistical process control 
charts, and the process must be shown (1) to be in statistical control, with 
assignable process variation absent, and only random variation remaining, and 
(2) to have random variation that does not exceed the limits allowed by the prod- 
uct specifications. Statistical process control is discussed in Sections 20.6-20.9. 

In addition to evaluating process capability, quality audits may involve the eval- 
uation of other areas, such as employee quality training, the extent and nature of 
quality improvement efforts, the extent to which the firm applies statistical tools for 
TQM (discussed in Sections 20.5-20.9), and has an award structure to recognize 
and reward employees who make special contributions to quality improvement. On 
a more general level, the goal of a quality audit might simply be described as the 
assessment of the firm's strategies and successes regarding defect prevention. 

Competitive Benchmarking 

One popular approach to quality improvement is to engage in benchmarking. 
This involves studying, and attempting to emulate, the strategies and practices of 
organizations already known to generate world-class products and services. De- 
pending on the specific type of excellence in which a firm happens to be interested, 
there may be one or several companies recognized as quality leaders. 

Here are just a few examples: Institutions of higher education may wish to 
compare themselves against leading institutions within their category as listed in 
the annual America's Best Colleges and America's Best Graduate Schools publica- 
tions from U.S. News & World Report. Companies wishing to improve their 
performance in employee enthusiasm and training would do well to compare 
themselves to the Walt Disney Company, and those engaged in Internet and mail- 
order activities might wish to study the excellence long exhibited by L.L. Bean. 

Just-in-Time Manufacturing 

Just-in-time (JIT) manufacturing systems operate with a minimum of inventory 
on hand, with materials and components literally arriving "just in time" to enter 
the production process. Some have the misconception that JIT is simply an inven- 
tory philosophy — actually, it is a production philosophy that happens to have an 
impact on inventory. Unlike systems where there are large inventory stocks, when 
a problem occurs in the JIT system, production is stopped until the problem is 
fixed. This approach fits in well with the philosophy held by Deming and others 
that the consumer must always be considered as the very next person or worksta- 
tion in the manufacturing sequence. 

Although JIT systems save money by reducing in-process inventory costs, 
there are quality benefits as well. Workers and suppliers become very conscious of 
product quality — with zero or minimal parts on hand to buffer irregularities in 
the production pace, the worker, machine, or supplier generating poor quality has 
nowhere to hide. Because production comes to a halt until quality has been 





Quality Circles and the "Suggestion Box' 



The typical "suggestion box" has included ideas that im- 
proved quality or productivity, ideas that were impractical 
to implement, and ideas that would have been physiologi- 
cally impossible for the boss to perform. Although the box 
has not completely disappeared, its more useful functions 
have been taken over, and improved, by the quality circle. 

The quality circle helps bring workers into the task of im- 
proving and maintaining product quality. An especially val- 
ued technique in Japan, the quality circle consists of about a 
dozen workers from the same department. Typically volun- 
teers, they meet periodically to discuss issues and ideas per- 
taining to the quality of both their workplace and its output. 

Ideas generated by the quality circle contribute to the 
reduction of both random and assignable variation in a 
process. Although they possess neither MBA nor engineering 



degrees, workers are closer to the production process than 
anyone else. When an intelligent person spends this much 
time in proximity to a machine or a process, familiarity with 
its capabilities and quirks is bound to develop, and it isn't 
surprising that workers can often recognize or solve prob- 
lems that have evaded the efforts of those who are more 
highly educated in the conventional sense. 

The quality circle is more than just a talk session held twice 
a month. The group may initiate specific studies and have ac- 
cess to company records pertinent to the problem at hand. 
This gives the worker both enhanced importance and an en- 
hanced sense of importance. The quality circle helps raise 
workers from the simple status of "hired help" to a position 
where they are valued members of a quality team that has the 
enthusiastic support of management at the highest levels. 



restored, any problem that arises will be the object of serious efforts not only to 
momentarily solve it, but to permanently eliminate it. JIT also encourages small 
production lots, and mistakes that do occur are more quickly identified and cor- 
rected than in conventional systems that have large production runs before ship- 
ment or transfer to either the final customer or the next workstation. Through 
its effectiveness in preventing defects, the JIT system helps reduce the costs of 
nonconformance discussed in Section 20.1. 



Worker Empowerment 

Also called quality at the source, worker empowerment ( 1 ) makes the individual 
worker responsible for the quality of output from his or her own workstation, 
and (2) provides the worker with the authority to make changes deemed neces- 
sary for quality improvement. When workers have both responsibility and au- 
thority for quality, they can apply the Deming PDCA cycle at their own level, and 
statistical tools like those discussed in Sections 20.5-20.9 may be used or adapted 
for the worker's use at the workstation. The worker also has the authority to halt 
production when he spots a quality problem either in his own output or in com- 
ponents coming from a workstation earlier in the production sequence. Quality 
circles (see Statistics in Action 20.1) often play an integral role in the application 
of the worker empowerment philosophy. 

Awards and the Recognition of Quality 

Besides enjoying the more direct benefits from offering world-class products and 
services, companies practicing TQM are increasingly being motivated by the chal- 
lenge of being recognized for their quality achievements. One of the first such 
awards was the Deming Prize, established by the Japanese government in 1951 
and awarded annually to a Japanese company that has distinguished itself in the 
quality-management arena. The Malcolm Baldrige National Quality Award is 
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awarded annually to U.S. companies that have demonstrated excellence in qual- 
ity achievement and management, with award categories including manufactur- 
ing, service, and small businesses. Rochester Institute of Technology and USA 
Today established the RIT/U5A Today Quality Cup in 1992, annually recogniz- 
ing quality teams or individuals in five categories: manufacturing, service, small 
organizations, government, and not-for-profit. 




20.12 What is "Kaizen," and how is it related to total 
quality management? 

20.13 In the Kaizen philosophy, what is the differ- 
ence between "ongoing improvement" and "innovation"? 

20.14 Give some specific examples of how the Kaizen 
concept might be applied (a) by a clerk at a convenience 
store, and (b) by the manager of an apartment that rents 
to university students. 

20.15 What is a quality audit, and for what purposes 
might one be performed? 

20.16 What is process capability, and how can its 

presence be ascertained? 



20.17 What is competitive benchmarking, and how can it 

be used? 

20.18 What is just-in-time manufacturing, and how does 
it contribute to total quality management? 

20.19 "JIT manufacturing is just a gimmick for reducing 
inventory costs; it has nothing to do with quality." 
Discuss the preceding statement. 

20.20 In what ways can large production runs and high 
inventory levels adversely affect product quality? 

20.21 What two elements are necessary for worker 
empowerment to be a successful tool in total quality 
management? 



20.5 SOME STATISTICAL TOOLS FOR TOTAL 

QUALITY MANAGEMENT 



Many statistical tools can be applied to total quality management, and several are 
briefly described in this section. For the interested reader, more complete discus- 
sion of these and other tools can be found in relevant works, such as Kaoru 
Ishikawa's Guide to Quality Control and William W. Scherkenbach's The 
Deming Route to Quality and Productivity. Sections 20.6-20.9 are devoted to an 
especially popular and important statistical tool for total quality management: 
the statistical process control chart. 



The Process Flow Chart 

The process flow chart visually depicts the operations (or subprocesses) involved 
in a process, showing the sequential relationships between them. The process flow 
chart contributes to our understanding of the process as well as our efforts to im- 
prove it. By showing the operations in the context in which they are active in the 
overall process, the flow diagram facilitates discussion and data collection for 
each individual operation — in essence, each operation can be considered as a 
process itself, and the object of efforts employing the Deming PDCA cycle for 
continuing improvement. Figure 20.1, presented earlier in the context of the pro- 
cess orientation, can be viewed as a process flow chart for the study and reduction 
of sources of variation in a process. Figure 20.2 shows a process flow chart for 
visiting a video rental store and renting a movie. Note that activities are shown in 
rectangular boxes, while decisions are represented by diamond-shaped boxes. 
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FIGURE 20.2 



A process flow chart for 
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FIGURE 20.3 



Rubber compounding Tire assembly Because of its shape, 




the cause-and-effect 
diagram is sometimes called 
a fishbone diagram. 



Problem: 
Blisters in 
tread #552 



Tread extrusion Tire curing 



Source: Norman Gaither, Production and Operations Management, 6/e, Fort Worth: The Dryden Press, 1 994, ch. 17. 

The Cause-and-Effect Diagram 

The cause-and-effect diagram, developed by Kaoru Ishikawa, assists problem solv- 
ing by visually depicting effects and their possible causes. Because of its appear- 
ance, it is sometimes called a fishbone diagram. An example of a cause-and-effect 
diagram for an automotive tire manufacturer is shown in Figure 20.3, and it 
includes possible causative factors leading to blistering in the tread portion of 
the tire. 

As Figure 20.3 shows, the cause-and-effect diagram depicts various causes at 
the left that feed directionally toward the ultimate effect at the far right. The 
causes will typically be associated with people, methods, materials, machines, 
or work environment, and the diagram can include categories such as the four 
subprocesses in Figure 20.3. 
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mind? 
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Either the diagram or the problem-solving sessions that it facilitates can be- 
come more complex when we consider that causes themselves have causes. By 
continually asking "why}" in a procedure similar to peeling back the layers of an 
onion, it is often possible to identify a root cause that lies at the core of the visi- 
ble problem. The following series of "why?" questions is attributed to Taiichi 
Ohno, former vice president of Toyota Motor. 8 

Question 1: Why did the machine stop? 
Answer 1: Because the fuse blew due to an overload. 

Question 2: Why was there an overload? 
Answer 2: Because the bearing lubrication was inadequate. 

Question 3: Why was the lubrication inadequate? 
Answer 3: Because the lubrication pump wasn't working properly. 

Question 4: Why wasn't the lubrication pump working properly? 
Answer 4: Because the pump axle was worn out. 

Question 5: Why was the pump axle worn out? 
Answer 5: Because sludge got in. 

The permanent solution to this machine-stoppage problem ended up being a 
strainer to prevent dirt from getting into the lubrication pump. Had the question 
"why?" not been asked five times, the solution might have been a stopgap mea- 
sure such as a replacement fuse. 

The Pareto Diagram 

The Pareto diagram, also known as the Juran diagram, is a bar chart describing 
the relative frequencies with which various kinds of defects, or nonconformities, 
have occurred. The diagram contributes to the efficiency and effectiveness of 
quality management efforts by helping management and workers concentrate 
their limited resources on areas offering the greatest potential for improvement. 

The usefulness of the Pareto diagram is underscored by a guideline that some refer 
to as the "80/20 rule" — that 80% of the problems come from only 20% of the prob- 
lem areas. Naturally, the exact breakdown will not always be 80/20, but the general 
idea is to use the Pareto diagram for guidance in attacking those few areas where most 
of the problems have occurred. Figure 20.4 shows a Pareto diagram of warranty re- 
pairs for electric clothes dryers. The greatest potential for gains in quality appear to 
be associated with improving the heating element and the electric motor. 

The Check Sheet 

The check sheet can be likened to the "foot soldier" in the military. Often no more 
than a sheet of paper on which to tally frequency counts or record measurements, 
it is simply a mechanism for entering and retaining data that may be the subject 
of further analysis by another statistical tool. The continuing improvements 
sought by Deming's PDCA cycle and the Kaizen concept rely heavily on quanti- 
tative guidance, and companies that practice TQM listen most closely whenever 
advice or proposals are accompanied by supporting data. 

8 Masaaki Imai, Kaizen. New York: McGraw-Hill, 1986, p. 50. 
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Taguchi Methods 

Traditional methods of evaluating quality are of a binary, yes/no nature — either a 
product is within specifications or it is not. For example, if a specification requires 
that a product be within 0.016 inches of 14.000 inches, an individual unit will be 
judged acceptable as long as it is no shorter than 13.984 inches and no longer than 
14.016 inches. In this situation, 14.000 inches is the target and ±0.016 inches is the 
tolerance. 

Engineer Genichi Taguchi suggested that the target measurement is an ideal or 
optimum, and that greater deviations from the target represent greater decreases 
in quality — in other words, one individual unit can be higher in quality than an- 
other, even though both are within the traditional specifications. Parts (a) and (b) 
of Figure 20.5 (page 770) illustrate how the Taguchi method differs from the 
traditional approach to quality evaluation. The curve in part (b) is referred to as 
the Taguchi loss function, and it shows how quality is lost as the measurement 
gets farther away from the target, or optimum value. 

As an example of how the Taguchi method applies, consider the temperature of 
a swimming pool. If the most comfortable water temperature is 80° Fahrenheit and 
a tolerance of ±5° is deemed acceptable to swimmers, a pool manager might tell 
personnel to make sure the temperature stays within the 75° to 85° range. On the 
other hand, swimmers do not suddenly become uncomfortable as the temperature 
drops from 75.1° to 74.9° — 75.1° is a bit uncomfortable as well. In this situation, 
the Taguchi method would recognize that people experience greater discomfort 
(greater quality loss) as the temperature gets farther away from the optimum. 

The Taguchi method can be applied to individual components as well as to 
manufactured products consisting of a great many components. A product is the 
totality of its parts, and some products can be of higher quality than others — 
even though all of their component parts might be within their respective tradi- 
tional specifications. For example, chances are you will be much happier with 
your car if each of its approximately 15,000 parts is close to its ideal specification 
instead of just barely falling within the limits of acceptability. 
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FIGURE 20.5 



While traditional specifica- 
tion limits (a) view a prod- 
uct as either "good" or 
"bad," the Taguchi method 
(b) goes beyond this to rec- 
ognize loss in quality as a 
dimension or measurement 
deviates farther from the 
target. 
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Designed Experiments 

By helping us measure the influence of factors that might affect quality, experimen- 
tation plays an important role in the continual improvement of products and ser- 
vices. With designs like two-way analysis of variance, it is even possible to measure 
effects from various levels of two factors at the same time, along with interactions 
between levels of these factors. The basic notion of considering, then trying and 
evaluating new ideas is what the Deming PDCA cycle is all about. Once again, total 
quality management is highly data dependent, and new ideas must be supported by 
appropriate data and analysis if they are to be adopted on a large scale. 



exercises 




20.22 What is a process flow chart, and how can it 
contribute to the goals of total quality management? 

20.23 What is a cause-and-effect diagram, and what role 
can it play in total quality management? 

20.24 What is the "80/20" rule, and how does it apply to 
total quality management? 

20.25 What is a "check sheet," and what is its role in 
total quality management? 



20.26 Differentiate between the traditional and the 
Taguchi approaches to deviations from a target 
dimension or measurement. 

20.27 How might the traditional and the Taguchi 
approaches to quality loss apply with regard to the amount 
of chocolate put in when making a cup of hot chocolate? 

20.28 How can designed experiments facilitate the appli- 
cation of the Deming PDCA cycle? 
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STATISTICAL PROCESS CONTROL: 
THE CONCEPTS 


20.6 



On the highway, you and your car have probably been shaken by "rumble strips," 
harsh grooves across the road surface that warn you when a construction area or 
a dangerous intersection is about to appear. Imagine a long, straight road with 
two continuous and narrow sets of rumble strips — one at each edge of the lane in 
which you're traveling. If you were blindfolded, you could still stay in your lane. 
The strategy would be simple: adjust to the left when you touch the right-hand 
rumble strip, adjust to the right when you encounter the strip on the left. Such a 
highway would be analogous to the control chart. 

Control charts were developed to keep track of ongoing processes and to signal 
when such a process had reached the point of going "out of control," much like the 
parallel set of rumble strips on our hypothetical highway. Instead of rumble strips, the 
control chart has upper and lower control limits for the sample statistics provided by 
each of successive samples taken over time. When nonrandom, or directional, varia- 
tion causes these measurements to become unusually high or low, the upper or the 
lower control limit will be violated. In this event, the control chart will constitute a 
warning signal that the process may need corrective attention. The control chart can- 
not make the correction. It serves only as a warning that something needs to be done. 

Types of Control Charts 

Control charts apply to either variables or attributes, depending on the type of 
process and how it is being monitored. The distinction is as follows: 

CONTROL CHART FOR VARIABLES This type of control chart is appropriate 
whenever we are interested in monitoring measurements, such as weights, diame- 
ters, thicknesses, or temperatures. The next section of the chapter deals with control 
charts for variables, and includes those for sample means and for sample ranges. 

CONTROL CHART FOR ATTRIBUTES This type of control chart applies when we 
are counting something associated with the process output. Section 20.8 discusses 
control charts for attributes, including those for the proportion of defectives (the 
p-chart) and the number of defects per unit of output (the c-chart). 



The Control Chart Format 

Figure 20.6 (page 772) shows the typical format of a control chart. As successive 
samples are drawn from the process, the results are plotted and the chart indicates 
whether the process is in control. While random variation may occur from one 
sample to the next, as long as sample results stay within the upper and lower 
control limits, there is deemed to be no cause for corrective action. However, if a 
sample result does not fall within the upper and lower control limits, the process 
is said to be out of control. In other words, we conclude that the sample statistic 
is too extreme to have been merely the result of random, or sampling, error. 

If all this seems familiar, it's because the control chart is closely related to the 
concept of hypothesis testing. The upper and lower control limits are analogous 
to the upper and lower critical values that we determined when conducting a 
two-tail test involving a sample mean or proportion. From the standpoint of 
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FIGURE 20.6 
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hypothesis testing, the null hypothesis is "The process is in control." As in 
hypothesis testing, there can be errors. For example, a process may be deemed to 
be in control when in fact it is not (Type II error), or it may be incorrectly judged 
to be out of control (Type I error). 




« il m 



20.29 What is a control chart, and what purpose does it 
serve? 

20.30 Differentiate between a variable and an attribute. 

20.31 What function is performed by a control chart's 
upper and lower limits? 



20.32 In terms of random variation and assignable varia- 
tion, what is meant by the statement, "The process is in 
control"? 

20.33 With respect to the null hypothesis, "H Q : The pro- 
cess is in control," explain the distinction between a 
Type I error and a Type II error. 



20.7 CONTROL CHARTS FOR VARIABLES 



The mean and range charts monitor the central tendency and variability, respec- 
tively, of a process. These control charts are interrelated, and the range in succes- 
sive samples can provide useful information for constructing the mean chart for 
the same process. The mean chart is more closely linked to our previous work 
with hypothesis testing, and it will be examined first. 



Mean Charts 

The mean chart, or "x chart," relies on the normal distribution and the central limit 
theorem. Introduced in Chapter 8, the central limit theorem holds that the means of 
simple random samples from any population will tend to be approximately normally 
distributed, with the distribution of sample means more closely approaching the nor- 
mal distribution as the sample size is increased. Also, when the population is 
normally distributed, the sample means will tend to be normally distributed even for 
relatively small sample sizes. This latter point, which was covered in Chapter 8, is 
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especially important in statistical process control, where it is common practice to 
assume that x (i.e., the width, diameter, weight, or thickness) is normally distributed. 

In the mean chart, upper and lower control limits are usually selected as 
"3-sigma," so called because they represent the area under the normal curve that 
corresponds to ±3 standard deviation units from the mean. This area is 0.9974, so 
there is only a (1.000 — 0.9974) = 0.0026 chance that the mean of a given sample 
will fall outside the "process is in control" boundaries that the limits represent. 



Construction of a Mean Chart 

The process mean (/jl) may be either (1) known from past experience or (2) esti- 
mated on the basis of sample data that have been collected. Our treatment of the 
mean control chart will assume the latter case, and the centerline of the chart will 
be x, the mean of the sample means. If /a happens to be known, it is only neces- 
sary to substitute /x for x in the descriptions that follow. 

Likewise, the process standard deviation (a) may be either (1) known on the 
basis of previous experience with the process or (2) estimated from the amount 
of variability in the samples that have been collected. Accordingly, there are two 
approaches to the construction of a mean chart: 

The mean chart when the process standard deviation (cr) is known: 

• Centerline = x, the mean of the sample means. 

• Upper and lower control limits (UCL and LCL): 



3 



O" 

Vn 



where cr = the process standard deviation 
n = the sample size 
tr/Vw = the standard error of the sample means 



Example 



Mean Chart, Sigma Known 

A machine produces aluminum cylinders, the diameters of which are approxi- 
mately normally distributed. Twenty samples, each with n = 5, have been taken, 
and the mean of the sample means is a diameter of x = 2.1350 inches. Based on 
past experience, the process standard deviation is known to be cr = 0.002 inches. 

SOLUTION 

• If further samples are taken, what are the upper and lower 3-sigma control 
limits for determining whether the process is in control? 



For cr = 0.002 inches and n = 5, the upper and lower control limits will be 

2.1350 ± 3(0.0009) 



cr 0.002 
= 2.1350 ± 3 



n\ 



V5 



or 2.1350±0.0027, or 2.1323 (LCL) and 2.1377 (UCL) inches. 

Provided that a sample mean is no less than 2.1323 inches and no greater than 
2.1377 inches, the process is considered to be in control. 
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• If the process is in control, what is the probability that the next sample taken 
will falsely signal that the process is out of control? 

The 3-sigma limits include 99.74% of the area under the normal curve. As 
long as the process remains in control, the probability is 1.0000 — 0.9974, or 
0.0026, that random variation alone will result in a given sample mean incor- 
rectly indicating that the process has gone out of control. 



If the process standard deviation is not known, the approach is similar, except 
that the upper and lower control limits are determined by using the average range 
(R) of the samples as a measure of process variability. For simplicity, the procedure 
also involves the application of a special factor (A 7 ) obtained from the 3-sigma 
control chart factors table in Appendix A of the text. The table is reproduced here 
as Table 20.2. Note that the value of A 2 depends on n, the size of each sample, 
and that A 2 becomes smaller as n increases. Table 20.2 also includes two addi- 
tional factors. These are used in constructing the range chart, covered in the 
next section. 

The mean chart when the process standard deviation (a) is not known: 

• Centerline = x, the mean of the sample means. 

• Upper and lower control limits (UCL and LCL): 

x ± A 2 R where R = the average range for the samples 
A 2 = value from the 3-sigma 
control chart factors table 




Factors for Determining 
Control Limits, Control 
Chart for the Range 



The 3-sigma control 
chart factors table from 
Appendix A. A 2 depends 
on the size of the individual 



Number of 
Observations 
in Each Sample, 



Factor for Determining 
Control Limits, Control 
Chart for the Mean, 

*2 



samples and is used in 
constructing the mean 
chart. Factors D 3 and D 4 , 
also dependent on the 
value of n, are used in 
making the range chart. 



n 



2 
3 
4 
5 
6 
7 
8 
9 
10 
11 
12 
13 
14 
15 



1.880 
1.023 
0.729 
0.577 
0.483 
0.419 
0.373 
0.337 
0.308 
0.285 
0.266 
0.249 
0.235 
0.223 



0 
0 
0 
0 
0 

0.076 
0.136 
0.184 
0.223 
0.256 
0.284 
0.308 
0.329 
0.348 



3 
2 
2 
2 
2 



267 
575 
282 
115 
004 
924 
864 
816 
777 
744 
716 
692 
671 
652 



Source: From E. S. Pearson, "The Percentage Limits for the Distribution of Range in Samples from a Normal 
Population," Biometrika 24(1932): 416. 
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Example 




Mean Chart, Sigma Unknown 

During the first few hours of a production run, 20 samples have been drawn from 
the output of a machine that produces steel rivets for industrial construction. The 
samples were taken at ten-minute intervals, and each sample consisted of n = 4 riv- 
ets, the lengths of which were measured. The results are shown in Table 20.3 and 
are also listed in file CX20RIVS. 



SOLUTION 

In determining the upper and lower control limits, several preliminary calcula- 
tions are needed. As Table 20.3 shows, the mean of the 20 sample means is x = 
1.398 inches. Since the process standard deviation is not known, we will use the 
average range and the appropriate factor from Table 20.2 to construct the mean 
control chart. The column at the far right in Table 20.3 shows the range for each 
sample. For example, the range in sample number 3 was the difference between 
the largest and smallest rivet, or (1.399 — 1.385) = 0.014 inches. At the bottom 
of the table, the average range for these samples has been calculated as R = 
0.022 inches. 

Given that the size of each of the samples was n = 4 rivets, we refer to 
Table 20.2 and find the appropriate factor to be A 2 = 0.729. Having already 



Sample # 


Item 1 


Item 2 


Item 3 


Item 4 


X 


Range 


1 


1.405 


1.419 


1.377 


1.400 


1.400 


0.042 


2 


1.407 


1.397 


1.377 


1.393 


1.394 


0.030 


3 


1.385 


1.392 


1.399 


1.392 


1.392 


0.014 


4 


1.386 


1.419 


1.387 


1.417 


1.402 


0.033 


5 


1.382 


1.391 


1.390 


1.397 


1.390 


0.015 


6 


1.404 


1.406 


1.404 


1.402 


1.404 


0.004 


7 


1.409 


1.386 


1.399 


1.403 


1.399 


0.023 


8 


1.399 


1.382 


1.389 


1.410 


1.395 


0.028 


9 


1.408 


1.411 


1.394 


1.388 


1.400 


0.023 


10 


1.399 


1.421 


1.400 


1.407 


1.407 


0.022 


11 


1.394 


1.397 


1.396 


1.409 


1.399 


0.015 


12 


1.409 


1.389 


1.398 


1.399 


1.399 


0.020 


13 


1.405 


1.387 


1.399 


1.393 


1.396 


0.018 


14 


1.390 


1.410 


1.388 


1.384 


1.393 


0.026 


15 


1.393 


1.403 


1.387 


1.415 


1.400 


0.028 


16 


1.413 


1.390 


1.395 


1.411 


1.402 


0.023 


17 


1.410 


1.415 


1.392 


1.397 


1.404 


0.023 


18 


1.407 


1.386 


1.396 


1.393 


1.396 


0.021 


19 


1.411 


1.406 


1.392 


1.387 


1.399 


0.024 


20 


1.404 


1.396 


1.391 


1.390 


1.395 


0.014 


























R = 1.398 


R = 0.022 



TABLE 20.3 



Results from 20 samples, 
each with n = 4. The mean 
of the sample means is 
1 .398 inches and the mean 
of the sample ranges is 
0.022 inches. This informa- 
tion, also listed in file 
CX20RIVS, is the basis for 
the Excel and Minitab mean 
charts generated in Com- 
puter Solutions 20.1 . 
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calculated x = 1.398 inches and R = 0.022 inches, the upper and lower con- 
trol limits will be 

x ± A 2 R = 1.398 ± (0.729)(0.022) = 1.398 ± 0.016 

and the control limits are 1.382 inches (LCL) and 1.414 inches (UCL). 



Computer Solutions 20.1 describes the procedures and shows the results when 
Excel and Minitab are used to generate the mean chart for the data in Table 20.3. 
Each plot shows the upper and lower control limits, the centerline (x = 1.398 
inches), and the sequence of 20 sample means. All of the sample means are within 
the control limits, and the process is judged to be under control. If this were not 
the case, it would be necessary to identify the source of the assignable, or nonran- 
dom, variation and take corrective action. 





computer solutions 20.1 







Mean Chart 

In these procedures, the process standard deviation is assumed to be unknown and is 
estimated on the basis of the sample ranges. 

EXCEL 



B 



D 



Statistical Process Control 



Length 



Upper control limit 



1.4145 



Centerline 



1 .3983 



Lower control limit 



1 .3820 



10 



11 



12 



13 



14 



15 



16 



17 



XBar-Chart (using R) 




10 12 14 16 18 20 



Excel Mean Chart, Sigma Unknown 

1 . In file CX20RIVS.XLS, the label is in A1 and the sequence of rivet lengths is in A2:A81 . Click Tools. Click Data Analy- 
sis Plus. Select Statistical Process Control. Click OK. 

2. Enter A1:A81 into the Input Range box. Click Labels. Enter 4 into the Sample size box. Select XBAR (Using R). 

Click OK. This provides the mean chart for the 20 samples in Table 20.3. (Note: As an alternative, you can use Excel 
worksheet template TMSPCMR.XLS, on the disk accompanying the text. The instructions are within the template.) 
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MINITAB 

Minitab Mean Chart, Sigma Unknown 

Xbar Chart of Length 



UCL=1.41450 



X=1.39825 



LCL=1.38200 



2 4 6 8 10 12 14 16 18 20 
Sample 



1. Using file CX20RIVS.MTW, with the 80 rivet lengths in C1: Click Stat. Select Control Charts. Select Variables 
Charts for Subgroups. Click Xbar. 

2. Select All observations for a chart are in one column and enter C1 into the box. Enter 4 (the size of each sample) 
into the Subgroup sizes box. 

3. Click Xbar Options. Click Estimate and select Rbar in the Method for Estimating Standard Deviation menu. 
Click OK. Click OK. The mean chart appears as shown here. 

Note: If the process standard deviation, a, is known, the mean chart with known a can be easily generated. In step 2, 
above, just click Xbar Options, click Parameters, and enter the known process standard deviation into the Historical 
standard deviation box. 



An important note: The procedures in Computer Solutions 20.1 make no as- 
sumptions about the process standard deviation, and estimate it based on the 
sample ranges. In this chapter, our emphasis is on the so-called unknown-sigma 
mean chart available from both Excel and Minitab. However, as pointed out in 
the Minitab portion of Computer Solutions 20.1, Minitab is also capable of 
generating a mean chart when we do happen to know the process standard 
deviation. 

Range Charts 

The purpose of the range chart (or "R chart") is similar to that of the mean chart. 
Relying on collected data, the range chart shows the ranges for consecutive 



1.415 
1.410 
1.405 

s 

CO 

I 1.400 
E 1.395 
1.390 
1.385 
1.380 
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samples and provides an indication of whether the ranges of these and later sam- 
ples are within the limits that random variation would allow. As with the mean 
chart, there are upper and lower control limits to signal whether the process is in 
control. 

Although there is a counterpart to the standard error of the sample mean (i.e., 
a R instead of a-), it is again simpler to refer to a statistical table like Table 20.2. 
Using the appropriate factors from this table, the result will again be a "3-sigma" 
chart. If the process is in control, any given sample will have just a 0.0026 chance 
of violating the chart's limits due to random variation alone. Like the mean chart, 
the range chart is based on successive samples of the same size, but the centerline 
will be the average range (R), and the upper and lower control limits are deter- 
mined as follows: 

The range chart: 

• Centerline = R, the average of the sample ranges. 

• Upper and lower control limits: 

UCL = D 4 R where R = the average range for the 

samples 

LCL = D^R D 3 and D 4 = values from 3-sigma control 

chart factors table 

As an example of the construction of a range chart, we'll begin with the infor- 
mation from the 20 samples shown in Table 20.3. The average value for the range 
has already been calculated as R = 0.022 inches, and this will be the centerline for 
the control chart. Referring to the "w = 4" row of Table 20.2, we find the factors 
for determining the upper and lower control limits to be D 4 = 2.282 and D 3 = 0, 
respectively. The control chart limits are 

UCL = D 4 R, or 2.282(0.022) = 0.050 inches 

LCL = DJl, or 0.000(0.022) = 0.000 inches 

Computer Solutions 20.2 describes the procedures and shows the results when 
Excel and Minitab are used to generate a range chart for the data in Table 20.3. 
Each plot shows the upper and lower control limits, the centerline (R = 0.022 
inches), and the sequence of 20 sample ranges. All of the sample ranges are within 
the control limits, and the process appears to be in control. 

In the range charts in Computer Solutions 20.2, note that the lower control 
limit is zero. As shown in Table 20.2, the value of D 3 is zero whenever n < 6, so 
the lower control limit will always be zero for these sample sizes. As noted in the 
Minitab portion of Computer Solutions 20.2, Minitab can also generate a range 
chart if the process standard deviation happens to be known. 

For a given series of samples, the mean and range charts should be used to- 
gether in determining whether the process is in control. Either chart (or both) may 
signal an "out of control" condition. In part (a) of Figure 20.7 (page 781), the sample 
ranges are within their control limits, but the means have drifted upward and the 
process is no longer in control. The trend of the mean chart in part (a) could have 
occurred, for example, if a cutting tool were gradually wearing down during the 
production process. Part (b) of Figure 20.7 shows a series of samples in which the 
reverse has occurred. While the sample means remain within the control limits, 
the range chart has detected a shift in the variability of the process. Either 
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Range Chart 

In these procedures, the process standard deviation is assumed to be unknown and is 
estimated on the basis of the sample ranges. 

EXCEL 



B 



Statistical Process Control 



Length 



Upper control limit 



0.0509 



Centerline 



0.0223 



Lower control limit 



0 



10 



11 



12 



13 



14 



15 



16 



17 



R-Chart 




10 



12 14 16 18 20 



Excel Range Chart, Sigma Unknown 



1. 



2. 



In file CX20RIVS.XLS, the label is in A1 and the sequence of rivet lengths is in A2:A81 . Click Tools. Click Data Analy- 
sis Plus. Select Statistical Process Control. Click OK. 

Enter A1:A81 into the Input Range box. Click Labels. Enter 4 into the Sample size box. Select R. Click OK. This pro- 
vides the range chart for the 20 samples in Table 20.3. (Note: As an alternative, you can use Excel worksheet template 
TMSPCMR.XLS, on the disk accompanying the text. The instructions are within the template.) 



MINITAB 

Minitab Range Chart, Sigma Unknown 

1. Using file CX20RIVS.MTW, with the 80 rivet lengths in C1 : Click Stat. Select Control Charts. Select Variables Charts 
for Subgroups. Click R. 

2. Select All observations for a chart are in one column and enter C1 into the box. Enter 4 (the size of each sample) 
into the Subgroup sizes box. 

3. Click R Options. Click Estimate and select Rbar in the Method for Estimating Standard Deviation menu. Click 
OK. Click OK. The range chart appears as shown here. 

Note: If the process standard deviation, a, is known, the range chart with known a can be easily generated. In step 2, 
above, just click R Options, click Parameters, and enter the known process standard deviation into the Historical stan- 
dard deviation box. 

(continued) 
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mechanical wear or the need for maintenance may be increasing the dimensional 
variation in the output. 

A process that is out of control can also be identified by a lone sample statis- 
tic that falls outside the control limits. For example, in the range chart in part (b) 
of Figure 20.7, the range for the seventh sample was unusually high. Even if the 
ranges had settled down toward R after that sample was taken, the existence of 
the exceptionally large range for that sample would be a warning signal that the 
process might not have been in control at that time. A single sample may exhibit 
unusual characteristics due to an influence that is relatively temporary, such as an 
electrical surge. 

In Figure 20.7, note that x and R are not necessarily the same as the average 
mean or average range for the samples shown. In practice, the centerlines and the 
upper and lower control limits may be based on an earlier series of twenty or 
more samples during which the process was deemed to be in control. For exam- 
ple, the sample values shown in either segment of Figure 20.7 might represent 
samples 21-30 for the process examined by the control charts in Computer Solu- 
tions 20.1 and 20.2. 

By using Seeing Statistics Applet 21, at the end of the chapter, you can easily 
select consecutive samples and generate two different mean control charts. The 
first chart represents a "well-behaved" process; the second one represents a pro- 
cess that tends to go out of control in some fashion. Notice how even the well- 
behaved process can occasionally give an indication that it might be out of control, 
but keep in mind that this is just due to random variation. 
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Mean UCL 
chart 



LCL 

Range UCL 
chart 

R 



LCL 

(a) The mean is out of control. 




FIGURE 20.7 



The mean chart and the 
range chart should be used 
together, since both are 
important in determining 
whether a process is in 
control. In part (a), the vari- 
ability is within limits, but 
the mean is out of control. 
In part (b), the mean is in 
control, but the range con- 
trol chart has detected a 
shift in the variability of the 
process. 



Mean UCL 
chart 



LCL 

Range UCL 
chart 



LCL 




(b) The mean is in control, but the range has shifted. 


























- 













20.34 In the mean chart, what assumption is made about 
the distribution of individual measurements for the pro- 
cess that is being monitored? 

20.35 In a 3-sigma mean control chart for a process that 
is in control, what is the probability that a given sample 
mean will fall outside the control limits as the result of 
random variation alone? 

20.36 In general, how will the upper and lower control 
limits change if a 2-sigma control chart is used instead of 
a 3-sigma control chart? 

20.37 How does the purpose of a range chart differ from 
that of a mean chart? Why is it a good idea to use both of 
them in monitoring the same process? 

20.38 When the robot welder is functioning properly, 
the natural gas cylinders being produced have an 



average burst strength of 3400 psi, with a standard 
deviation of 100 psi. Determine the centerline and 
the upper and lower control limits for a 3-sigma 
mean control chart in which each sample consists of 
4 cylinders. 

20.39 Use the control chart limits developed in 
Exercise 20.38 and the following sample results to 
evaluate whether the process is in control: 

Sample 



1 



Mean 3390 3447 3410 3466 3440 3384 

burst 

strength 
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20.40 Measured at the lid center, the thickness of the zinc 
coating applied to container lids averages 3.000 mil, with 
a process standard deviation of 0.300 mil. Determine 

the centerline and the upper and lower control limits for 
a 3-sigma mean control chart in which each sample con- 
sists of 4 lids. 

20.41 Use the control chart limits developed in Exercise 
20.40 and the following sample results to evaluate 
whether the process is in control: 

Sample 

1 2 3 4 5 6 



Mean 2.95 2.97 3.03 3.11 2.35 3.19 

thickness 

I data Set / Note: For Exercises 20.42 and 20.43, a 
computer and statistical software are recommended. 

20.42 Samples of 12-volt batteries are taken at 30-minute 
intervals during a production run. Each sample consists of 
3 batteries, and a technician records how long each battery 
will produce 400 amperes during a standard test. Given the 
following data, also listed in file XR20042, construct 3-sigma 
mean and range control charts and evaluate whether the 
process is in control. 



Minutes, Battery Number 



Sample 

Number 12 3 



1 13.3 9.4 12.1 

2 12.2 13.4 8.5 

3 1 1.2 8.2 9.2 

4 7.8 9.7 10.0 

5 10.1 11.4 13.8 

6 9.9 11.7 8.5 



20.43 The tensile strength of sheet metal screws is moni- 
tored by selecting a sample of 4 screws at 30-minute inter- 
vals, then seeing how many pounds of force are required to 
break each screw. Given the following data, also listed in 
file XR20043, construct 3-sigma mean and range control 
charts and evaluate whether the process is in control. 

Pounds, Screw Number 



Sample 

Number 12 3 4 



1 760 843 810 706 

2 771 800 747 742 

3 710 809 876 754 

4 882 875 873 763 

5 718 770 728 939 

6 988 927 902 956 

7 843 739 792 787 



20.8 CONTROL CHARTS FOR ATTRIBUTES 



When we wish to count something regarding the output of a process, a control 
chart for attributes can be used. If we're counting the number of acceptable ver- 
sus defective items in each sample, the p-chart is used. This would be the case if 
output consisted of metal containers that either leaked or did not leak. An at- 
tribute such as leakage is either present or it is not. Other processes involve out- 
put where the number of defects in a unit of production can be counted — for 
example, the number of surface flaws in a 30-foot length of fabric or the number 
of air bubbles in a sheet of glass. For such processes, the c-chart may be used in 
determining whether the process is in control. 

p-Charts 

The p-chart monitors a process by showing the proportion of defectives in a se- 
ries of samples. It is also referred to as a "fraction-defective," "percent-defective," 
or "p chart." Like the mean and range charts, the p-chart has a centerline, and 
upper and lower control limits. The normal approximation to the binomial distri- 
bution is used in constructing a p-chart, and sample sizes must be larger than with 
the mean or range charts. In other respects, the concept is similar to the charts of 
the preceding section. 
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The p-chart: 

• Centerline: p, the average of the sample proportions, or 

Number of samples 

• Upper and lower control limits (UCL and LCL): 



z \l where p = the average of the sample proportions 

n = the size of each sample 
Z = standard error units, normal distribution 



In the expression for the control limits above, the quantity represented by the 
square root symbol is our estimate of a p , the standard error of the sample propor- 
tion for samples of this size. A 3-sigma (z = 3) control chart is common practice, 
and 99.74% of the sample proportions should fall within the limits if the process 
is in control. Both rip and n(l — p) should be &5 for the normal approximation 
to the binomial distribution to apply. If a process generates either a very high or 
a very low proportion of defectives, the samples may need to be quite large to 
satisfy this requirement. 



Example 




p-Chart 

As a test of filament integrity, lightbulbs coming off the manufacturing line are 
sampled and subjected to a voltage 30% higher than that for which they are 
rated. Each sample consists of 100 bulbs, and a bulb is categorized as "defective" 
if it burns out during a 2-second exposure to this higher voltage. Sixteen samples 
from yesterday's production run provided the data shown in Table 20.4 (page 784) 
and listed in file CX20FILS. 



SOLUTION 

For each of the 16 samples described in Table 20.4, the numbers of acceptable and 
defective bulbs have been counted. The samples are of equal size, and the average 
of the 16 sample proportions is the total of the proportion-defective column di- 
vided by the number of samples, or p = 0.135. This is the centerline for the con- 
trol chart. 

Constructing a 3-sigma control chart for these data, z is 3, the average pro- 
portion has been calculated asp = 0.135, and each sample consists of n = 100 
bulbs. The upper and lower control limits for the sample proportion are 

. Ipll - p) * /0.135(1 - 0.135) 

p ± z\ — = 0.135 ± 3V L — = 0.135 ± 0.103 

V n V 100 

and the control limits are 



UCL = 0.135 + 0.103 = 0.238 
LCL = 0.135 - 0.103 = 0.032 
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TABLE 20.4 



Results from 16 samples, 
each with n = 100 items. 
The mean of the propor- 
tion defective values is 
p = 0.135. These data, 
also listed in file CX20FILS, 
are the basis for the Excel 
and Minitab p-charts in 
Computer Solutions 20.3. 



Sample # 



1 



10 
11 
12 
13 
14 
15 
16 



Defectives 



14 



16 



14 



15 



20 



19 
12 
17 



10 
15 



17 
13 
11 



Nondefectives 



92 



86 



84 



86 



85 



80 



83 
94 
90 
85 
91 
83 
87 
89 



Proportion Defective, p 



0.080 



0.140 



0.160 



0.140 



0.150 



0.200 



0.190 
0.120 
0.170 
0.060 
0.100 
0.150 
0.090 
0.170 
0.130 
0.1 10 



p = 0.135 



In the event that calculations result in a lower control limit that is negative, we 
use LCL = 0, because it is not possible to have either a negative number or a neg- 
ative proportion of defectives. Like the mean and range charts, p-charts for later 
samples may employ the centerline and control limits that were determined for 
a p-chart constructed during a period when the process was judged as being in 
control. 

Computer Solutions 20.3 describes the procedures and shows the results 
when Excel and Minitab are used to generate a p-chart for the data in Table 20.4. 
Each plot shows the upper and lower control limits, the centerline (p = 0.135), 
and the sequence of 16 sample proportions. All of the sample proportions are 
within the control limits, and the process appears to be in control. 

c-Charts 

If the objective of statistical process control is to monitor the number of defects 
from each sample, the c-chart is used. Also referred to as the "c chart," it can apply 
to such observations as the number of flaws in 10 feet of tubing, the number of 
scratches on a panel, or the number of "fit-and-finish" defects on an automobile 
body. Each sample consists of a unit of output, the statistic is c = the number of 
defects or flaws, and the distribution of the number of such defects tends to be 
Poisson. The c-chart is constructed using a normal approximation to the Poisson 
distribution. 

The c-chart: 

• Centerline: c, the average number of defects per sample, or 

2c 

Number of samples 
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• Upper and lower control limits (UCL and LCL): 

c±zVc where c = the average number of defects per sample 
Z = standard error units, normal distribution 



computer solutions 20.3 


5 - 

-. 









p-Chart 
EXCEL 



10 



11 



12 



13 



14 



15 



16 



17 



B 



Statistical Process Control 



Upper control limit 



Centerline 



Lower control limit 



DefBulbs 



0.2375 



0.135 



0.0325 



D 



P-Chart 




6 



10 



12 



14 



16 



Excel p-Chart 



1. 



2. 



In file CX20FILS.XLS, the label is in A1 and the sequence containing the number of defectives in each sample is in A2:A1 7. 
Click Tools. Click Data Analysis Plus. Select Statistical Process Control. Click OK. 

Enter A1:A17 into the Input Range box. Click Labels. Enter 100 into the Sample size box. Select P. Click OK. This 
provides the p-chart for the 20 samples in Table 20.4. (Wore: As an alternative, you can use Excel worksheet template 
TMSPCPCH.XLS, on the disk accompanying the text. The instructions are within the template.) 



MINITAB 

Minitab p-Chart 

1 . Using file CX20FILS.MTW, with the sequence containing the number of defectives in each sample in C 1 : Click Stat. Se- 
lect Control Charts. Select Attributes Charts. Click P. 

2. Enter C1 into the Variables box. In the Subgroup box, select Size and enter 100 (the size of each sample) into the 
Subgroup sizes box. Click OK. The p-chart appears as shown here. 

(Note: If we either know the process proportion defective or wish to specify an estimate that is based on past data, just 
select P Chart Options, click Parameters, and enter the known or specified process proportion defective into the 
Proportion box.) 



(continued) 
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P Chart of DefBulbs 










0.25- 




LCL =0.2375 


















0.20- 














srtion 


0.15- 












n n 1 


Propi 


0.10- 
0.05- 


























LCL =0.0325 












2 


4 


6 8 10 12 
Sample 


14 


16 





example i 





c-Chart 

During the production of continuous sheet steel, periodic samples consisting of 
10-foot lengths are examined to determine the number of surface scratches, 
blemishes, or other flaws. Twenty samples have been inspected at periodic inter- 
vals during a production run, with results as shown in Table 20.5 and listed in 
file CX20DEFS. 

SOLUTION 

For each of the 20 samples in Table 20.5, the number of surface defects has been 
counted. The average number of defects per sample is the total number of defects 
divided by the number of samples, or ~c = 106/20 = 5.30. 

Constructing a 3-sigma control chart for these data, the upper and lower con- 
trol limits for c are 

c ± zV! = 5.30 ± 3V530 = 5.30 ± 6.91 

and the control limits are 

UCL = 5.30 + 6.91 = 12.21 

LCL = 5.30 - 6.91 = -1.61, which reverts to 0 

Since it's impossible to have a negative number of defects, the lower control 
limit (calculated as —1.61) is replaced by zero. The same reasoning was introduced 
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Sample # 



1 



10 



Number of 
Defects, c 



Sample # 



11 



12 



13 



14 



15 



16 



17 



19 

20 



Number of 
Defects, c 



10 



TABLE 20.5 



Each of the 20 samples 
consists of a 10-foot length 
of sheet steel. The measure- 
ment for each is c = the 
number of scratches, blem- 
ishes, and other defects 
over the 10-foot length. 
The mean number of 
defects is c = 5.30. These 
data, also listed in file 
CX20DEFS, are the basis 
for the Excel and Minitab 
c-charts in Computer 
Solutions 20.4. 



c = 5.30 



during our discussion of the p-chart and its control limits. As with other control 
charts, later samples may be evaluated on the basis of c-chart centerline and con- 
trol limit values determined during a period when the process was deemed to be 
in control. 



Computer Solutions 20.4 describes the procedures and shows the results 
when Excel and Minitab are used to generate a c-chart for the data in 
Table 20.5. Each plot shows the upper and lower control limits, the centerline 
(c = 5.30), and the number of defects in each of the 20 samples. Each of the c 
values falls within the upper and lower control limits, and the process is judged 
to be in control. 



exercises 





20.44 Differentiate between the p-chart and the c-chart. 
Under what circumstances would each be used? 

20.45 When a given process is operating properly, the 
percentage of defectives is 4%. What will be the 3-sigma 
upper and lower control limits for a p-chart in which 
each sample consists of 150 items? 

20.46 When a given process is operating properly, the 
percentage of defectives is 15%. For a 3-sigma p-chart in 



which the upper and lower control limits are 4 percentage 
points apart, how large will each sample have to be? 

20.47 In the data collected for construction of a 
c-chart, the average number of defects per unit of 
output is c = 7.5. What are the 3-sigma upper and 
lower control limits for the control chart? 

/ data Set / Note: For Exercises 20.48-20.53, a 
computer and statistical software are recommended. 

(exercises continue on page 789) 
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M 



N 



10 
11 
12 
13 
14 
15 
16 



o 
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o 
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<u 
Q 
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c-Chart 




■ Number of Defects, c 



•Centerline 



•UCL 



•LCL 



17 



c-bar = 5.300 



UCL = 



12.207 



LCL = 0.000 



Excel c-Chart 

1 . Open Excel worksheet template TMSPCCCH.XLS, on the disk that accompanies the text. 

2. With the template file remaining open, open file CX20DEFS.XLS, which contains the data shown in Table 20.5. Follow 
the short instruction set in the template. The centerline, control limits, and c-chart plot for the 20 samples will appear 
as shown here. 



MINITAB 

Minitab c-Chart 

1. Using file CX20DEFS.MTW, with the sequence containing the number of defects in each sample in C1: Click Stat. 
Select Control Charts. Select Attributes Charts. Click C. 

2. Enter C1 into the Variables box. Click OK. The c-chart appears as shown here. 

(Note: If we either know the process mean or wish to specify an estimated mean that is based on past data, just select 
C Chart Options, click Parameters, and enter the known or specified process mean into the Mean box.) 
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C Chart of Defects 




10 12 
Sample 



14 



16 



18 



UCt=12.21 



C=5.3 



LCL=0 



20 



20.48 Twelve samples have been taken from a production 
run. Given the following data, construct a 3-sigma p-chart 
and evaluate whether the process is in control. 



Sample 
Number 



Sample 
Size 



Number of 
Defectives 



1 

2 
3 
4 
5 
6 
7 
8 
9 
10 
11 
12 



100 
100 
100 
100 
100 
100 
100 
100 
100 
100 
100 
100 



6 
12 
13 
5 
6 
6 
7 
16 
9 
5 
14 
5 



8 


200 


42 


9 


200 


46 


10 


200 


45 


1 1 


200 


32 


12 


200 


37 


13 


200 


38 


14 


200 


37 


15 


200 


36 


16 


200 


39 


17 


200 


50 


18 


200 


39 


19 


200 


38 


20 


200 


36 



20.50 Using the control limits determined in Exercise 
20.49, evaluate whether the process was in control when 
the following data were collected for samples 21-32. 



20.49 Twenty samples have been taken from a production 
run. Given the following data, construct a 3-sigma p-chart 
and evaluate whether the process is in control. 



Sample 
Number 



Sample 
Size 



Number of 
Defectives 



Sample 
Number 

1 

2 
3 
4 
5 
6 
7 



Sample 
Size 

200 
200 
200 
200 
200 
200 
200 



Number of 
Defectives 

38 
26 
37 
42 
48 
45 
38 



21 

22 
23 
24 
25 
26 
27 
28 
29 
30 
31 
32 



200 
200 
200 
200 
200 
200 
200 
200 
200 
200 
200 
200 



55 
45 
45 
56 
54 
53 
53 
56 
53 
57 
42 
53 
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20.51 After the stamping operation, door panels are peri- 
odically selected for a detailed examination in which the 
number of surface scratches, dents, and other flaws are 
counted. During today's production run, 20 panels were 
inspected. Given the data in the table, construct a 3-sigma 
c-chart, then reach a conclusion regarding whether the 
process is in control. 



Panel 
Number 



c = Number 
of Surface 
Defects 



Panel 
Number 



c = Number 
of Surface 
Defects 



10 



1 

2 
4 
6 
1 
5 
1 
3 
1 1 
12 



1 1 

12 
13 
14 
15 
16 
17 
18 
19 
20 



20.52 Automobiles are randomly selected for a close 
examination for flt-and-finish defects, including such 
items as dirt specks in the paint and panels that are not 
properly aligned. Based on data collected for the follow- 
ing vehicles that were so inspected, construct a 3-sigma 



c-chart, then reach a conclusion regarding whether the 
process is in control. 



Vehicle 
Number 



c = Number 
of Defects 



Vehicle 
Number 



c = Number 
of Defects 



1 


4 


1 1 


g 


2 


7 


12 


10 


3 


4 


13 


5 


4 


9 


14 


10 


5 


9 


15 


7 


6 


8 


16 


6 


7 


7 


17 


8 


8 


6 


18 


5 


9 


7 


19 


7 


10 


5 


20 


1 1 



20.53 Using the control limits determined in Exercise 
20.52, evaluate whether the process was in control when 
the following data were collected for vehicles 21-30. 



Vehicle 


c = Number 


Vehicle 


c = Number 


Number 


of Defects 


Number 


of Defects 


21 


6 


26 


4 


22 


4 


27 


8 


23 


4 


28 


13 


24 


3 


29 


7 


25 


10 


30 


8 



20.9 MORE ON COMPUTER-ASSISTED 

STATISTICAL PROCESS CONTROL 



Besides the mean, range, p- and c-charts, other possibilities include control charts 
for sample standard deviations, individual observations, moving averages, and 
moving ranges. In the latter two cases, the vertical axis represents either the aver- 
age or the range for a "sample" of fixed size that changes in composition each 
time a new observation is made. This is analogous to the concept of the moving 
average discussed in Chapter 18. 

Minitab offers a wide variety of control charts like those just described, but 
the current Excel (even with the Data Analysis Plus add-in) is more limited in the 
variety and flexibility of the control charts it can generate. However, both 
Minitab and Excel (with Data Analysis Plus) provide a complete battery of tests 
regarding the state of control of the process. This self-diagnosis takes the form of 
looking for and reporting any of 8 conditions and patterns. 9 Excel automatically 
does all available diagnostic tests. With Minitab, we must select the "Test" option 
and specify "Select all tests." Shown below are the available diagnostic tests 
provided by Excel and Minitab: 

1. A point is more than 3 sigmas away from the centerline. 

2. Nine points in a row are on the same side of the centerline. 

3. Six points in a row are all headed either upward or downward. 



9 See Minitab, Inc., Minitab Help Guide, v. 15. 
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4. Fourteen points in a row alternate upward and downward (sawtooth pattern). 

5. Two out of three consecutive points are on the same side of the centerline and 
more than 2 sigmas away from it. 

6. Four out of five consecutive points are on the same side of the centerline and 
more than 1 sigma away from it. 

7. Fifteen points in a row are within 1 sigma of the centerline. 

8. Eight points in a row are more than 1 sigma from the centerline. 



exercisES 

Pfci ^ ♦ 



/ data Set / Note: Exercises 20.54-20.58 require a 
computer and statistical software. 

20.54 Repeat Exercise 20.42 and apply all of the diag- 
nostic tests available with your statistical software. Has 
applying these diagnostics to the mean and range charts 
raised any concerns regarding the state of control of the 
process? If so, in what way(s)? 

20.55 Repeat Exercise 20.43 and apply all of the 
diagnostic tests available with your statistical software. 
Has applying these diagnostics to the mean and range 
charts raised any concerns regarding the state of control 
of the process? If so, in what way(s)? 

20.56 Repeat Exercise 20.52 and apply all of the diag- 
nostic tests available with your statistical software. Has 
applying these diagnostics to the c-chart raised any 



SUMMARY 20.10 



• TQM and the process orientation 

Organizations seeking to improve the quality of their goods and services are in- 
creasingly adopting the principles and practices of total quality management 
(TQM). Companies are finding a close parallel between the adage, "a stitch in 
time saves nine" and the TQM concept of dealing with defects through a process 
orientation that concentrates on preventing them from occurring in the first place. 

• Process variation and defect prevention 

Much of the emphasis in TQM is on understanding and reducing variation within 
the process through which the product is produced, and samples of output are used 
in reaching inferences about the stability of the process. The two sources of pro- 
cess variation are random (common-cause) variation and assignable (special-cause) 
variation. 

The traditional philosophy of defect detection generally involved a corporate at- 
mosphere where quality was a separate entity within the firm. The TQM philosophy 
of defect prevention has quality integrated throughout the organizational structure 
and central to corporate thinking in all matters. The emergence of TQM saw pioneers 
like W. Edwards Deming spreading the teachings of TQM and its process orientation. 



concerns regarding the state of control of the process? If 
so, in what way(s)? 

20.57 The data in file XR20057 represent 30 samples, each 
with n = 3 items measured in inches. For the associated 
mean and range charts, have your statistical software 
apply all of the diagnostic tests of which it is capable. 
Has applying these diagnostics to the mean and range 
charts raised any concerns regarding the state of control 
of the process? If so, in what way(s)? 

20.58 The data in file XR20058 represent the number of 
anonymous tips phoned in during each of 50 consecutive 
days to a local law enforcement hotline. Use your statisti- 
cal software in generating an appropriate control chart for 
these data and diagnosing unusual conditions and patterns. 
Have the diagnostics cast any doubt on whether the 
process is in control? If so, in what way(s)? 



Part 6: Special Topics 

• Kaizen and TQM tools 

The practices of TQM are closely related to the Kaizen philosophy of continuing 
improvement and include guidelines such as Deming's 14 points, quality audits, 
competitive benchmarking, just-in-time manufacturing, and worker empower- 
ment. The chapter discusses several of the statistical tools used in applying TQM, 
including the process flow chart, the cause-and-effect diagram, the Pareto dia- 
gram, the check sheet, and the Taguchi view of quality loss. 

• Statistical process control charts 

A popular and important tool for applying TQM is statistical process control, 
where sampled products are used in making inferences about the process from 
which they came. The control charts used in monitoring the process can provide 
a warning signal that the process may have become unstable due to the presence 
of assignable variation. In this event, the process should receive attention and cor- 
rective action. A process that is unstable will tend to generate products that are of 
unpredictable and/or unsatisfactory quality. 

• Control charts for variables versus attributes 

Control charts may be constructed for either variables (measurements) or at- 
tributes (counts). The mean chart and the range chart apply to variables and 
monitor central tendency and variability, respectively. Applicable to attributes, 
the p-chart shows the proportion of defects in successive samples, while the c- 
chart reflects a count of the number of defects found in units of the output. In 
each of these charts, the upper and lower control limits are the boundaries for 
the amount of variation that can be attributed to chance alone. These and 
other control charts can be generated with the use of a computer statistical 
package. 



Lesw equations 



Mean Chart When the Process Standard Deviation Is Known 

• Centerline = x, the mean of the sample means. 

• Upper and lower control limits (UCL and LCL): 

= _/ 0" 



where a = the process standard deviation 
n = the sample size 
(t/Vm = the standard error of the sample means 



Mean Chart When the Process Standard Deviation Is Not Known 

• Centerline = x, the mean of the sample means. 

• Upper and lower control limits (UCL and LCL): 

x ± A 2 R where R = the average range for the samples 
A 2 = value from the 3-sigma control 
chart factors table 



Range Chart 

• Centerline = R, the average of the sample ranges. 
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• Upper and lower control limits: 

UCL = D 4 R where R = the average range for the samples 

LCL = D 3 R D 3 and D 4 = values from 3-sigma control chart factors table 

p-Chart 

• Centerline: p, the average of the sample proportions, or 

Number of samples 

• Upper and lower control limits (UCL and LCL): 



where p = the average of the sample proportions 

n = the size of each sample 
z = standard error units, normal distribution 

c-Chart 

• Centerline: c, the average number of defects per sample, or 

Number of samples 

• Upper and lower control limits (UCL and LCL): 

where c = the average number of defects per sample 
Z = standard error units, normal distribution 
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20.59 For a person who eats hamburgers with ketchup, 
how might the traditional and the Taguchi approaches to 
quality loss apply with regard to the amount of ketchup 
that is put on the hamburger? 

20.60 For each of the following, identify which kind of 
variation is at work — random or assignable — and 
explain your reasoning. 

a. During the noon hour, there is an electrical power 
disruption and nuts machine-tightened during the five 
minutes that emergency power is on have 15% less 
torque than normal. 

b. Although they are all labeled "9-D," the shoes on a 
retailer's shelf are not exactly the same size. Any given 
shoe is just as likely to be slightly larger than 9-D as 
it is to be slightly smaller. 

c. The number of cars leaving downtown Cleveland 
during a Friday rush hour is greater than during 
other days of the week. 

d. In a production run of curtain rods, the lengths are 
normally distributed, with a mean of 24.135 inches 
and a standard deviation of 0.142 inches. 

20.61 A university is starting a new college of engineer- 
ing, has extensive funding, and wants to use competitive 



benchmarking. What U.S. universities might be useful as 
objects of the competitive benchmarking effort? 

20.62 As a machine produces steel rods, accumulated 
deposits and mechanical wear cause the rod diameters to 
increase gradually over time. What type of control chart 
should be used in determining when the process has gone 
out of control? Why? 

20.63 In producing bulk rolls of paper for sale to news- 
papers, a firm wishes to control the quality of the 
paper by keeping track of how many discolored spots 
or other flaws are present in continuous strips of a 
given length. What type of control chart should be 
used and why? 

/ data Set /Note: For Exercises 20.64-20.70, a com- 
puter and statistical software are recommended. 

20.64 In a production process for canvas, a sample con- 
sisting of 25 square feet contains 4 discolored spots that 
must be touched up before shipping. Data for this and 
subsequent samples, each consisting of 25 square feet 

of canvas, are shown on the next page. Use an appropriate 
3-sigma control chart in determining whether the process 
is in control. 
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Sample 


Number 


Sample 


Number 






Measurement 




Number 


of Spots 


Number 


of Spots 






(mm) 












Sample 






















1 


4 


16 


13 


Number 


1 


2 


3 


2 


6 


17 


10 










3 


3 


18 


9 


1 


41.2 


45.6 


43.7 


4 


9 


19 


8 


2 


47.4 


43.7 


49.5 


5 


9 


20 


12 


3 


41.8 


49.7 


44.9 


6 


8 


21 


10 


4 


53.9 


42.6 


53.7 


7 


3 


22 


12 


5 


42.5 


49.1 


50.1 


8 


5 


23 


9 


6 


47.8 


44.8 


47.1 


9 


8 


24 


12 


7 


43.5 


44.5 


48.3 


10 


12 


25 


10 


8 


46.9 


49.5 


51.4 


1 1 


15 


26 


12 


9 


52.0 


54.5 


48.6 


12 


8 


27 


12 


10 


51.9 


38.9 


47.7 


13 


10 


28 


10 


1 1 


39.4 


42.5 


49.8 


14 


8 


29 


1 1 


12 


48.9 


52.4 


60.9 


15 


10 


30 


16 


13 


59.0 


48.4 


56.2 










14 


39.6 


52.4 


41.6 


20.65 The number of defective products in each of 40 sam- 


15 


53.6 


43.4 


56.8 


ples taken from 


a production run are shown below. In each 


16 


49.8 


64.6 


57.8 


sample, 200 units were examined and classified as either 


17 


58.5 


49.4 


46.9 


good or defective, dependin 


g on whether dimensional toler- 


18 


54.5 


49.1 


61.1 


ances had been met. Use an 


appropriate 3 


sigma control 


19 


57.0 


54.0 


57.2 


chart in commenting on the state of control of the process. 


20 


45.2 


57.0 


50.2 



Sample 
Number 



Number of 
Defectives 



Sample 
Number 



Number of 
Defectives 



10 
1 1 
12 
13 
14 
15 
16 
17 



20 



12 
9 

15 
1 1 
15 
8 

14 
13 
10 
12 
7 
12 
12 
14 
8 
9 
7 
8 

10 
12 



21 
22 
23 
24 
25 
26 
27 
28 
29 
30 
31 
32 
33 
34 
35 
36 
37 
38 
39 
40 



10 
10 
7 
8 
12 
9 
12 
16 
12 
1 1 
11 
9 
7 
12 



12 
9 
6 

1 1 



20.67 An adhesives manufacturer periodically selects a 
sample of its product and glues 6 toothpicks together, in 
pairs. A measurement is then made of the number of 
pounds required to separate the toothpicks. The follow- 
ing data represent 15 samples, each consisting of 3 pairs 
of toothpicks, and the number of pounds required to 
separate each pair. Given these measurements, use one or 
more appropriate 3-sigma control charts in evaluating 
whether the process is in control. 

Pounds to 
Separate Pair 



20.66 Following their manufacture, plastic shims are 
sampled at periodic intervals and their thickness (in 
millimeters) measured. Each sample consists of n = 3 
shims, and the following data have been collected for the 
past 20 samples. Given these measurements, use one or 
more appropriate 3-sigma control charts in evaluating 
whether the process is in control. 



Sample 
Number 


1 


2 


3 


1 


22 


28 


28 


2 


30 


32 


29 


3 


31 


30 


29 


4 


30 


32 


30 


5 


33 


28 


28 


6 


29 


29 


27 


7 


27 


34 


28 


8 


31 


28 


33 


9 


28 


27 


29 


10 


28 


25 


18 


1 1 


26 


30 


21 


12 


34 


13 


25 


13 


28 


27 


24 


14 


21 


30 


22 


15 


18 


28 


24 
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20.68 During the production of 15-ampere circuit break- 
ers, samples are taken at periodic intervals and tested for 
the exact amperage at which they break a circuit. Each 
sample consists of n = 4 circuit breakers. For the follow- 
ing data, representing the results from the past 20 
samples, construct one or more appropriate control 
charts and comment on the state of control of the 
process. 



Sample 
Number 



1 

2 
3 
4 
5 
6 
7 
8 
9 
10 
1 1 
12 
13 
14 
15 
16 
17 
18 
19 
20 



Measurement 
Number 



1 



14.8 
15.1 
15.0 
15.1 
15.2 
15.1 
15.6 
15.2 
14.9 
15.1 
15.1 
15.3 
15.3 
15.2 
15.2 
15.3 
16.2 
15.9 
14.7 
15.1 



15.1 
14.8 
15.0 
15.2 
15.2 
15.0 
14.7 
14.9 
14.9 
15.0 
14.7 
15.1 
14.6 
15.1 
14.1 
15.9 
15.9 
15.0 
16.6 
15.2 



15.2 
15.1 
15.0 
14.7 
15.2 
15.2 
14.8 
14.7 
15.4 
14.6 
14.7 
15.0 
14.5 
15.1 
14.9 
15.4 
15.5 
14.9 
15.8 
16.9 



14.7 
15.2 
15.2 
15.1 
14.7 
15.2 
15.2 
15.0 
14.9 
15.2 
14.8 
14.8 
14.9 
15.0 
15.4 
15.6 
15.1 
14.9 
14.7 
15.6 



20.69 Telemarketers for a mail-order firm are assigned to 
make a total of 200 calls each day, then record the num- 
ber of potential customers who are interested in receiving 
a catalog and first-order discount coupon in the mail. For 
each of the past 50 days, the number of catalog/coupon 



packages sent is as shown here. Construct one or more 
appropriate control charts and comment on the state of 
control of the process. 



Days 








Successes 








1-15 


21 


16 


23 


16 


22 


19 


23 


25 




15 


22 


17 


21 


21 


23 


15 




16-30 


18 


20 


14 


24 


18 


18 


22 


27 




15 


13 


29 


18 


21 


21 


25 




31-45 


20 


24 


15 


20 


21 


1 1 


19 


17 




22 


16 


17 


28 


23 


22 


16 




46-50 


22 


18 


19 


15 


21 









20.70 A cabinet manufacturer randomly selects cabinets 
from the production line and examines them for defects. 
For each of the 30 cabinets sampled during today's pro- 
duction, the number of defects is shown below. Given 
these data, construct one or more appropriate control 
charts and comment on the state of control of the 
process. 



Cabinet 
Number 



Number 
of Defects 



Cabinet 
Number 



Number 
of Defects 



2 
3 
4 
5 
6 
7 
8 
9 

10 
1 1 
12 
13 
14 
15 



16 
17 
18 
19 
20 
21 
22 
23 
24 
25 
26 
27 
28 
29 
30 



integrated case 



Thorndike Sports Equipment 




Like any other large firm, Thorndike Sports Equipment re- 
ceives daily telephone calls in which customers either 
praise or complain about a Thorndike product. Luke 
Thorndike is especially sensitive about the number of com- 
plaint calls received daily. Twelve weeks ago, he asked the 
customer service department to keep a daily log of the 
number of telephoned complaints received each day. Just 



six weeks ago, Thorndike ads began carrying a toll-free 
number through which customers could inquire or com- 
ment about Thorndike products. 

The customer service department has just handed Luke 
a summary of the daily complaints over the past 60 business 
days. Luke has just handed it to Ted, who finds the daily to- 
tals shown below. The data are also in file THORN20. 
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1. Assuming the daily number of calls to be Poisson dis- 
tributed, construct a c-chart for the first 30 days. 
Using 3-sigma limits, were there any days in which the 
volume of complaints fell outside the limits? 

2. If any unusually high or low days were identified in 
part 1, construct a new pair of 3-sigma limits based on 
the days that remain from the initial 30-day period. 



3. Using the upper and lower limits identified in part 2, 
extend the c-chart to include days 31-60. Based on the 
3-sigma limits in this chart, does it appear that the toll- 
free number has caused a shift in the daily number of 
complaints received? 



Number of 
Day Complaints 



1 


2 


2 


4 


3 


1 


4 


7 


5 


1 


6 


3 


7 


0 


8 


2 


9 


3 


10 


2 


11 


2 


12 


3 


13 


0 


14 


3 


15 


9 



Number of 
Day Complaints 



16 


3 


17 


1 


18 


1 


19 


0 


20 


2 


21 


1 


22 


5 


23 


7 


24 


6 


25 


1 


26 


0 


27 


3 


28 


3 


29 


3 


30 


2 



Number of 
Day Complaints 



31 


4 


32 


6 


33 


4 


34 


3 


35 


5 


36 


5 


37 


3 


38 


5 


39 


2 


40 


4 


41 


5 


42 


8 


43 


3 


44 


5 


45 


2 



Number of 
Day Complaints 



46 


2 


47 


2 


48 


6 


49 


7 


50 


2 


51 


5 


52 


9 


53 


3 


54 


4 


55 


2 


56 


9 


57 


3 


58 


3 


59 


10 


60 


7 



Willard Bolt Company 

At regular intervals during the last three work shifts, tech- 
nicians at the Willard Bolt Company have taken a total of 
40 samples, each with n = 5 bolts. The measurement 
taken is the bolt diameter, in inches, and the data are listed 
in file CDB20. 

1. Using only the first 20 samples, construct 3-sigma con- 
trol charts for the mean and range. For each chart, plot 
all 20 sample means or ranges. Using the mean and 
range charts together, does the process appear to have 
been in control when each of these samples was taken? 

a. If the process was in control for samples 1-20, 
use the upper and lower control limits from these 
samples in a second set of control charts that in- 
cludes all 40 samples. On this basis, was the pro- 
cess in control during the collection of samples 
21-40? Use the mean chart and the range chart 
together in evaluating process control. 

b. If the process was not in control for each of sam- 
ples 1-20, assume that an assignable cause has 
been identified for those that were outside the 
limits. Use the remaining samples as the basis for 
UCL and LCL values with which to evaluate 
samples 21-40. 



Note: The preceding possibilities would be likely to occur 
in practice, since data that have already been gathered 
through sample 20 might be employed in monitoring the 
process through samples 21-40. In part 2, all of the sam- 
ples will be considered at once. 

2. Considering all 40 samples, construct 3-sigma con- 
trol charts for the mean and range. For each chart, 
plot all 40 sample means or ranges. Using the mean 
and range charts together, does the process appear to 
have been in control when each of these samples was 
taken? 

In general, compared to the charts constructed in 
part 1, do the sample means and ranges for samples 
21-40 fit more easily or less easily into the control 
limits? Would you expect that they should fit more 
easily or less easily into the control limits? Why? 

3. If your computer statistical package provides addi- 
tional tests for determining whether a process is in 
control, carry these out for the entire series of 40 sam- 
ple means and sample ranges, then interpret the re- 
sults. Did the computer identify any "out of control" 
clues that were not caught in part 2? 



Mean Control Chart 



This applet has two different mean charts: 

1. The first mean chart represents a "well-behaved" pro- 
cess that tends to be under control, with samples of 
n = 5 taken from a production process known to have 
/a = 2.135 inches and a = 0.002 inches. Each time we 
press the "New Sample" button, a new sample is se- 
lected and its mean is displayed on the chart. We can 



select up to 100 samples in a row before it's necessary 
to reload the applet and try again. 
2. The second mean chart is similar to the first, but is not 
so well behaved: the process has a tendency to go out 
of control in various ways somewhere between the 1 5 th 
sample and the 35 th sample. 



Examples of the two applets are shown here. 
1 . The "well-behaved" process that tends to be in control: 



2.14 



Maximum Number Samples (1 00) Reached 



2.1375 - 



2.1 



::s 

























































I 



New Sample 



2. The process that has a tendency to go out of control: 



2.1425 



Maximum Number Samples (1 00) Reached 




20.0 30.0 



40.0 50.0 60.0 
New Sample 



80.0 90.0 



(continued) 
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Applet Exercises 

21.1 Using the first applet, press the "New Sample" button 
10 times. You have now generated 10 samples from the 
process and the means are shown on the chart. Does the 
process appear to be in control? 

21.2 Continuing with the first applet, press the "New Sam- 
ple" button 90 more times to reach the limit of 100 con- 
secutive samples. Does it appear that the process was in 
control during the 100 samples? 

21.3 Using the second applet, press the "New Sample" 
button 35 times. You have now generated 35 samples from 
the process and the means are shown on the chart. Does 
the process appear to be in control? If not, describe the 



condition(s) in the chart that led to your "not in control" 
conclusion. 

21.4 Continuing with the second applet, press the "New 
Sample" button 65 more times to reach the limit of 100 
consecutive samples. Describe the condition of the process 
during these last 65 samples. If the process was out of con- 
trol earlier, did it seem to "settle down" and regain control 
during these later samples, or did it continue to have 
control problems? 

21.5 Reload the two mean charts and repeat Applet Exer- 
cises 1 through 4. In what way(s) were the results similar 
and in what way(s) were they different the second time 
around? 




n = 2 



tt: 0.1 



0.2 



0.3 



0.4 



0.5 



0.6 



0.7 



0.8 



k:Q 0.8100 

1 0.1800 

2 0.0100 



0.6400 
0.3200 
0.0400 



0.4900 
0.4200 
0.0900 



0.3600 
0.4800 
0.1600 



0.2500 
0.5000 
0.2500 



0.1600 
0.4800 
0.3600 



0.0900 
0.4200 
0.4900 



0.9 



0.0400 0.0100 
0.3200 0.1800 
0.6400 0.8100 



TABLE A.1 



Binomial Distribution, 
Individual Probabilities for 
x = number of successes in 
n trials, prob(x = k) 



n = 3 



tt: 0.1 

k: 0 0.7290 

1 0.2430 

2 0.0270 

3 0.0010 



0.2 

0.5120 
0.3840 
0.0960 
0.0080 



0.3 

0.3430 
0.4410 
0.1890 
0.0270 



0.4 

0.2160 
0.4320 
0.2880 
0.0640 



0.5 

0.1250 
0.3750 
0.3750 
0.1250 



0.6 

0.0640 
0.2880 
0.4320 
0.2160 



0.7 

0.0270 
0.1890 
0.4410 
0.3430 



0.8 



0.9 



0.0080 0.0010 

0.0960 0.0270 

0.3840 0.2430 

0.5120 0.7290 



tt: 0.1 



0.2 



0.3 



n = 4 



0.4 



0.5 



0.6 



0.7 



0.8 



0.9 



k:Q 0.6561 

1 0.2916 

2 0.0486 

3 0.0036 

4 0.0001 



0.4096 
0.4096 
0.1536 
0.0256 
0.0016 



0.2401 
0.4116 
0.2646 
0.0756 
0.0081 



0.1296 
0.3456 
0.3456 
0.1536 
0.0256 



0.0625 
0.2500 
0.3750 
0.2500 
0.0625 



0.0256 
0.1536 
0.3456 
0.3456 
0.1296 



0.0081 
0.0756 
0.2646 
0.4116 
0.2401 



0.0016 0.0001 

0.0256 0.0036 

0.1536 0.0486 

0.4096 0.2916 

0.4096 0.6561 



tt: 0.1 



0.2 



0.3 



n = 5 



0.4 



0.5 



0.6 



0.7 



0.8 



0.9 



k:0 


0.5905 


0.3277 


0.1681 


0.0778 


0.0313 


0.0102 


0.0024 


0.0003 


0.0000 


1 


0.3281 


0.4096 


0.3601 


0.2592 


0.1562 


0.0768 


0.0284 


0.0064 


0.0005 


2 


0.0729 


0.2048 


0.3087 


0.3456 


0.3125 


0.2304 


0.1323 


0.0512 


0.0081 


3 


0.0081 


0.0512 


0.1323 


0.2304 


0.3125 


0.3456 


0.3087 


0.2048 


0.0729 


4 


0.0005 


0.0064 


0.0284 


0.0768 


0.1562 


0.2592 


0.3601 


0.4096 


0.3281 


5 


0.0000 


0.0003 


0.0024 


0.0102 


0.0313 


0.0778 


0.1681 


0.3277 


0.5905 



Source: Probabilities generated by Minitab, then compiled as shown here. 
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TABLE A.1 



(continued) 




k: 0 
1 
2 
3 
4 
5 
6 



0.5314 
0.3543 
0.0984 
0.0146 
0.0012 
0.0001 
0.0000 



tt: 0.1 



0.2621 
0.3932 
0.2458 
0.0819 
0.0154 
0.0015 
0.0001 



0.2 



0.1176 
0.3025 
0.3241 
0.1852 
0.0595 
0.0102 
0.0007 



0.3 



0.0467 
0.1866 
0.3110 
0.2765 
0.1382 
0.0369 
0.0041 



0.0156 
0.0937 
0.2344 
0.3125 
0.2344 
0.0937 
0.0156 



n = 7 



0.4 



0.5 



0.0041 
0.0369 
0.1382 
0.2765 
0.31 10 
0.1866 
0.0467 



0.6 



0.0007 
0.0102 
0.0595 
0.1852 
0.3241 
0.3025 
0.1 176 



0.7 



0.0001 
0.0015 
0.0154 
0.0819 
0.2458 
0.3932 
0.2621 



0.8 



0.0000 
0.0001 
0.0012 
0.0146 
0.0984 
0.3543 
0.5314 



0.9 



k: 0 


0.4783 


0.2097 


0.0824 


0.0280 


0.0078 


0.0016 


0.0002 


0.0000 




1 


0.3720 


0.3670 


0.2471 


0.1306 


0.0547 


0.0172 


0.0036 


0.0004 


0.0000 


2 


0.1240 


0.2753 


0.3177 


0.2613 


0.1641 


0.0774 


0.0250 


0.0043 


0.0002 


3 


0.0230 


0.1147 


0.2269 


0.2903 


0.2734 


0.1935 


0.0972 


0.0287 


0.0026 


4 


0.0026 


0.0287 


0.0972 


0.1935 


0.2734 


0.2903 


0.2269 


0.1147 


0.0230 


5 


0.0002 


0.0043 


0.0250 


0.0774 


0.1641 


0.2613 


0.3177 


0.2753 


0.1240 


6 


0.0000 


0.0004 


0.0036 


0.0172 


0.0547 


0.1306 


0.2471 


0.3670 


0.3720 


7 




0.0000 


0.0002 


0.0016 


0.0078 


0.0280 


0.0824 


0.2097 


0.4783 



tt: 0.1 



0.2 



0.3 



n = 8 



0.4 



0.5 



0.6 



0.7 



0.8 



0.9 



k:0 


0.4305 


0.1678 


0.0576 


0.0168 


0.0039 


0.0007 


0.0001 


0.0000 




1 


0.3826 


0.3355 


0.1977 


0.0896 


0.0313 


0.0079 


0.0012 


0.0001 




2 


0.1488 


0.2936 


0.2965 


0.2090 


0.1094 


0.0413 


0.0100 


0.001 1 


0.0000 


3 


0.0331 


0.1468 


0.2541 


0.2787 


0.2187 


0.1239 


0.0467 


0.0092 


0.0004 


4 


0.0046 


0.0459 


0.1361 


0.2322 


0.2734 


0.2322 


0.1361 


0.0459 


0.0046 


5 


0.0004 


0.0092 


0.0467 


0.1239 


0.2187 


0.2787 


0.2541 


0.1468 


0.0331 


6 


0.0000 


0.001 1 


0.0100 


0.0413 


0.1094 


0.2090 


0.2965 


0.2936 


0.1488 


7 




0.0001 


0.0012 


0.0079 


0.0313 


0.0896 


0.1977 


0.3355 


0.3826 


8 




0.0000 


0.0001 


0.0007 


0.0039 


0.0168 


0.0576 


0.1678 


0.4305 



tt: 0.1 



0.2 



0.3 



n = 9 



0.4 



0.5 



0.6 



0.7 



0.8 



0.9 



k:0 


0.3874 


0.1342 


0.0404 


0.0101 


0.0020 


0.0003 


0.0000 






1 


0.3874 


0.3020 


0.1556 


0.0605 


0.0176 


0.0035 


0.0004 


0.0000 




2 


0.1722 


0.3020 


0.2668 


0.1612 


0.0703 


0.0212 


0.0039 


0.0003 


0.0000 


3 


0.0446 


0.1762 


0.2668 


0.2508 


0.1641 


0.0743 


0.0210 


0.0028 


0.0001 


4 


0.0074 


0.0661 


0.1715 


0.2508 


0.2461 


0.1672 


0.0735 


0.0165 


0.0008 


5 


0.0008 


0.0165 


0.0735 


0.1672 


0.2461 


0.2508 


0.1715 


0.0661 


0.0074 


6 


0.0001 


0.0028 


0.0210 


0.0743 


0.1641 


0.2508 


0.2668 


0.1762 


0.0446 


7 


0.0000 


0.0003 


0.0039 


0.0212 


0.0703 


0.1612 


0.2668 


0.3020 


0.1722 


8 




0.0000 


0.0004 


0.0035 


0.0176 


0.0605 


0.1556 


0.3020 


0.3874 


9 






0.0000 


0.0003 


0.0020 


0.0101 


0.0404 


0.1342 


0.3874 
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n = 10 



tt: 0.1 


0.2 


0.3 


0.4 


0.5 


0.6 


0.7 


0.8 


0.9 


k: 0 


0.3487 


0.1074 


0.0282 


0.0060 


0.0010 


0.0001 


0.0000 






1 


0.3874 


0.2684 


0.121 1 


0.0403 


0.0098 


0.0016 


0.0001 


0.0000 




2 


0.1937 


0.3020 


0.2335 


0.1209 


0.0439 


0.0106 


0.0014 


0.0001 




3 


0.0574 


0.2013 


0.2668 


0.2150 


0.1172 


0.0425 


0.0090 


0.0008 


0.0000 


4 


0.0112 


0.0881 


0.2001 


0.2508 


0.2051 


0.1115 


0.0368 


0.0055 


0.0001 


5 


0.0015 


0.0264 


0.1029 


0.2007 


0.2461 


0.2007 


0.1029 


0.0264 


0.0015 


6 


0.0001 


0.0055 


0.0368 


0.1115 


0.2051 


0.2508 


0.2001 


0.0881 


0.0112 


7 


0.0000 


0.0008 


0.0090 


0.0425 


0.1172 


0.2150 


0.2668 


0.2013 


0.0574 


8 




0.0001 


0.0014 


0.0106 


0.0439 


0.1209 


0.2335 


0.3020 


0.1937 


9 




0.0000 


0.0001 


0.0016 


0.0098 


0.0403 


0.1211 


0.2684 


0.3874 


10 






0.0000 


0.0001 


0.0010 


0.0060 


0.0282 


0.1074 


0.3487 



TABLE A.1 



(continued) 



n = 11 



tt: 0.1 


0.2 


0.3 


0.4 


0.5 


0.6 


0.7 


0.8 


0.9 


k:0 


0.3138 


0.0859 


0.0198 


0.0036 


0.0005 


0.0000 








1 


0.3835 


0.2362 


0.0932 


0.0266 


0.0054 


0.0007 


0.0000 






2 


0.2131 


0.2953 


0.1998 


0.0887 


0.0269 


0.0052 


0.0005 


0.0000 




3 


0.0710 


0.2215 


0.2568 


0.1774 


0.0806 


0.0234 


0.0037 


0.0002 




4 


0.0158 


0.1107 


0.2201 


0.2365 


0.1611 


0.0701 


0.0173 


0.0017 


0.0000 


5 


0.0025 


0.0388 


0.1321 


0.2207 


0.2256 


0.1471 


0.0566 


0.0097 


0.0003 


6 


0.0003 


0.0097 


0.0566 


0.1471 


0.2256 


0.2207 


0.1321 


0.0388 


0.0025 


7 


0.0000 


0.0017 


0.0173 


0.0701 


0.1611 


0.2365 


0.2201 


0.1107 


0.0158 


8 




0.0002 


0.0037 


0.0234 


0.0806 


0.1774 


0.2568 


0.2215 


0.0710 


9 




0.0000 


0.0005 


0.0052 


0.0269 


0.0887 


0.1998 


0.2953 


0.2131 


10 






0.0000 


0.0007 


0.0054 


0.0266 


0.0932 


0.2362 


0.3835 


11 








0.0000 


0.0005 


0.0036 


0.0198 


0.0859 


0.3138 



n = 12 



tt: 0.1 


0.2 


0.3 


0.4 


0.5 


0.6 


0.7 


0.8 


0.9 


k:0 


0.2824 


0.0687 


0.0138 


0.0022 


0.0002 


0.0000 








1 


0.3766 


0.2062 


0.0712 


0.0174 


0.0029 


0.0003 


0.0000 






2 


0.2301 


0.2835 


0.1678 


0.0639 


0.0161 


0.0025 


0.0002 


0.0000 




3 


0.0852 


0.2362 


0.2397 


0.1419 


0.0537 


0.0125 


0.0015 


0.0001 




4 


0.0213 


0.1329 


0.231 1 


0.2128 


0.1208 


0.0420 


0.0078 


0.0005 




5 


0.0038 


0.0532 


0.1585 


0.2270 


0.1934 


0.1009 


0.0291 


0.0033 


0.0000 


6 


0.0005 


0.0155 


0.0792 


0.1766 


0.2256 


0.1766 


0.0792 


0.0155 


0.0005 


7 


0.0000 


0.0033 


0.0291 


0.1009 


0.1934 


0.2270 


0.1585 


0.0532 


0.0038 


8 




0.0005 


0.0078 


0.0420 


0.1208 


0.2128 


0.2311 


0.1329 


0.0213 


9 




0.0001 


0.0015 


0.0125 


0.0537 


0.1419 


0.2397 


0.2362 


0.0852 


10 




0.0000 


0.0002 


0.0025 


0.0161 


0.0639 


0.1678 


0.2835 


0.2301 


11 






0.0000 


0.0003 


0.0029 


0.0174 


0.0712 


0.2062 


0.3766 


12 








0.0000 


0.0002 


0.0022 


0.0138 


0.0687 


0.2824 
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TABLE A.1 



tt: 0.1 


0.2 


0.3 


0.4 


0.5 


0.6 


0.7 


0.8 


0.9 


t:0 


0.2542 


0.0550 


0.0097 


0.0013 


0.0001 


0.0000 








1 


0.3672 


0.1787 


0.0540 


0.0113 


0.0016 


0.0001 


0.0000 






2 


0.2448 


0.2680 


0.1388 


0.0453 


0.0095 


0.0012 


0.0001 






3 


0.0997 


0.2457 


0.2181 


0.1 107 


0.0349 


0.0065 


0.0006 


0.0000 




4 


0.0277 


0.1535 


0.2337 


0.1845 


0.0873 


0.0243 


0.0034 


0.0001 




5 


0.0055 


0.0691 


0.1803 


0.2214 


0.1571 


0.0656 


0.0142 


0.001 1 


0.0000 


6 


0.0008 


0.0230 


0.1030 


0.1968 


0.2095 


0.1312 


0.0442 


0.0058 


0.0001 


7 


0.0001 


0.0058 


0.0442 


0.1312 


0.2095 


0.1968 


0.1030 


0.0230 


0.0008 


8 


0.0000 


0.001 1 


0.0142 


0.0656 


0.1571 


0.2214 


0.1803 


0.0691 


0.0055 


9 




0.0001 


0.0034 


0.0243 


0.0873 


0.1845 


0.2337 


0.1535 


0.0277 


10 




0.0000 


0.0006 


0.0065 


0.0349 


0.1107 


0.2181 


0.2457 


0.0997 


11 






0.0001 


0.0012 


0.0095 


0.0453 


0.1388 


0.2680 


0.2448 


12 






0.0000 


0.0001 


0.0016 


0.0113 


0.0540 


0.1787 


0.3672 


13 








0.0000 


0.0001 


0.0013 


0.0097 


0.0550 


0.2542 



n = 14 



tt: 0.1 


0.2 


0.3 


0.4 


0.5 


0.6 


0.7 


0.8 


0.9 


k:0 


0.2288 


0.0440 


0.0068 


0.0008 


0.0001 


0.0000 








1 


0.3559 


0.1539 


0.0407 


0.0073 


0.0009 


0.0001 








2 


0.2570 


0.2501 


0.1134 


0.0317 


0.0056 


0.0005 


0.0000 






3 


0.1142 


0.2501 


0.1943 


0.0845 


0.0222 


0.0033 


0.0002 






4 


0.0349 


0.1720 


0.2290 


0.1549 


0.0611 


0.0136 


0.0014 


0.0000 




5 


0.0078 


0.0860 


0.1963 


0.2066 


0.1222 


0.0408 


0.0066 


0.0003 




6 


0.0013 


0.0322 


0.1262 


0.2066 


0.1833 


0.0918 


0.0232 


0.0020 


0.0000 


7 


0.0002 


0.0092 


0.0618 


0.1574 


0.2095 


0.1574 


0.0618 


0.0092 


0.0002 


8 


0.0000 


0.0020 


0.0232 


0.0918 


0.1833 


0.2066 


0.1262 


0.0322 


0.0013 


9 




0.0003 


0.0066 


0.0408 


0.1222 


0.2066 


0.1963 


0.0860 


0.0078 


10 




0.0000 


0.0014 


0.0136 


0.0611 


0.1549 


0.2290 


0.1720 


0.0349 


11 






0.0002 


0.0033 


0.0222 


0.0845 


0.1943 


0.2501 


0.1 142 


12 






0.0000 


0.0005 


0.0056 


0.0317 


0.1 134 


0.2501 


0.2570 


13 








0.0001 


0.0009 


0.0073 


0.0407 


0.1539 


0.3559 


14 








0.0000 


0.0001 


0.0008 


0.0068 


0.0440 


0.2288 



n = 15 



tt: 0.1 


0.2 


0.3 


0.4 


0.5 


0.6 


0.7 


0.8 


0.9 


k:0 


0.2059 


0.0352 


0.0047 


0.0005 


0.0000 










1 


0.3432 


0.1319 


0.0305 


0.0047 


0.0005 


0.0000 








2 


0.2669 


0.2309 


0.0916 


0.0219 


0.0032 


0.0003 


0.0000 






3 


0.1285 


0.2501 


0.1700 


0.0634 


0.0139 


0.0016 


0.0001 






4 


0.0428 


0.1876 


0.2186 


0.1268 


0.0417 


0.0074 


0.0006 


0.0000 




5 


0.0105 


0.1032 


0.2061 


0.1859 


0.0916 


0.0245 


0.0030 


0.0001 




6 


0.0019 


0.0430 


0.1472 


0.2066 


0.1527 


0.0612 


0.0116 


0.0007 




7 


0.0003 


0.0138 


0.081 1 


0.1771 


0.1964 


0.1181 


0.0348 


0.0035 


0.0000 


8 


0.0000 


0.0035 


0.0348 


0.1 181 


0.1964 


0.1771 


0.0811 


0.0138 


0.0003 


9 




0.0007 


0.0116 


0.0612 


0.1527 


0.2066 


0.1472 


0.0430 


0.0019 


10 




0.0001 


0.0030 


0.0245 


0.0916 


0.1859 


0.2061 


0.1032 


0.0105 


11 




0.0000 


0.0006 


0.0074 


0.0417 


0.1268 


0.2186 


0.1876 


0.0428 


12 






0.0001 


0.0016 


0.0139 


0.0634 


0.1700 


0.2501 


0.1285 


13 






0.0000 


0.0003 


0.0032 


0.0219 


0.0916 


0.2309 


0.2669 


14 








0.0000 


0.0005 


0.0047 


0.0305 


0.1319 


0.3432 


15 










0.0000 


0.0005 


0.0047 


0.0352 


0.2059 
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n = 16 



tt: 0.1 


0.2 


0.3 


0.4 


0.5 


0.6 


0.7 


0.8 


0.9 


k: 0 


0.1853 


0.0281 


0.0033 


0.0003 


0.0000 










1 


0.3294 


0.1126 


0.0228 


0.0030 


0.0002 


0.0000 








2 


0.2745 


0.2111 


0.0732 


0.0150 


0.0018 


0.0001 








3 


0.1423 


0.2463 


0.1465 


0.0468 


0.0085 


0.0008 


0.0000 






4 


0.0514 


0.2001 


0.2040 


0.1014 


0.0278 


0.0040 


0.0002 






5 


0.0137 


0.1201 


0.2099 


0.1623 


0.0667 


0.0142 


0.0013 


0.0000 




6 


0.0028 


0.0550 


0.1649 


0.1983 


0.1222 


0.0392 


0.0056 


0.0002 




7 


0.0004 


0.0197 


0.1010 


0.1889 


0.1746 


0.0840 


0.0185 


0.0012 


0.0000 


8 


0.0001 


0.0055 


0.0487 


0.1417 


0.1964 


0.1417 


0.0487 


0.0055 


0.0001 


9 


0.0000 


0.0012 


0.0185 


0.0840 


0.1746 


0.1889 


0.1010 


0.0197 


0.0004 


10 




0.0002 


0.0056 


0.0392 


0.1222 


0.1983 


0.1649 


0.0550 


0.0028 


11 




0.0000 


0.0013 


0.0142 


0.0667 


0.1623 


0.2099 


0.1201 


0.0137 


12 






0.0002 


0.0040 


0.0278 


0.1014 


0.2040 


0.2001 


0.0514 


13 






0.0000 


0.0008 


0.0085 


0.0468 


0.1465 


0.2463 


0.1423 


14 








0.0001 


0.0018 


0.0150 


0.0732 


0.2111 


0.2745 


15 








0.0000 


0.0002 


0.0030 


0.0228 


0.1126 


0.3294 


16 










0.0000 


0.0003 


0.0033 


0.0281 


0.1853 



TABLE A.1 



(continued) 



n = 17 



tt: 0.1 


0.2 


0.3 




0.4 


0.5 


0.6 


0.7 


0.8 


0.9 


k:0 


0.1668 


0.0225 


0.0023 


0 


0002 


0.0000 










1 


0.3150 


0.0957 


0.0169 


0 


0019 


0.0001 


0.0000 








2 


0.2800 


0.1914 


0.0581 


0 


0102 


0.0010 


0.0001 








3 


0.1556 


0.2393 


0.1245 


0 


0341 


0.0052 


0.0004 


0.0000 






4 


0.0605 


0.2093 


0.1868 


0 


0796 


0.0182 


0.0021 


0.0001 






5 


0.0175 


0.1361 


0.2081 


0 


1379 


0.0472 


0.0081 


0.0006 


0.0000 




6 


0.0039 


0.0680 


0.1784 


0 


1839 


0.0944 


0.0242 


0.0026 


0.0001 




7 


0.0007 


0.0267 


0.1201 


0 


1927 


0.1484 


0.0571 


0.0095 


0.0004 




8 


0.0001 


0.0084 


0.0644 


0 


1606 


0.1855 


0.1070 


0.0276 


0.0021 


0.0000 


9 


0.0000 


0.0021 


0.0276 


0 


1070 


0.1855 


0.1606 


0.0644 


0.0084 


0.0001 


10 




0.0004 


0.0095 


0 


0571 


0.1484 


0.1927 


0.1201 


0.0267 


0.0007 


11 




0.0001 


0.0026 


0 


0242 


0.0944 


0.1839 


0.1784 


0.0680 


0.0039 


12 




0.0000 


0.0006 


0 


0081 


0.0472 


0.1379 


0.2081 


0.1361 


0.0175 


13 






0.0001 


0 


0021 


0.0182 


0.0796 


0.1868 


0.2093 


0.0605 


14 






0.0000 


0 


0004 


0.0052 


0.0341 


0.1245 


0.2393 


0.1556 


15 








0 


0001 


0.0010 


0.0102 


0.0581 


0.1914 


0.2800 


16 








0 


0000 


0.0001 


0.0019 


0.0169 


0.0957 


0.3150 


17 












0.0000 


0.0002 


0.0023 


0.0225 


0.1668 
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(continued) 



tt: 0.1 


0.2 


0.3 


0.4 


0.5 


0.6 


0.7 


0.8 


0.9 


k: 0 


0.1501 


0.0180 


0.0016 


0.0001 


0.0000 










1 


0.3002 


0.081 1 


0.0126 


0.0012 


0.0001 










2 


0.2835 


0.1723 


0.0458 


0.0069 


0.0006 


0.0000 








3 


0.1680 


0.2297 


0.1046 


0.0246 


0.0031 


0.0002 








4 


0.0700 


0.2153 


0.1681 


0.0614 


0.01 17 


0.0011 


0.0000 






5 


0.0218 


0.1507 


0.2017 


0.1 146 


0.0327 


0.0045 


0.0002 






6 


0.0052 


0.0816 


0.1873 


0.1655 


0.0708 


0.0145 


0.0012 


0.0000 




7 


0.0010 


0.0350 


0.1376 


0.1892 


0.1214 


0.0374 


0.0046 


0.0001 




8 


0.0002 


0.0120 


0.081 1 


0.1734 


0.1669 


0.0771 


0.0149 


0.0008 




9 


0.0000 


0.0033 


0.0386 


0.1284 


0.1855 


0.1284 


0.0386 


0.0033 


0.0000 


10 




0.0008 


0.0149 


0.0771 


0.1669 


0.1734 


0.0811 


0.0120 


0.0002 


11 




0.0001 


0.0046 


0.0374 


0.1214 


0.1892 


0.1376 


0.0350 


0.0010 


12 




0.0000 


0.0012 


0.0145 


0.0708 


0.1655 


0.1873 


0.0816 


0.0052 


13 






0.0002 


0.0045 


0.0327 


0.1146 


0.2017 


0.1507 


0.0218 


14 






0.0000 


0.0011 


0.01 17 


0.0614 


0.1681 


0.2153 


0.0700 


15 








0.0002 


0.0031 


0.0246 


0.1046 


0.2297 


0.1680 


16 








0.0000 


0.0006 


0.0069 


0.0458 


0.1723 


0.2835 


17 










0.0001 


0.0012 


0.0126 


0.081 1 


0.3002 


18 










0.0000 


0.0001 


0.0016 


0.0180 


0.1501 



n = 19 



tt: 0.1 


0.2 


0.3 


0.4 


0.5 


0.6 


0.7 


0.8 


0.9 


k:Q 


0.1351 


0.0144 


0.001 1 


0.0001 












1 


0.2852 


0.0685 


0.0093 


0.0008 


0.0000 










2 


0.2852 


0.1540 


0.0358 


0.0046 


0.0003 


0.0000 








3 


0.1796 


0.2182 


0.0869 


0.0175 


0.0018 


0.0001 








4 


0.0798 


0.2182 


0.1491 


0.0467 


0.0074 


0.0005 


0.0000 






5 


0.0266 


0.1636 


0.1916 


0.0933 


0.0222 


0.0024 


0.0001 






6 


0.0069 


0.0955 


0.1916 


0.1451 


0.0518 


0.0085 


0.0005 






7 


0.0014 


0.0443 


0.1525 


0.1797 


0.0961 


0.0237 


0.0022 


0.0000 




8 


0.0002 


0.0166 


0.0981 


0.1797 


0.1442 


0.0532 


0.0077 


0.0003 




9 


0.0000 


0.0051 


0.0514 


0.1464 


0.1762 


0.0976 


0.0220 


0.0013 




10 




0.0013 


0.0220 


0.0976 


0.1762 


0.1464 


0.0514 


0.0051 


0.0000 


11 




0.0003 


0.0077 


0.0532 


0.1442 


0.1797 


0.0981 


0.0166 


0.0002 


12 




0.0000 


0.0022 


0.0237 


0.0961 


0.1797 


0.1525 


0.0443 


0.0014 


13 






0.0005 


0.0085 


0.0518 


0.1451 


0.1916 


0.0955 


0.0069 


14 






0.0001 


0.0024 


0.0222 


0.0933 


0.1916 


0.1636 


0.0266 


15 






0.0000 


0.0005 


0.0074 


0.0467 


0.1491 


0.2182 


0.0798 


16 








0.0001 


0.0018 


0.0175 


0.0869 


0.2182 


0.1796 


17 








0.0000 


0.0003 


0.0046 


0.0358 


0.1540 


0.2852 


18 










0.0000 


0.0008 


0.0093 


0.0685 


0.2852 


19 












0.0001 


0.0011 


0.0144 


0.1351 
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tt: 0.1 


0.2 


0.3 


0.4 


0.5 


0.6 


0.7 


0.8 


0.9 


k: 0 


0.1216 


0.0115 


0.0008 


0.0000 












1 


0.2702 


0.0576 


0.0068 


0.0005 


0.0000 










2 


0.2852 


0.1369 


0.0278 


0.0031 


0.0002 










3 


0.1901 


0.2054 


0.0716 


0.0123 


0.0011 


0.0000 








4 


0.0898 


0.2182 


0.1304 


0.0350 


0.0046 


0.0003 








5 


0.0319 


0.1746 


0.1789 


0.0746 


0.0148 


0.0013 


0.0000 






6 


0.0089 


0.1091 


0.1916 


0.1244 


0.0370 


0.0049 


0.0002 






7 


0.0020 


0.0545 


0.1643 


0.1659 


0.0739 


0.0146 


0.0010 


0.0000 




8 


0.0004 


0.0222 


0.1144 


0.1797 


0.1201 


0.0355 


0.0039 


0.0001 




9 


0.0001 


0.0074 


0.0654 


0.1597 


0.1602 


0.0710 


0.0120 


0.0005 




10 


0.0000 


0.0020 


0.0308 


0.1 171 


0.1762 


0.1171 


0.0308 


0.0020 


0.0000 


11 




0.0005 


0.0120 


0.0710 


0.1602 


0.1597 


0.0654 


0.0074 


0.0001 


12 




0.0001 


0.0039 


0.0355 


0.1201 


0.1797 


0.1 144 


0.0222 


0.0004 


13 




0.0000 


0.0010 


0.0146 


0.0739 


0.1659 


0.1643 


0.0545 


0.0020 


14 






0.0002 


0.0049 


0.0370 


0.1244 


0.1916 


0.1091 


0.0089 


15 






0.0000 


0.0013 


0.0148 


0.0746 


0.1789 


0.1746 


0.0319 


16 








0.0003 


0.0046 


0.0350 


0.1304 


0.2182 


0.0898 


17 








0.0000 


0.0011 


0.0123 


0.0716 


0.2054 


0.1901 


18 










0.0002 


0.0031 


0.0278 


0.1369 


0.2852 


19 










0.0000 


0.0005 


0.0068 


0.0576 


0.2702 


20 












0.0000 


0.0008 


0.01 15 


0.1216 



TABLE A.1 



(continued) 



n = 25 



tt: 0.1 


0.2 


0.3 


0.4 


0.5 


0.6 


0.7 


0.8 


0.9 


k:Q 


0.0718 


0.0038 


0.0001 














1 


0.1994 


0.0236 


0.0014 


0.0000 












2 


0.2659 


0.0708 


0.0074 


0.0004 


0.0000 










3 


0.2265 


0.1358 


0.0243 


0.0019 


0.0001 










4 


0.1384 


0.1867 


0.0572 


0.0071 


0.0004 










5 


0.0646 


0.1960 


0.1030 


0.0199 


0.0016 


0.0000 








6 


0.0239 


0.1633 


0.1472 


0.0442 


0.0053 


0.0002 








7 


0.0072 


0.1108 


0.1712 


0.0800 


0.0143 


0.0009 


0.0000 






8 


0.0018 


0.0623 


0.1651 


0.1200 


0.0322 


0.0031 


0.0001 






9 


0.0004 


0.0294 


0.1336 


0.1511 


0.0609 


0.0088 


0.0004 






10 


0.0001 


0.0118 


0.0916 


0.1612 


0.0974 


0.0212 


0.0013 


0.0000 




11 


0.0000 


0.0040 


0.0536 


0.1465 


0.1328 


0.0434 


0.0042 


0.0001 




12 




0.0012 


0.0268 


0.1 140 


0.1550 


0.0760 


0.0115 


0.0003 




13 




0.0003 


0.0115 


0.0760 


0.1550 


0.1140 


0.0268 


0.0012 




14 




0.0001 


0.0042 


0.0434 


0.1328 


0.1465 


0.0536 


0.0040 


0.0000 


15 




0.0000 


0.0013 


0.0212 


0.0974 


0.1612 


0.0916 


0.0118 


0.0001 


16 






0.0004 


0.0088 


0.0609 


0.1511 


0.1336 


0.0294 


0.0004 


17 






0.0001 


0.0031 


0.0322 


0.1200 


0.1651 


0.0623 


0.0018 


18 






0.0000 


0.0009 


0.0143 


0.0800 


0.1712 


0.1108 


0.0072 


19 








0.0002 


0.0053 


0.0442 


0.1472 


0.1633 


0.0239 


20 








0.0000 


0.0016 


0.0199 


0.1030 


0.1960 


0.0646 


21 










0.0004 


0.0071 


0.0572 


0.1867 


0.1384 


22 










0.0001 


0.0019 


0.0243 


0.1358 


0.2265 


23 










0.0000 


0.0004 


0.0074 


0.0708 


0.2659 


24 












0.0000 


0.0014 


0.0236 


0.1994 


25 














0.0001 


0.0038 


0.0718 
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TABLE A.2 



Binomial Distribution, 
Cumulative Probabilities 
for x = number of 
successes in n trials, 
prob(x < k) 



n = 2 



tt: 0.1 



0.2 



0.3 



0.4 



0.5 



0.6 



0.7 



0.8 



k:Q 0.8100 

1 0.9900 

2 1.0000 



0.6400 
0.9600 
1.0000 



0.4900 
0.9100 
1.0000 



0.3600 
0.8400 
1.0000 



0.2500 
0.7500 
1.0000 



0.1600 
0.6400 
1.0000 



0.0900 
0.5100 
1.0000 



0.9 



0.0400 0.0100 
0.3600 0.1900 
1.0000 1.0000 



tt: 0.1 



0.2 



0.3 



n = 3 



0.4 



0.5 



0.6 



0.7 



0.8 



0.9 



k: 0 0.7290 

1 0.9720 

2 0.0990 

3 1.0000 



0.5120 
0.8960 
0.9920 
1.0000 



0.3430 
0.7840 
0.9730 
1.0000 



0.2160 
0.6480 
0.9360 
1.0000 



0.1250 
0.5000 
0.8750 
1.0000 



0.0640 
0.3520 
0.7840 
1.0000 



0.0270 
0.2160 
0.6570 
1.0000 



0.0080 0.0010 

0.1040 0.0280 

0.4880 0.2710 

1.0000 1.0000 



tt: 0.1 



0.2 



0.3 



n = 4 



0.4 



0.5 



0.6 



0.7 



0.8 



0.9 



k:Q 0.6561 

1 0.9477 

2 0.9963 

3 0.9999 

4 1.0000 



0.4096 
0.8192 
0.9728 
0.9984 
1.0000 



0.2401 
0.6517 
0.9163 
0.9919 
1.0000 



0.1296 
0.4752 
0.8208 
0.9744 
1.0000 



0.0625 
0.3125 
0.6875 
0.9375 
1.0000 



0.0256 
0.1792 
0.5248 
0.8704 
1.0000 



0.0081 
0.0837 
0.3483 
0.7599 
1.0000 



0.0016 0.0001 

0.0272 0.0037 

0.1808 0.0523 

0.5904 0.3439 

1.0000 1.0000 



tt: 0.1 



0.2 



0.3 



n = 5 



0.4 



0.5 



0.6 



0.7 



0.8 



0.9 



k:0 


0.5905 


0.3277 


0.1681 


0.0778 


0.0313 


0.0102 


0.0024 


0.0003 


0.0000 


1 


0.9185 


0.7373 


0.5282 


0.3370 


0.1875 


0.0870 


0.0308 


0.0067 


0.0005 


2 


0.9914 


0.9421 


0.8369 


0.6826 


0.5000 


0.3174 


0.1631 


0.0579 


0.0086 


3 


0.9995 


0.9933 


0.9692 


0.9130 


0.8125 


0.6630 


0.4718 


0.2627 


0.0815 


4 


1.0000 


0.9997 


0.9976 


0.9898 


0.9688 


0.9222 


0.8319 


0.6723 


0.4095 


5 




1.0000 


1.0000 


1.0000 


1.0000 


1.0000 


1.0000 


1.0000 


1.0000 



tt: 0.1 



0.2 



0.3 



n = 6 



0.4 



0.5 



0.6 



0.7 



0.8 



0.9 



k:0 


0.5314 


0.2621 


0.1176 


0.0467 


0.0156 


0.0041 


0.0007 


0.0001 


0.0000 


1 


0.8857 


0.6554 


0.4202 


0.2333 


0.1094 


0.0410 


0.0109 


0.0016 


0.0001 


2 


0.9841 


0.901 1 


0.7443 


0.5443 


0.3437 


0.1792 


0.0705 


0.0170 


0.0013 


3 


0.9987 


0.9830 


0.9295 


0.8208 


0.6563 


0.4557 


0.2557 


0.0989 


0.0159 


4 


0.9999 


0.9984 


0.9891 


0.9590 


0.8906 


0.7667 


0.5798 


0.3446 


0.1 143 


5 


1 .0000 


0.9999 


0.9993 


0.9959 


0.9844 


0.9533 


0.8824 


0.7379 


0.4686 


6 




1.0000 


1.0000 


1.0000 


1.0000 


1.0000 


1.0000 


1.0000 


1.0000 



Source: Probabilities generated by Minitab, then compiled as shown here. 
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n = 7 



tt: 0.1 


0.2 


0.3 


0.4 


0.5 


0.6 


0.7 


0.8 


0.9 


k:Q 


0.4783 


0.2097 


0.0824 


0.0280 


0.0078 


0.0016 


0.0002 


0.0000 




1 


0.8503 


0.5767 


0.3294 


0.1586 


0.0625 


0.0188 


0.0038 


0.0004 


0.0000 


2 


0.9743 


0.8520 


0.6471 


0.4199 


0.2266 


0.0963 


0.0288 


0.0047 


0.0002 


3 


0.9973 


0.9667 


0.8740 


0.7102 


0.5000 


0.2898 


0.1260 


0.0333 


0.0027 


4 


0.9998 


0.9953 


0.9712 


0.9037 


0.7734 


0.5801 


0.3529 


0.1480 


0.0257 


5 


1 .0000 


0.9996 


0.9962 


0.9812 


0.9375 


0.8414 


0.6706 


0.4233 


0.1497 


6 




1.0000 


0.9998 


0.9984 


0.9922 


0.9720 


0.9176 


0.7903 


0.5217 


7 






1.0000 


1.0000 


1.0000 


1.0000 


1.0000 


1.0000 


1.0000 



TABLE A. 2 



(continued) 



n = 8 



tt: 0.1 



0.2 



0.3 



0.4 



0.5 



0.6 



0.7 



0.8 



0.9 



k:0 


0.4305 


0.1678 


0.0576 


0.0168 


0.0039 


0.0007 


0.0001 


0.0000 




1 


0.8131 


0.5033 


0.2553 


0.1064 


0.0352 


0.0085 


0.0013 


0.0001 




2 


0.9619 


0.7969 


0.5518 


0.3154 


0.1445 


0.0498 


0.0113 


0.0012 


0.0000 


3 


0.9950 


0.9437 


0.8059 


0.5941 


0.3633 


0.1737 


0.0580 


0.0104 


0.0004 


4 


0.9996 


0.9896 


0.9420 


0.8263 


0.6367 


0.4059 


0.1941 


0.0563 


0.0050 


5 


1 .0000 


0.9988 


0.9887 


0.9502 


0.8555 


0.6846 


0.4482 


0.2031 


0.0381 


6 




0.9999 


0.9987 


0.9915 


0.9648 


0.8936 


0.7447 


0.4967 


0.1869 


7 




1.0000 


0.9999 


0.9993 


0.9961 


0.9832 


0.9424 


0.8322 


0.5695 


8 






1.0000 


1.0000 


1.0000 


1.0000 


1.0000 


1 .0000 


1.0000 



n = 9 



tt: 0.1 



0.2 



0.3 



0.4 



0.5 



0.6 



0.7 



0.8 



0.9 



k:0 


0.3874 


0.1342 


0.0404 


0.0101 


0.0020 


0.0003 


0.0000 






1 


0.7748 


0.4362 


0.1960 


0.0705 


0.0195 


0.0038 


0.0004 


0.0000 




2 


0.9470 


0.7382 


0.4628 


0.2318 


0.0898 


0.0250 


0.0043 


0.0003 


0.0000 


3 


0.9917 


0.9144 


0.7297 


0.4826 


0.2539 


0.0994 


0.0253 


0.0031 


0.0001 


4 


0.9991 


0.9804 


0.9012 


0.7334 


0.5000 


0.2666 


0.0988 


0.0196 


0.0009 


5 


0.9999 


0.9969 


0.9747 


0.9006 


0.7461 


0.5174 


0.2703 


0.0856 


0.0083 


6 


1 .0000 


0.9997 


0.9957 


0.9750 


0.9102 


0.7682 


0.5372 


0.2618 


0.0530 


7 




1.0000 


0.9996 


0.9962 


0.9805 


0.9295 


0.8040 


0.5638 


0.2252 


8 






1.0000 


0.9997 


0.9980 


0.9899 


0.9596 


0.8658 


0.6126 


9 








1.0000 


1.0000 


1.0000 


1.0000 


1.0000 


1.0000 



n = 10 



tt: 0.1 



0.2 



0.3 



0.4 



0.5 



0.6 



0.7 



0.8 



0.9 



k:0 


0.3487 


0.1074 


0.0282 


0.0060 


0.0010 


0.0001 


0.0000 






1 


0.7361 


0.3758 


0.1493 


0.0464 


0.0107 


0.0017 


0.0001 


0.0000 




2 


0.9298 


0.6778 


0.3828 


0.1673 


0.0547 


0.0123 


0.0016 


0.0001 




3 


0.9872 


0.8791 


0.6496 


0.3823 


0.1719 


0.0548 


0.0106 


0.0009 


0.0000 


4 


0.9984 


0.9672 


0.8497 


0.6331 


0.3770 


0.1662 


0.0473 


0.0064 


0.0001 


5 


0.9999 


0.9936 


0.9527 


0.8338 


0.6230 


0.3669 


0.1503 


0.0328 


0.0016 


6 


1.0000 


0.9991 


0.9894 


0.9452 


0.8281 


0.6177 


0.3504 


0.1209 


0.0128 


7 




0.9999 


0.9984 


0.9877 


0.9453 


0.8327 


0.6172 


0.3222 


0.0702 


8 




1.0000 


0.9999 


0.9983 


0.9893 


0.9536 


0.8507 


0.6242 


0.2639 


9 






1.0000 


0.9999 


0.9990 


0.9940 


0.9718 


0.8926 


0.6513 


10 








1.0000 


1.0000 


1.0000 


1.0000 


1.0000 


1.0000 
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tt: 0.1 


0.2 


0.3 


0.4 


0.5 


0.6 


0.7 


0.8 


0.9 


k: 0 


0.3138 


0.0859 


0.0198 


0.0036 


0.0005 


0.0000 








1 


0.6974 


0.3221 


0.1130 


0.0302 


0.0059 


0.0007 


0.0000 






2 


0.9104 


0.6174 


0.3127 


0.1 189 


0.0327 


0.0059 


0.0006 


0.0000 




3 


0.9815 


0.8389 


0.5696 


0.2963 


0.1133 


0.0293 


0.0043 


0.0002 




4 


0.9972 


0.9496 


0.7897 


0.5328 


0.2744 


0.0994 


0.0216 


0.0020 


0.0000 


5 


0.9997 


0.9883 


0.9218 


0.7535 


0.5000 


0.2465 


0.0782 


0.0117 


0.0003 


6 


1 .0000 


0.9980 


0.9784 


0.9006 


0.7256 


0.4672 


0.2103 


0.0504 


0.0028 


7 




0.9998 


0.9957 


0.9707 


0.8867 


0.7037 


0.4304 


0.1611 


0.0185 


8 




1.0000 


0.9994 


0.9941 


0.9673 


0.8811 


0.6873 


0.3826 


0.0896 


9 






1.0000 


0.9993 


0.9941 


0.9698 


0.8870 


0.6779 


0.3026 


10 








1.0000 


0.9995 


0.9964 


0.9802 


0.9141 


0.6862 


11 










1.0000 


1.0000 


1.0000 


1.0000 


1.0000 



n = 12 



tt: 0.1 


0.2 


0.3 


0.4 


0.5 


0.6 


0.7 


0.8 


0.9 


k:0 


0.2824 


0.0687 


0.0138 


0.0022 


0.0002 


0.0000 








1 


0.6590 


0.2749 


0.0850 


0.0196 


0.0032 


0.0003 


0.0000 






2 


0.8891 


0.5583 


0.2528 


0.0834 


0.0193 


0.0028 


0.0002 


0.0000 




3 


0.9744 


0.7946 


0.4925 


0.2253 


0.0730 


0.0153 


0.0017 


0.0001 




4 


0.9957 


0.9274 


0.7237 


0.4382 


0.1938 


0.0573 


0.0095 


0.0006 


0.0000 


5 


0.9995 


0.9806 


0.8822 


0.6652 


0.3872 


0.1582 


0.0386 


0.0039 


0.0001 


6 


0.9999 


0.9961 


0.9614 


0.8418 


0.6128 


0.3348 


0.1 178 


0.0194 


0.0005 


7 


1.0000 


0.9994 


0.9905 


0.9427 


0.8062 


0.5618 


0.2763 


0.0726 


0.0043 


8 




0.9999 


0.9983 


0.9847 


0.9270 


0.7747 


0.5075 


0.2054 


0.0256 


9 




1.0000 


0.9998 


0.9972 


0.9807 


0.9166 


0.7472 


0.4417 


0.1 109 


10 






1 .0000 


0.9997 


0.9968 


0.9804 


0.9150 


0.7251 


0.3410 


11 








1.0000 


0.9998 


0.9978 


0.9862 


0.9313 


0.7176 


12 










1.0000 


1.0000 


1.0000 


1.0000 


1.0000 



n = 13 



tt: 0.1 


0.2 


0.3 


0.4 


0.5 


0.6 


0.7 


0.8 


0.9 


k:0 


0.2542 


0.0550 


0.0097 


0.0013 


0.0001 


0.0000 








1 


0.6213 


0.2336 


0.0637 


0.0126 


0.0017 


0.0001 


0.0000 






2 


0.8661 


0.5017 


0.2025 


0.0579 


0.01 12 


0.0013 


0.0001 






3 


0.9658 


0.7473 


0.4206 


0.1686 


0.0461 


0.0078 


0.0007 


0.0000 




4 


0.9935 


0.9009 


0.6543 


0.3530 


0.1334 


0.0321 


0.0040 


0.0002 




5 


0.9991 


0.9700 


0.8346 


0.5744 


0.2905 


0.0977 


0.0182 


0.0012 


0.0000 


6 


0.9999 


0.9930 


0.9376 


0.7712 


0.5000 


0.2288 


0.0624 


0.0070 


0.0001 


7 


1.0000 


0.9988 


0.9818 


0.9023 


0.7095 


0.4256 


0.1654 


0.0300 


0.0009 


8 




0.9998 


0.9960 


0.9679 


0.8666 


0.6470 


0.3457 


0.0991 


0.0065 


9 




1.0000 


0.9993 


0.9922 


0.9539 


0.8314 


0.5794 


0.2527 


0.0342 


10 






0.9999 


0.9987 


0.9888 


0.9421 


0.7975 


0.4983 


0.1339 


11 






1 .0000 


0.9999 


0.9983 


0.9874 


0.9363 


0.7664 


0.3787 


12 








1.0000 


0.9999 


0.9987 


0.9903 


0.9450 


0.7458 


13 










1.0000 


1.0000 


1.0000 


1.0000 


1.0000 
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n = 14 



tt: 0.1 


0.2 


0.3 


0.4 


0.5 


0.6 


0.7 


0.8 


0.9 


k: 0 


0.2288 


0.0440 


0.0068 


0.0008 


0.0001 


0.0000 








1 


0.5846 


0.1979 


0.0475 


0.0081 


0.0009 


0.0001 








2 


0.841 6 


0.4481 


0.1608 


0.0398 


0.0065 


0.0006 


0.0000 






3 


0.9559 


0.6982 


0.3552 


0.1243 


0.0287 


0.0039 


0.0002 






4 


0.9908 


0.8702 


0.5842 


0.2793 


0.0898 


0.0175 


0.0017 


0.0000 




5 


0.9985 


0.9561 


0.7805 


0.4859 


0.2120 


0.0583 


0.0083 


0.0004 




6 


0.9998 


0.9884 


0.9067 


0.6925 


0.3953 


0.1501 


0.0315 


0.0024 


0.0000 


7 


1.0000 


0.9976 


0.9685 


0.8499 


0.6047 


0.3075 


0.0933 


0.0116 


0.0002 


8 




0.9996 


0.9917 


0.9417 


0.7880 


0.5141 


0.2195 


0.0439 


0.0015 


9 




1.0000 


0.9983 


0.9825 


0.9102 


0.7207 


0.4158 


0.1298 


0.0092 


10 






0.9998 


0.9961 


0.9713 


0.8757 


0.6448 


0.3018 


0.0441 


11 






1.0000 


0.9994 


0.9935 


0.9602 


0.8392 


0.5519 


0.1584 


12 








0.9999 


0.9991 


0.9919 


0.9525 


0.8021 


0.4154 


13 








1.0000 


0.9999 


0.9992 


0.9932 


0.9560 


0.7712 


14 










1.0000 


1.0000 


1.0000 


1.0000 


1 .0000 



TABLE A. 2 



(continued) 



n = 15 



tt: 0.1 


0.2 


0.3 


0.4 


0.5 


0.6 


0.7 


0.8 


0.9 


k:0 


0.2059 


0.0352 


0.0047 


0.0005 


0.0000 










1 


0.5490 


0.1671 


0.0353 


0.0052 


0.0005 


0.0000 








2 


0.8159 


0.3980 


0.1268 


0.0271 


0.0037 


0.0003 


0.0000 






3 


0.9444 


0.6482 


0.2969 


0.0905 


0.0176 


0.0019 


0.0001 






4 


0.9873 


0.8358 


0.5155 


0.2173 


0.0592 


0.0093 


0.0007 


0.0000 




5 


0.9978 


0.9389 


0.7216 


0.4032 


0.1509 


0.0338 


0.0037 


0.0001 




6 


0.9997 


0.9819 


0.8689 


0.6098 


0.3036 


0.0950 


0.0152 


0.0008 




7 


1.0000 


0.9958 


0.9500 


0.7869 


0.5000 


0.2131 


0.0500 


0.0042 


0.0000 


8 




0.9992 


0.9848 


0.9050 


0.6964 


0.3902 


0.1311 


0.0181 


0.0003 


9 




0.9999 


0.9963 


0.9662 


0.8491 


0.5968 


0.2784 


0.061 1 


0.0022 


10 




1.0000 


0.9993 


0.9907 


0.9408 


0.7827 


0.4845 


0.1642 


0.0127 


11 






0.9999 


0.9981 


0.9824 


0.9095 


0.7031 


0.3518 


0.0556 


12 






1 .0000 


0.9997 


0.9963 


0.9729 


0.8732 


0.6020 


0.1841 


13 








1.0000 


0.9995 


0.9948 


0.9647 


0.8329 


0.4510 


14 










1 .0000 


0.9995 


0.9953 


0.9648 


0.7941 


15 












1.0000 


1.0000 


1.0000 


1.0000 
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TABLE A.2 



(continued) 



n = 16 



tt: 0.1 


0.2 


0.3 


0.4 


0.5 


0.6 


0.7 


0.8 


0.9 


k: 0 


0.1853 


0.0281 


0.0033 


0.0003 


0.0000 










1 


0.5147 


0.1407 


0.0261 


0.0033 


0.0003 


0.0000 








2 


0.7892 


0.3518 


0.0994 


0.0183 


0.0021 


0.0001 








3 


0.9316 


0.5981 


0.2459 


0.0651 


0.0106 


0.0009 


0.0000 






4 


0.9830 


0.7982 


0.4499 


0.1666 


0.0384 


0.0049 


0.0003 






5 


0.9967 


0.9183 


0.6598 


0.3288 


0.1051 


0.0191 


0.0016 


0.0000 




6 


0.9995 


0.9733 


0.8247 


0.5272 


0.2272 


0.0583 


0.0071 


0.0002 




7 


0.9999 


0.9930 


0.9256 


0.7161 


0.4018 


0.1423 


0.0257 


0.0015 


0.0000 


8 


1.0000 


0.9985 


0.9743 


0.8577 


0.5982 


0.2839 


0.0744 


0.0070 


0.0001 


9 




0.9998 


0.9929 


0.9417 


0.7728 


0.4728 


0.1753 


0.0267 


0.0006 


10 




1.0000 


0.9984 


0.9809 


0.8949 


0.6712 


0.3402 


0.0817 


0.0033 


11 






0.9997 


0.9951 


0.9616 


0.8334 


0.5501 


0.2018 


0.0170 


12 






1 .0000 


0.9991 


0.9894 


0.9349 


0.7541 


0.4019 


0.0684 


13 








0.9999 


0.9979 


0.9817 


0.9006 


0.6482 


0.2108 


14 








1.0000 


0.9997 


0.9967 


0.9739 


0.8593 


0.4853 


15 










1 .0000 


0.9997 


0.9967 


0.9719 


0.8147 


16 












1.0000 


1.0000 


1.0000 


1.0000 



n = 17 



tt: 0.1 


0.2 


0.3 


0.4 


0.5 


0.6 


0.7 


0.8 


0.9 


k:Q 


0.1668 


0.0225 


0.0023 


0.0002 


0.0000 










1 


0.4818 


0.1182 


0.0193 


0.0021 


0.0001 


0.0000 








2 


0.7618 


0.3096 


0.0774 


0.0123 


0.0012 


0.0001 








3 


0.9174 


0.5489 


0.2019 


0.0464 


0.0064 


0.0005 


0.0000 






4 


0.9779 


0.7582 


0.3887 


0.1260 


0.0245 


0.0025 


0.0001 






5 


0.9953 


0.8943 


0.5968 


0.2639 


0.0717 


0.0106 


0.0007 


0.0000 




6 


0.9992 


0.9623 


0.7752 


0.4478 


0.1662 


0.0348 


0.0032 


0.0001 




7 


0.9999 


0.9891 


0.8954 


0.6405 


0.3145 


0.0919 


0.0127 


0.0005 




8 


1 .0000 


0.9974 


0.9597 


0.8011 


0.5000 


0.1989 


0.0403 


0.0026 


0.0000 


9 




0.9995 


0.9873 


0.9081 


0.6855 


0.3595 


0.1046 


0.0109 


0.0001 


10 




0.9999 


0.9968 


0.9652 


0.8338 


0.5522 


0.2248 


0.0377 


0.0008 


11 




1.0000 


0.9993 


0.9894 


0.9283 


0.7361 


0.4032 


0.1057 


0.0047 


12 






0.9999 


0.9975 


0.9755 


0.8740 


0.6113 


0.2418 


0.0221 


13 






1 .0000 


0.9995 


0.9936 


0.9536 


0.7981 


0.451 1 


0.0826 


14 








0.9999 


0.9988 


0.9877 


0.9226 


0.6904 


0.2382 


15 








1.0000 


0.9999 


0.9979 


0.9807 


0.8818 


0.5182 


16 










1 .0000 


0.9998 


0.9977 


0.9775 


0.8332 


17 












1.0000 


1.0000 


1.0000 


1.0000 
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tt: 0.1 


0.2 


0.3 


0.4 


0.5 


0.6 


0.7 


0.8 


0.9 


k: 0 


0.1501 


0.0180 


0.0016 


0.0001 


0.0000 










1 


0.4503 


0.0991 


0.0142 


0.0013 


0.0001 










2 


0.7338 


0.2713 


0.0600 


0.0082 


0.0007 


0.0000 








3 


0.9018 


0.5010 


0.1646 


0.0328 


0.0038 


0.0002 








4 


0.9718 


0.7164 


0.3327 


0.0942 


0.0154 


0.0013 


0.0000 






5 


0.9936 


0.8671 


0.5344 


0.2088 


0.0481 


0.0058 


0.0003 






6 


0.9988 


0.9487 


0.7217 


0.3743 


0.1189 


0.0203 


0.0014 


0.0000 




7 


0.9998 


0.9837 


0.8593 


0.5634 


0.2403 


0.0576 


0.0061 


0.0002 




8 


1 .0000 


0.9957 


0.9404 


0.7368 


0.4073 


0.1347 


0.0210 


0.0009 




9 




0.9991 


0.9790 


0.8653 


0.5927 


0.2632 


0.0596 


0.0043 


0.0000 


10 




0.9998 


0.9939 


0.9424 


0.7597 


0.4366 


0.1407 


0.0163 


0.0002 


11 




1.0000 


0.9986 


0.9797 


0.8811 


0.6257 


0.2783 


0.0513 


0.0012 


12 






0.9997 


0.9942 


0.9519 


0.7912 


0.4656 


0.1329 


0.0064 


13 






1 .0000 


0.9987 


0.9846 


0.9058 


0.6673 


0.2836 


0.0282 


14 








0.9998 


0.9962 


0.9672 


0.8354 


0.4990 


0.0982 


15 








1.0000 


0.9993 


0.9918 


0.9400 


0.7287 


0.2662 


16 










0.9999 


0.9987 


0.9858 


0.9009 


0.5497 


17 










1 .0000 


0.9999 


0.9984 


0.9820 


0.8499 


18 












1.0000 


1.0000 


1.0000 


1.0000 



TABLE A. 2 



(continued) 



n = 19 



tt: 0.1 


0.2 


0.3 


0.4 


0.5 


0.6 


0.7 


0.8 


0.9 


k:0 


0.1351 


0.0144 


0.001 1 


0.0001 












1 


0.4203 


0.0829 


0.0104 


0.0008 


0.0000 










2 


0.7054 


0.2369 


0.0462 


0.0055 


0.0004 


0.0000 








3 


0.8850 


0.4551 


0.1332 


0.0230 


0.0022 


0.0001 








4 


0.9648 


0.6733 


0.2822 


0.0696 


0.0096 


0.0006 


0.0000 






5 


0.9914 


0.8369 


0.4739 


0.1629 


0.0318 


0.0031 


0.0001 






6 


0.9983 


0.9324 


0.6655 


0.3081 


0.0835 


0.01 16 


0.0006 






7 


0.9997 


0.9767 


0.8180 


0.4878 


0.1796 


0.0352 


0.0028 


0.0000 




8 


1 .0000 


0.9933 


0.9161 


0.6675 


0.3238 


0.0885 


0.0105 


0.0003 




9 




0.9984 


0.9674 


0.8139 


0.5000 


0.1861 


0.0326 


0.0016 




10 




0.9997 


0.9895 


0.9115 


0.6762 


0.3325 


0.0839 


0.0067 


0.0000 


11 




1.0000 


0.9972 


0.9648 


0.8204 


0.5122 


0.1820 


0.0233 


0.0003 


12 






0.9994 


0.9884 


0.9165 


0.6919 


0.3345 


0.0676 


0.0017 


13 






0.9999 


0.9969 


0.9682 


0.8371 


0.5261 


0.1631 


0.0086 


14 






1 .0000 


0.9994 


0.9904 


0.9304 


0.7178 


0.3267 


0.0352 


15 








0.9999 


0.9978 


0.9770 


0.8668 


0.5449 


0.1 150 


16 








1.0000 


0.9996 


0.9945 


0.9538 


0.7631 


0.2946 


17 










1 .0000 


0.9992 


0.9896 


0.9171 


0.5797 


18 












0.9999 


0.9989 


0.9856 


0.8649 


19 












1.0000 


1.0000 


1.0000 


1.0000 
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TABLE A.2 



(continued) 



n = 20 



tt: 0.1 



0.2 



0.3 



0.4 



0.5 



0.6 



0.7 



0.8 



k:0 
1 
2 
3 
4 
5 
6 
7 
8 
9 
10 
11 
12 
13 
14 
15 
16 
17 
18 
19 
20 



1216 
3917 
6769 
8670 
9568 
9887 
9976 
9996 
9999 
0000 



0.0115 
0.0692 
0.2061 
0.4114 
0.6296 
0.8042 
0.9133 
0.9679 
0.9900 
0.9974 
0.9994 
0.9999 
1.0000 



0.0008 
0.0076 
0.0355 
0.1071 
0.2375 
0.4164 
0.6080 
0.7723 
0.8867 
0.9520 
0.9829 
0.9949 
0.9987 
0.9997 
1.0000 



0.0000 
0.0005 
0.0036 
0.0160 
0.0510 
0.1256 
0.2500 
0.4159 
0.5956 
0.7553 
0.8725 
0.9435 
0.9790 
0.9935 
0.9984 
0.9997 
1.0000 



0.0000 
0.0002 
0.0013 
0.0059 
0.0207 
0.0577 
0.1316 
0.2517 
0.41 19 
0.5881 
0.7483 
0.8684 
0.9423 
0.9793 
0.9941 
0.9987 
0.9998 
1.0000 



0.0000 
0.0003 
0.0016 
0.0065 
0.0210 
0.0565 
0.1275 
0.2447 
0.4044 
0.5841 
0.7500 
0.8744 
0.9490 
0.9840 
0.9964 
0.9995 
1.0000 



0.0000 
0.0003 
0.0013 
0.0051 
0.0171 
0.0480 
0.1 133 
0.2277 
0.3920 
0.5836 
0.7625 
0.8929 
0.9645 
0.9924 
0.9992 
1.0000 



0.0000 
0.0001 
0.0006 
0.0026 
0.0100 
0.0321 
0.0867 
0.1958 
0.3704 
0.5886 
0.7939 
0.9308 
0.9885 
1.0000 



0.9 



0.0000 
0.0001 
0.0004 
0.0024 
0.0113 
0.0432 
0.1330 
0.3231 
0.6083 
0.8784 
1.0000 



n = 25 



tt: 0.1 


0.2 


0.3 


0.4 


0.5 


0.6 


0.7 


0.8 


0.9 


k:Q 


0.0718 


0.0038 


0.0001 


0.0000 












1 


0.2712 


0.0274 


0.0016 


0.0001 












2 


0.5371 


0.0982 


0.0090 


0.0004 


0.0000 










3 


0.7636 


0.2340 


0.0332 


0.0024 


0.0001 










4 


0.9020 


0.4207 


0.0905 


0.0095 


0.0005 


0.0000 








5 


0.9666 


0.6167 


0.1935 


0.0294 


0.0020 


0.0001 








6 


0.9905 


0.7800 


0.3407 


0.0736 


0.0073 


0.0003 








7 


0.9977 


0.8909 


0.5118 


0.1536 


0.0216 


0.0012 


0.0000 






8 


0.9995 


0.9532 


0.6769 


0.2735 


0.0539 


0.0043 


0.0001 






9 


0.9999 


0.9827 


0.8106 


0.4246 


0.1148 


0.0132 


0.0005 






10 


1.0000 


0.9944 


0.9022 


0.5858 


0.2122 


0.0344 


0.0018 


0.0000 




11 




0.9985 


0.9558 


0.7323 


0.3450 


0.0778 


0.0060 


0.0001 




12 




0.9996 


0.9825 


0.8462 


0.5000 


0.1538 


0.0175 


0.0004 




13 




0.9999 


0.9940 


0.9222 


0.6550 


0.2677 


0.0442 


0.0015 




14 




1.0000 


0.9982 


0.9656 


0.7878 


0.4142 


0.0978 


0.0056 


0.0000 


15 






0.9995 


0.9868 


0.8852 


0.5754 


0.1894 


0.0173 


0.0001 


16 






0.9999 


0.9957 


0.9461 


0.7265 


0.3231 


0.0468 


0.0005 


17 






1.0000 


0.9988 


0.9784 


0.8464 


0.4882 


0.1091 


0.0023 


18 








0.9997 


0.9927 


0.9264 


0.6593 


0.2200 


0.0095 


19 








0.9999 


0.9980 


0.9706 


0.8065 


0.3833 


0.0334 


20 








1.0000 


0.9995 


0.9905 


0.9095 


0.5793 


0.0980 


21 










0.9999 


0.9976 


0.9668 


0.7660 


0.2364 


22 










1 .0000 


0.9996 


0.9910 


0.9018 


0.4629 


23 












0.9999 


0.9984 


0.9726 


0.7288 


24 












1 .0000 


0.9999 


0.9962 


0.9282 


25 














1.0000 


1.0000 


1.0000 
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\: 0.1 



0.2 



0.3 



0.4 



0.5 



0.6 



0.7 



0.8 



0.9 



1.0 



k:0 
1 
2 
3 
4 
5 
6 
7 
8 



0.9048 
0.0905 
0.0045 
0.0002 
0.0000 



0.8187 
0.1637 
0.0164 
0.0011 
0.0001 
0.0000 



0.7408 
0.2222 
0.0333 
0.0033 
0.0003 
0.0000 



0.6703 
0.2681 
0.0536 
0.0072 
0.0007 
0.0001 
0.0000 



0.6065 0.5488 

0.3033 0.3293 

0.0758 0.0988 

0.0126 0.0198 

0.0016 0.0030 

0.0002 0.0004 

0.0000 0.0000 



0.4966 
0.3476 
0.1217 
0.0284 
0.0050 
0.0007 
0.0001 
0.0000 



0.4493 
0.3595 
0.1438 
0.0383 
0.0077 
0.0012 
0.0002 
0.0000 



0.4066 
0.3659 
0.1647 
0.0494 
0.0111 
0.0020 
0.0003 
0.0000 



0.3679 
0.3679 
0.1839 
0.0613 
0.0153 
0.0031 
0.0005 
0.0001 
0.0000 



TABLE A. 3 



Poisson Distribution, 
Individual Probabilities 
for x = number of 
occurrences, prob(x = k) 



\: 1.1 



1.2 



1.3 



1.4 



1.5 



1.6 



1.7 



1.8 



1.9 



2.0 



k:0 
1 
2 
3 
4 
5 
6 
7 
8 
9 
10 



0.3329 
0.3662 
0.2014 
0.0738 
0.0203 
0.0045 
0.0008 
0.0001 
0.0000 



0.3012 
0.3614 
0.2169 
0.0867 
0.0260 
0.0062 
0.0012 
0.0002 
0.0000 



0.2725 
0.3543 
0.2303 
0.0998 
0.0324 
0.0084 
0.0018 
0.0003 
0.0001 
0.0000 



0.2466 
0.3452 
0.2417 
0.1 128 
0.0395 
0.0111 
0.0026 
0.0005 
0.0001 
0.0000 



0.2231 
0.3347 
0.2510 
0.1255 
0.0471 
0.0141 
0.0035 
0.0008 
0.0001 
0.0000 



0.2019 
0.3230 
0.2584 
0.1378 
0.0551 
0.0176 
0.0047 
0.0011 
0.0002 
0.0000 



0.1827 
0.3106 
0.2640 
0.1496 
0.0636 
0.0216 
0.0061 
0.0015 
0.0003 
0.0001 
0.0000 



0.1653 
0.2975 
0.2678 
0.1607 
0.0723 
0.0260 
0.0078 
0.0020 
0.0005 
0.0001 
0.0000 



0.1496 
0.2842 
0.2700 
0.1710 
0.0812 
0.0309 
0.0098 
0.0027 
0.0006 
0.0001 
0.0000 



0.1353 
0.2707 
0.2707 
0.1804 
0.0902 
0.0361 
0.0120 
0.0034 
0.0009 
0.0002 
0.0000 



\: 2.1 



2.2 



2.3 



2.4 



2.5 



2.6 



2.7 



2.8 



2.9 



3.0 



k:0 0.1225 
1 0.2572 
0.2700 
0.1890 
0.0992 
0.0417 
0.0146 
0.0044 



2 
3 
4 
5 
6 
7 



8 0.0011 

9 0.0003 
10 0.0001 



11 
12 
13 



0.0000 



0.1108 
0.2438 
0.2681 
0.1966 
0.1082 
0.0476 
0.0174 
0.0055 
0.0015 
0.0004 
0.0001 
0.0000 



0.1003 
0.2306 
0.2652 
0.2033 
0.1169 
0.0538 
0.0206 
0.0068 
0.0019 
0.0005 
0.0001 
0.0000 



0.0907 
0.2177 
0.2613 
0.2090 
0.1254 
0.0602 
0.0241 
0.0083 
0.0025 
0.0007 
0.0002 
0.0000 



0.0821 
0.2052 
0.2565 
0.2138 
0.1336 
0.0668 
0.0278 
0.0099 
0.0031 
0.0009 
0.0002 
0.0000 



0.0743 
0.1931 
0.2510 
0.2176 
0.1414 
0.0735 
0.0319 
0.01 18 
0.0038 
0.0011 
0.0003 
0.0001 
0.0000 



0.0672 
0.1815 
0.2450 
0.2205 
0.1488 
0.0804 
0.0362 
0.0139 
0.0047 
0.0014 
0.0004 
0.0001 
0.0000 



0.0608 
0.1703 
0.2384 
0.2225 
0.1557 
0.0872 
0.0407 
0.0163 
0.0057 
0.0018 
0.0005 
0.0001 
0.0000 



0.0550 
0.1596 
0.2314 
0.2237 
0.1622 
0.0940 
0.0455 
0.0188 
0.0068 
0.0022 
0.0006 
0.0002 
0.0000 



0.0498 
0.1494 
0.2240 
0.2240 
0.1680 
0.1008 
0.0504 
0.0216 
0.0081 
0.0027 
0.0008 
0.0002 
0.0001 
0.0000 



Source: Probabilities generated by Minitab, then compiled as shown here. 
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TABLE A.3 



(continued) 



\: 3.1 



3.2 



3.3 



3.4 



3.5 



3.6 



3.7 



3.8 



3.9 



4.0 



k:0 0 



8 0 

9 0 

10 0 

11 0 

12 0 

13 0 
14 
15 



0450 
1397 
2165 
2237 
1733 
1075 
0555 
0246 
0095 
0033 
0010 
0003 
0001 
0000 



\: 4.1 



0.0408 
0.1304 
0.2087 
0.2226 
0.1781 
0.1140 
0.0608 
0.0278 
0.0111 
0.0040 
0.0013 
0.0004 
0.0001 
0.0000 



4.2 



0.0369 
0.1217 
0.2008 
0.2209 
0.1823 
0.1203 
0.0662 
0.0312 
0.0129 
0.0047 
0.0016 
0.0005 
0.0001 
0.0000 



4.3 



0.0334 
0.1 135 
0.1929 
0.2186 
0.1858 
0.1264 
0.0716 
0.0348 
0.0148 
0.0056 
0.0019 
0.0006 
0.0002 
0.0000 



4.4 



0.0302 
0.1057 
0.1850 
0.2158 
0.1888 
0.1322 
0.0771 
0.0385 
0.0169 
0.0066 
0.0023 
0.0007 
0.0002 
0.0001 
0.0000 

4.5 



0.0273 
0.0984 
0.1771 
0.2125 
0.1912 
0.1377 
0.0826 
0.0425 
0.0191 
0.0076 
0.0028 
0.0009 
0.0003 
0.0001 
0.0000 

4.6 



0.0247 
0.0915 
0.1692 
0.2087 
0.1931 
0.1429 
0.0881 
0.0466 
0.0215 
0.0089 
0.0033 
0.0011 
0.0003 
0.0001 
0.0000 

4.7 



0.0224 
0.0850 
0.1615 
0.2046 
0.1944 
0.1477 
0.0936 
0.0508 
0.0241 
0.0102 
0.0039 
0.0013 
0.0004 
0.0001 
0.0000 

4.8 



0.0202 
0.0789 
0.1539 
0.2001 
0.1951 
0.1522 
0.0989 
0.0551 
0.0269 
0.0116 
0.0045 
0.0016 
0.0005 
0.0002 
0.0000 

4.9 



0.0183 
0.0733 
0.1465 
0.1954 
0.1954 
0.1563 
0.1042 
0.0595 
0.0298 
0.0132 
0.0053 
0.0019 
0.0006 
0.0002 
0.0001 
0.0000 

5.0 



k:0 0 



8 0 

9 0 

10 0 

11 0 

12 0 

13 0 

14 0 

15 0 
16 



0166 
0679 
1393 
1904 
1951 
1600 
1093 
0640 
0328 
0150 
0061 
0023 
0008 
0002 
0001 
0000 



\: 5.1 



0.0150 
0.0630 
0.1323 
0.1852 
0.1944 
0.1633 
0.1143 
0.0686 
0.0360 
0.0168 
0.0071 
0.0027 
0.0009 
0.0003 
0.0001 
0.0000 

5.2 



0.0136 
0.0583 
0.1254 
0.1798 
1933 
1662 
1191 
0.0732 
0.0393 
0.0188 
0.0081 
0.0032 
0.001 1 
0.0004 
0.0001 
0.0000 

5.3 



0.0123 
0.0540 
0.1 188 
743 
917 
687 
237 
0.0778 
0.0428 
0.0209 
0.0092 
0.0037 
0.0013 
0.0005 
0.0001 
0.0000 

5.4 



0.0111 
0.0500 
0.1125 
0.1687 
0.1898 
0.1708 
0.1281 
0.0824 
0.0463 
0.0232 
0.0104 
0.0043 
0.0016 
0.0006 
0.0002 
0.0001 
0.0000 

5.5 



0.0101 
0.0462 
0.1063 
0.1631 
0.1875 
0.1725 
0.1323 
0.0869 
0.0500 
0.0255 
0.01 18 
0.0049 
0.0019 
0.0007 
0.0002 
0.0001 
0.0000 

5.6 



0.0091 
0.0427 
0.1005 
0.1574 
0.1849 
0.1738 
0.1362 
0.0914 
0.0537 
0.0281 
0.0132 
0.0056 
0.0022 
0.0008 
0.0003 
0.0001 
0.0000 

5.7 



0.0082 
0.0395 
0.0948 
0.1517 
0.1820 
0.1747 
0.1398 
0.0959 
0.0575 
0.0307 
0.0147 
0.0064 
0.0026 
0.0009 
0.0003 
0.0001 
0.0000 

5.8 



0.0074 
0.0365 
0.0894 
0.1460 
0.1789 
0.1753 
0.1432 
0.1002 
0.0614 
0.0334 
0.0164 
0.0073 
0.0030 
0.001 1 
0.0004 
0.0001 
0.0000 

5.9 



0.0067 
0.0337 
0.0842 
0.1404 
0.1755 
0.1755 
0.1462 
0.1044 
0.0653 
0.0363 
0.0181 
0.0082 
0.0034 
0.0013 
0.0005 
0.0002 
0.0000 

6.0 



k:0 0 



8 0 

9 0 

10 0 

11 0 

12 0 

13 0 

14 0 

15 0 

16 0 

17 0 
18 



0061 
031 1 
0793 
1348 
1719 
1753 
1490 
1086 
0692 
0392 
0200 
0093 
0039 
0015 
0006 
0002 
0001 
0000 



0.0055 
0.0287 
0.0746 
0.1293 
0.1681 
0.1748 
0.1515 
0.1125 
0.0731 
0.0423 
0.0220 
0.0104 
0.0045 
0.0018 
0.0007 
0.0002 
0.0001 
0.0000 



0.0050 
0.0265 
0.0701 
0.1239 
0.1641 
0.1740 
0.1537 
0.1163 
0.0771 
0.0454 
0.0241 
0.0116 
0.0051 
0.0021 
0.0008 
0.0003 
0.0001 
0.0000 



0.0045 
0.0244 
0.0659 



185 
600 
728 
555 
200 
0.0810 
0.0486 
0.0262 
0.0129 
0.0058 
0.0024 
0.0009 
0.0003 
0.0001 
0.0000 



0.0041 
0.0225 
0.0618 
0.1133 
0.1558 
0.1714 
0.1571 
0.1234 
0.0849 
0.0519 
0.0285 
0.0143 
0.0065 
0.0028 
0.001 1 
0.0004 
0.0001 
0.0000 



0.0037 
0.0207 
0.0580 
0.1082 
0.1515 
0.1697 
0.1584 
0.1267 
0.0887 
0.0552 
0.0309 
0.0157 
0.0073 
0.0032 
0.0013 
0.0005 
0.0002 
0.0001 
0.0000 



0.0033 
0.0191 
0.0544 
0.1033 
0.1472 
0.1678 
0.1594 
0.1298 
0.0925 
0.0586 
0.0334 
0.0173 
0.0082 
0.0036 
0.0015 
0.0006 
0.0002 
0.0001 
0.0000 



0.0030 
0.0176 
0.0509 
0.0985 
0.1428 
0.1656 
0.1601 
0.1326 
0.0962 
0.0620 
0.0359 
0.0190 
0.0092 
0.0041 
0.0017 
0.0007 
0.0002 
0.0001 
0.0000 



0.0027 
0.0162 
0.0477 
0.0938 
0.1383 
0.1632 
0.1605 
0.1353 
0.0998 
0.0654 
0.0386 
0.0207 
0.0102 
0.0046 
0.0019 
0.0008 
0.0003 
0.0001 
0.0000 



0.0025 
0.0149 
0.0446 
0.0892 
0.1339 
0.1606 
0.1606 
0.1377 
0.1033 
0.0688 
0.0413 
0.0225 
0.01 13 
0.0052 
0.0022 
0.0009 
0.0003 
0.0001 
0.0000 
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\: 6.1 



6.2 



6.3 



6.4 



6.5 



6.6 



6.7 



6.8 



6.9 



7.0 



TABLE A. 3 



(continued) 



k:0 0 



8 0 

9 0 

10 0 

11 0 

12 0 

13 0 

14 0 

15 0 

16 0 

17 0 

18 0 
19 
20 



0022 
0137 
0417 
0848 
1294 
1579 
1605 
1399 
1066 
0723 
0441 
0244 
0124 
0058 
0025 
0010 
0004 
0001 
0000 



0.0020 
0.0126 
0.0390 
0.0806 
0.1249 
0.1549 
0.1601 
0.1418 
0.1099 
0.0757 
0.0469 
0.0265 
0.0137 
0.0065 
0.0029 
0.0012 
0.0005 
0.0002 
0.0001 
0.0000 



0.0018 
0.0116 
0.0364 
0.0765 
0.1205 
0.1519 
1595 
1435 
1130 
0.0791 
0.0498 
0.0285 
0.0150 
0.0073 
0.0033 
0.0014 
0.0005 
0.0002 
0.0001 
0.0000 



0.0017 
0.0106 
0.0340 
0.0726 
0.1 162 
0.1487 
0.1586 
0.1450 
0.1 160 
0.0825 
0.0528 
0.0307 
0.0164 
0.0081 
0.0037 
0.0016 
0.0006 
0.0002 
0.0001 
0.0000 



0.0015 
0.0098 
0.0318 
0.0688 
0.1118 
0.1454 
0.1575 
0.1462 
0.1188 
0.0858 
0.0558 
0.0330 
0.0179 
0.0089 
0.0041 
0.0018 
0.0007 
0.0003 
0.0001 
0.0000 



0.0014 
0.0090 
0.0296 
0.0652 
0.1076 
0.1420 
0.1562 
0.1472 
0.1215 
0.0891 
0.0588 
0.0353 
0.0194 
0.0099 
0.0046 
0.0020 
0.0008 
0.0003 
0.0001 
0.0000 



0.0012 
0.0082 
0.0276 
0.0617 
0.1034 
0.1385 
0.1546 
0.1480 
0.1240 
0.0923 
0.0618 
0.0377 
0.0210 
0.0108 
0.0052 
0.0023 
0.0010 
0.0004 
0.0001 
0.0001 
0.0000 



0.0011 
0.0076 
0.0258 
0.0584 
0.0992 
0.1349 
0.1529 
0.1486 
0.1263 
0.0954 
0.0649 
0.0401 
0.0227 
0.0119 
0.0058 
0.0026 
0.0011 
0.0004 
0.0002 
0.0001 
0.0000 



0.0010 
0.0070 
0.0240 
0.0552 
0.0952 
0.1314 
0.151 1 
0.1489 
0.1284 
0.0985 
0.0679 
0.0426 
0.0245 
0.0130 
0.0064 
0.0029 
0.0013 
0.0005 
0.0002 
0.0001 
0.0000 



0.0009 
0.0064 
0.0223 
0.0521 
0.0912 
0.1277 
0.1490 
0.1490 
0.1304 
0.1014 
0.0710 
0.0452 
0.0263 
0.0142 
0.0071 
0.0033 
0.0014 
0.0006 
0.0002 
0.0001 
0.0000 



\: 7.1 



7.2 



7.3 



7.4 



7.5 



7.6 



7.7 



7.8 



7.9 



8.0 



k:0 0 



8 0 

9 0 

10 0 

11 0 

12 0 

13 0 

14 0 

15 0 

16 0 

17 0 

18 0 

19 0 

20 0 
21 
22 



0008 
0059 
0208 
0492 
0874 
1241 
1468 
1489 
1321 
1042 
0740 
0478 
0283 
0154 
0078 
0037 
0016 
0007 
0003 
0001 
0000 



0.0007 
0.0054 
0.0194 
0.0464 
0.0836 
0.1204 
0.1445 
0.1486 
0.1337 
0.1070 
0.0770 
0.0504 
0.0303 
0.0168 
0.0086 
0.0041 
0.0019 
0.0008 
0.0003 
0.0001 
0.0000 



0.0007 
0.0049 
0.0180 
0.0438 
0.0799 
0.1167 
0.1420 
1481 
1351 
1096 
0.0800 
0.0531 
0.0323 
0.0181 
0.0095 
0.0046 
0.0021 
0.0009 
0.0004 
0.0001 
0.0001 
0.0000 



0.0006 
0.0045 
0.0167 
0.0413 
0.0764 
0.1 130 
0.1394 
0.1474 
0.1363 
0.1 121 
0.0829 
0.0558 
0.0344 
0.0196 
0.0104 
0.0051 
0.0024 
0.0010 
0.0004 
0.0002 
0.0001 
0.0000 



0.0006 
0.0041 
0.0156 
0.0389 
0.0729 
0.1094 
0.1367 
0.1465 
0.1373 
0.1144 
0.0858 
0.0585 
0.0366 
0.021 1 
0.0113 
0.0057 
0.0026 
0.0012 
0.0005 
0.0002 
0.0001 
0.0000 



0.0005 
0.0038 
0.0145 
0.0366 
0.0696 
0.1057 
0.1339 
0.1454 
0.1381 
0.1167 
0.0887 
0.0613 
0.0388 
0.0227 
0.0123 
0.0062 
0.0030 
0.0013 
0.0006 
0.0002 
0.0001 
0.0000 



0.0005 
0.0035 
0.0134 
0.0345 
0.0663 
0.1021 
0.1311 
0.1442 
0.1388 
0.1 187 
0.0914 
0.0640 
0.0411 
0.0243 
0.0134 
0.0069 
0.0033 
0.0015 
0.0006 
0.0003 
0.0001 
0.0000 



0.0004 
0.0032 
0.0125 
0.0324 
0.0632 
0.0986 
0.1282 
0.1428 
0.1392 
0.1207 
0.0941 
0.0667 
0.0434 
0.0260 
0.0145 
0.0075 
0.0037 
0.0017 
0.0007 
0.0003 
0.0001 
0.0000 



0.0004 
0.0029 
0.0116 
0.0305 
0.0602 
0.0951 
0.1252 
0.1413 
0.1395 
0.1224 
0.0967 
0.0695 
0.0457 
0.0278 
0.0157 
0.0083 
0.0041 
0.0019 
0.0008 
0.0003 
0.0001 
0.0001 
0.0000 



0.0003 
0.0027 
0.0107 
0.0286 
0.0573 
0.0916 
0.1221 
0.1396 
0.1396 
0.1241 
0.0993 
0.0722 
0.0481 
0.0296 
0.0169 
0.0090 
0.0045 
0.0021 
0.0009 
0.0004 
0.0002 
0.0001 
0.0000 
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TABLE A.3 



(continued) 



\: 8.1 



8.2 



8.3 



8.4 



8.5 



8.6 



8.7 



8.8 



8.9 



9.0 



k:0 0 



8 0 

9 0 

10 0 

11 0 

12 0 

13 0 

14 0 

15 0 

16 0 

17 0 

18 0 

19 0 

20 0 

21 0 

22 0 
23 



0003 
0025 
0100 
0269 
0544 
0882 
1191 
1378 
1395 
1256 
1017 
0749 
0505 
0315 
0182 
0098 
0050 
0024 
001 1 
0005 
0002 
0001 
0000 



0.0003 
0.0023 
0.0092 
0.0252 
0.0517 
0.0849 
0.1160 
0.1358 
0.1392 
0.1269 
0.1040 
0.0776 
0.0530 
0.0334 
0.0196 
0.0107 
0.0055 
0.0026 
0.0012 
0.0005 
0.0002 
0.0001 
0.0000 



0.0002 
0.0021 
0.0086 
0.0237 
0.0491 
0.0816 
0.1128 
0.1338 
0.1388 
0.1280 
0.1063 
0.0802 
0.0555 
0.0354 
0.0210 
0.0116 
0.0060 
0.0029 
0.0014 
0.0006 
0.0002 
0.0001 
0.0000 



0.0002 
0.0019 
0.0079 
0.0222 
0.0466 
0.0784 
0.1097 
0.1317 
0.1382 
0.1290 
0.1084 
0.0828 
0.0579 
0.0374 
0.0225 
0.0126 
0.0066 
0.0033 
0.0015 
0.0007 
0.0003 
0.0001 
0.0000 



0.0002 
0.0017 
0.0074 
0.0208 
0.0443 
0.0752 
0.1066 
0.1294 
0.1375 
0.1299 
0.1104 
0.0853 
0.0604 
0.0395 
0.0240 
0.0136 
0.0072 
0.0036 
0.0017 
0.0008 
0.0003 
0.0001 
0.0001 
0.0000 



0.0002 
0.0016 
0.0068 
0.0195 
0.0420 
0.0722 
0.1034 
0.1271 
0.1366 
0.1306 
0.1123 
0.0878 
0.0629 
0.0416 
0.0256 
0.0147 
0.0079 
0.0040 
0.0019 
0.0009 
0.0004 
0.0002 
0.0001 
0.0000 



0.0002 
0.0014 
0.0063 
0.0183 
0.0398 
0.0692 



003 
247 
356 
311 
140 
0.0902 
0.0654 
0.0438 
0.0272 
0.0158 
0.0086 
0.0044 
0.0021 
0.0010 
0.0004 
0.0002 
0.0001 
0.0000 



0.0002 
0.0013 
0.0058 
0.0171 
0.0377 
0.0663 
0.0972 
0.1222 
0.1344 
0.1315 
0.1 157 
0.0925 
0.0679 
0.0459 
0.0289 
0.0169 
0.0093 
0.0048 
0.0024 
0.0011 
0.0005 
0.0002 
0.0001 
0.0000 



0.0001 
0.0012 
0.0054 
0.0160 
0.0357 
0.0635 
0.0941 
0.1197 
0.1332 
0.1317 
0.1172 
0.0948 
0.0703 
0.0481 
0.0306 
0.0182 
0.0101 
0.0053 
0.0026 
0.0012 
0.0005 
0.0002 
0.0001 
0.0000 



0.0001 
0.0011 
0.0050 
0.0150 
0.0337 
0.0607 
0.0911 
0.1171 
0.1318 
0.1318 
0.1186 
0.0970 
0.0728 
0.0504 
0.0324 
0.0194 
0.0109 
0.0058 
0.0029 
0.0014 
0.0006 
0.0003 
0.0001 
0.0000 



\: 9.1 



9.2 



9.3 



9.4 



9.5 



9.6 



9.7 



9.8 



9.9 



10.0 



k:0 0.0001 
'.0010 
.0046 
.0140 
.0319 
.0581 



8 0 

9 0 

10 0 

11 0 

12 0 

13 0 

14 0 

15 0 

16 0 

17 0 

18 0 

19 0 

20 0 

21 0 

22 0 

23 0 
24 
25 



1145 
1302 
1317 
1198 
0991 
0752 
0526 
0342 
0208 
0118 
0063 
0032 
0015 
0007 
0003 
0001 
0000 



0.0001 
0.0009 
0.0043 
0.0131 
0.0302 
0.0555 
0.0851 
0.1118 
0.1286 
0.1315 
0.1210 
0.1012 
0.0776 
0.0549 
0.0361 
0.0221 
0.0127 
0.0069 
0.0035 
0.0017 
0.0008 
0.0003 
0.0001 
0.0001 
0.0000 



0.0001 
0.0009 
0.0040 
0.0123 
0.0285 
0.0530 
0.0822 
0.1091 
0.1269 
0.1311 
0.1219 
0.1031 
0.0799 
0.0572 
0.0380 
0.0235 
0.0137 
0.0075 
0.0039 
0.0019 
0.0009 
0.0004 
0.0002 
0.0001 
0.0000 



0.0001 
0.0008 
0.0037 
0.0115 
0.0269 
0.0506 
0.0793 
0.1064 
0.1251 
0.1306 
0.1228 
0.1049 
0.0822 
0.0594 
0.0399 
0.0250 
0.0147 
0.0081 
0.0042 
0.0021 
0.0010 
0.0004 
0.0002 
0.0001 
0.0000 



0.0001 
0.0007 
0.0034 
0.0107 
0.0254 
0.0483 
0.0764 
0.1037 
0.1232 
0.1300 
0.1235 
0.1067 
0.0844 
0.0617 
0.0419 
0.0265 
0.0157 
0.0088 
0.0046 
0.0023 
0.001 1 
0.0005 
0.0002 
0.0001 
0.0000 



0.0001 
0.0007 
0.0031 
0.0100 
0.0240 
0.0460 
0.0736 
0.1010 
0.1212 
0.1293 
0.1241 
0.1083 
0.0866 
0.0640 
0.0439 
0.0281 
0.0168 
0.0095 
0.0051 
0.0026 
0.0012 
0.0006 
0.0002 
0.0001 
0.0000 



0.0001 
0.0006 
0.0029 
0.0093 
0.0226 
0.0439 
0.0709 
0.0982 



1 191 
1284 
1245 
1098 



0. 
0. 
0. 

0. 
0. 

0.0662 
0.0459 
0.0297 
0.0180 
0.0103 
0.0055 
0.0028 
0.0014 
0.0006 
0.0003 
0.0001 
0.0000 



0.0001 
0.0005 
0.0027 
0.0087 
0.0213 
0.0418 
0.0682 
0.0955 
0.1170 
0.1274 
0.1249 
0.1112 
0.0908 
0.0685 
0.0479 
0.0313 
0.0192 
0.0111 
0.0060 
0.0031 
0.0015 
0.0007 
0.0003 
0.0001 
0.0001 
0.0000 



0.0001 
0.0005 
0.0025 
0.0081 
0.0201 
0.0398 
0.0656 
0.0928 
0.1148 
0.1263 
0.1250 
0.1125 
0.0928 
0.0707 
0.0500 
0.0330 
0.0204 
0.0119 
0.0065 
0.0034 
0.0017 
0.0008 
0.0004 
0.0002 
0.0001 
0.0000 



0.0000 
0.0005 
0.0023 
0.0076 
0.0189 
0.0378 
0.0631 
0.0901 
0.1126 
0.1251 
0.1251 
0.1137 
0.0948 
0.0729 
0.0521 
0.0347 
0.0217 
0.0128 
0.0071 
0.0037 
0.0019 
0.0009 
0.0004 
0.0002 
0.0001 
0.0000 



Appendix A: Statistical Tables 



817 



\: 0.1 


0.2 


0.3 


0.4 


0.5 


0.6 


0.7 


0.8 


0.9 


1.0 


k: 0 


0.9048 


0.8187 


0.7408 


0.6703 


0.6065 


0.5488 


0.4966 


0.4493 


0.4066 


0.3679 


1 


0.9953 


0.9825 


0.9631 


0.9384 


0.9098 


0.8781 


0.8442 


0.8088 


0.7725 


0.7358 


2 


0.9998 


0.9989 


0.9964 


0.9921 


0.9856 


0.9769 


0.9659 


0.9526 


0.9371 


0.9197 


3 


1.0000 


0.9999 


0.9997 


0.9992 


0.9982 


0.9966 


0.9942 


0.9909 


0.9865 


0.9810 


4 




1 .0000 


1 .0000 


0.9999 


0.9998 


0.9996 


0.9992 


0.9986 


0.9977 


0.9963 


5 








1.0000 


1 .0000 


1 .0000 


0.9999 


0.9998 


0.9997 


0.9994 


6 














1 .0000 


1.0000 


1 .0000 


0.9999 


7 




















1.0000 



TABLE A.4 



Poisson Distribution, 
Cumulative Probabilities 
for x = number of 
occurrences, prob(x s k) 



\: 1.1 



1.2 



1.3 



1.4 



1.5 



1.6 



1.7 



1.8 



1.9 



2.0 



k: 0 


0.3329 


0.3012 


0.2725 


0.2466 


0.2231 


0.2019 


0.1827 


0.1653 


0.1496 


0.1353 


1 


0.6990 


0.6626 


0.6268 


0.5918 


0.5578 


0.5249 


0.4932 


0.4628 


0.4337 


0.4060 


2 


0.9004 


0.8795 


0.8571 


0.8335 


0.8088 


0.7834 


0.7572 


0.7306 


0.7037 


0.6767 


3 


0.9743 


0.9662 


0.9569 


0.9463 


0.9344 


0.9212 


0.9068 


0.8913 


0.8747 


0.8571 


4 


0.9946 


0.9923 


0.9893 


0.9857 


0.9814 


0.9763 


0.9704 


0.9636 


0.9559 


0.9473 


5 


0.9990 


0.9985 


0.9978 


0.9968 


0.9955 


0.9940 


0.9920 


0.9896 


0.9868 


0.9834 


6 


0.9999 


0.9997 


0.9996 


0.9994 


0.9991 


0.9987 


0.9981 


0.9974 


0.9966 


0.9955 


7 


1.0000 


1.0000 


0.9999 


0.9999 


0.9998 


0.9997 


0.9996 


0.9994 


0.9992 


0.9989 


8 






1 .0000 


1.0000 


1.0000 


1 .0000 


0.9999 


0.9999 


0.9998 


0.9998 


9 














1 .0000 


1.0000 


1 .0000 


1.0000 


\: 2.1 


2.2 


2.3 


2.4 


2.5 


2.6 


2.7 


2.8 


2.9 


3.0 



k:0 0.1225 



0.3796 
0.6496 
0.8386 
0.9379 
0.9796 
0.9941 
0.9985 

8 0.9997 

9 0.9999 
10 
11 
12 



1 

2 
3 
4 
5 
6 
7 



1 .0000 



0.1108 
0.3546 
0.6227 
0.8194 
0.9275 
0.9751 
0.9925 
0.9980 
0.9995 
0.9999 
1.0000 



0.1003 
0.3309 
0.5960 
0.7993 
0.9162 
0.9700 
0.9906 
0.9974 
0.9994 
0.9999 
1.0000 



0.0907 
0.3084 
0.5697 
0.7787 
0.9041 
0.9643 
0.9884 
0.9967 
0.9991 
0.9998 
1.0000 



0.0821 
0.2873 
0.5438 
0.7576 
0.8912 
0.9580 
0.9858 
0.9958 
0.9989 
0.9997 
0.9999 
1.0000 



0.0743 
0.2674 
0.5184 
0.7360 
0.8774 
0.9510 
0.9828 
0.9947 
0.9985 
0.9996 
0.9999 
1.0000 



0.0672 
0.2487 
0.4936 
0.7141 
0.8629 
0.9433 
0.9794 
0.9934 
0.9981 
0.9995 
0.9999 
1.0000 



0.0608 
0.2311 
0.4695 
0.6919 
0.8477 
0.9349 
0.9756 
0.9919 
0.9976 
0.9993 
0.9998 
1.0000 



0.0550 
0.2146 
0.4460 
0.6696 
0.8318 
0.9258 
0.9713 
0.9901 
0.9969 
0.9991 
0.9998 
0.9999 
1.0000 



0.0498 
0.1991 
0.4232 
0.6472 
0.8153 
0.9161 
0.9665 
0.9881 
0.9962 
0.9989 
0.9997 
0.9999 
1.0000 



\: 3.1 



3.2 



3.3 



3.4 



3.5 



3.6 



3.7 



3.8 



3.9 



4.0 



k: 0 
1 
2 
3 
4 
5 
6 
7 
8 
9 
10 
11 
12 
13 
14 



0.0450 
0.1847 
0.4012 
0.6248 
0.7982 
0.9057 
0.9612 
0.9858 
0.9953 
0.9986 
0.9996 
0.9999 
1.0000 



0.0408 
0.1712 
0.3799 
0.6025 
0.7806 
0.8946 
0.9554 
0.9832 
0.9943 
0.9982 
0.9995 
0.9999 
1.0000 



0.0369 
0.1586 
0.3594 
0.5803 
0.7626 
0.8829 
0.9490 
0.9802 
0.9931 
0.9978 
0.9994 
0.9998 
1.0000 



0.0334 
0.1468 
0.3397 
0.5584 
0.7442 
0.8705 
0.9421 
0.9769 
0.9917 
0.9973 
0.9992 
0.9998 
0.9999 
1.0000 



0.0302 
0.1359 
0.3208 
0.5366 
0.7254 
0.8576 
0.9347 
0.9733 
0.9901 
0.9967 
0.9990 
0.9997 
0.9999 
1.0000 



0.0273 
0.1257 
0.3027 
0.5152 
0.7064 
0.8441 
0.9267 
0.9692 
0.9883 
0.9960 
0.9987 
0.9996 
0.9999 
1.0000 



0.0247 
0.1 162 
0.2854 
0.4942 
0.6872 
0.8301 
0.9182 
0.9648 
0.9863 
0.9952 
0.9984 
0.9995 
0.9999 
1.0000 



0.0224 
0.1074 
0.2689 
0.4735 
0.6678 
0.8156 
0.9091 
0.9599 
0.9840 
0.9942 
0.9981 
0.9994 
0.9998 
1.0000 



0.0202 
0.0992 
0.2531 
0.4532 
0.6484 
0.8006 
0.8995 
0.9546 
0.9815 
0.9931 
0.9977 
0.9993 
0.9998 
0.9999 
1.0000 



0.0183 
0.0916 
0.2381 
0.4335 
0.6288 
0.7851 
0.8893 
0.9489 
0.9786 
0.9919 
0.9972 
0.9991 
0.9997 
0.9999 
1.0000 



Source: Probabilities generated by Minitab, then compiled as shown here. 
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TABLE A.4 



(continued) 



\: 4.1 



4.2 



4.3 



4.4 



4.5 



4.6 



4.7 



4.8 



4.9 



5.0 



k:0 0 



8 0 

9 0 

10 0 

11 0 

12 0 

13 0 

14 1 
15 
16 



0166 
0845 
2238 
4142 
6093 
7693 
8786 
9427 
9755 
9905 
9966 
9989 
9997 
9999 
0000 



0.0150 
0.0780 
0.2102 
0.3954 
0.5898 
0.7531 
0.8675 
0.9361 
0.9721 
0.9889 
0.9959 
0.9986 
0.9996 
0.9999 
1.0000 



0.0136 
0.0719 
0.1974 
0.3772 
0.5704 
0.7367 
0.8558 
0.9290 
0.9683 
0.9871 
0.9952 
0.9983 
0.9995 
0.9998 
1.0000 



0.0123 
0.0663 
0.1851 
0.3594 
0.5512 
0.7199 
0.8436 
0.9214 
0.9642 
0.9851 
0.9943 
0.9980 
0.9993 
0.9998 
0.9999 
1.0000 



0.0111 
0.061 1 
0.1736 
0.3423 
0.5321 
0.7029 
0.831 1 
0.9134 
0.9597 
0.9829 
0.9933 
0.9976 
0.9992 
0.9997 
0.9999 
1.0000 



0.0101 
0.0563 
0.1626 
0.3257 
0.5132 
0.6858 
0.8180 
0.9049 
0.9549 
0.9805 
0.9922 
0.9971 
0.9990 
0.9997 
0.9999 
1.0000 



0.0091 
0.0518 
0.1523 
0.3097 
0.4946 
0.6684 
0.8046 
0.8960 
0.9497 
0.9778 
0.9910 
0.9966 
0.9988 
0.9996 
0.9999 
1.0000 



0.0082 
0.0477 
0.1425 
0.2942 
0.4763 
0.6510 
0.7908 
0.8867 
0.9442 
0.9749 
0.9896 
0.9960 
0.9986 
0.9995 
0.9999 
1.0000 



0.0074 
0.0439 
0.1333 
0.2793 
0.4582 
0.6335 
0.7767 
0.8769 
0.9382 
0.9717 
0.9880 
0.9953 
0.9983 
0.9994 
0.9998 
0.9999 
1.0000 



0.0067 
0.0404 
0.1247 
0.2650 
0.4405 
0.6160 
0.7622 
0.8666 
0.9319 
0.9682 
0.9863 
0.9945 
0.9980 
0.9993 
0.9998 
0.9999 
1.0000 



\: 5.1 



5.2 



5.3 



5.4 



5.5 



5.6 



5.7 



5.8 



5.9 



6.0 



k:0 0 



8 0 

9 0 

10 0 

11 0 

12 0 

13 0 

14 0 

15 0 

16 1 
17 
18 



0061 
0372 
1165 
2513 
4231 
5984 
7474 
8560 
9252 
9644 
9844 
9937 
9976 
9992 
9997 
9999 
0000 



0.0055 
0.0342 
0.1088 
0.2381 
0.4061 
0.5809 
0.7324 
0.8449 
0.9181 
0.9603 
0.9823 
0.9927 
0.9972 
0.9990 
0.9997 
0.9999 
1.0000 



0.0050 
0.0314 
0.1016 
0.2254 
0.3895 
0.5635 
0.7171 
0.8335 
0.9106 
0.9559 
0.9800 
0.9916 
0.9967 
0.9988 
0.9996 
0.9999 
1.0000 



0.0045 
0.0289 
0.0948 
0.2133 
0.3733 
0.5461 
0.7017 
0.8217 
0.9027 
0.9512 
0.9775 
0.9904 
0.9962 
0.9986 
0.9995 
0.9998 
0.9999 
1.0000 



0.0041 
0.0266 
0.0884 
0.2017 
0.3575 
0.5289 
0.6860 
0.8095 
0.8944 
0.9462 
0.9747 
0.9890 
0.9955 
0.9983 
0.9994 
0.9998 
0.9999 
1.0000 



0.0037 
0.0244 
0.0824 
0.1906 
0.3422 
0.51 19 
0.6703 
0.7970 
0.8857 
0.9409 
0.9718 
0.9875 
0.9949 
0.9980 
0.9993 
0.9998 
0.9999 
1.0000 



0.0033 
0.0224 
0.0768 
0.1800 
0.3272 
0.4950 
0.6544 
0.7841 
0.8766 
0.9352 
0.9686 
0.9859 
0.9941 
0.9977 
0.9991 
0.9997 
0.9999 
1.0000 



0.0030 
0.0206 
0.0715 
0.1700 
0.3127 
0.4783 
0.6384 
0.7710 
0.8672 
0.9292 
0.9651 
0.9841 
0.9932 
0.9973 
0.9990 
0.9996 
0.9999 
1.0000 



0.0027 
0.0189 
0.0666 
0.1604 
0.2987 
0.4619 
0.6224 
0.7576 
0.8574 
0.9228 
0.9614 
0.9821 
0.9922 
0.9969 
0.9988 
0.9996 
0.9999 
1.0000 



0.0025 
0.0174 
0.0620 
0.1512 
0.2851 
0.4457 
0.6063 
0.7440 
0.8472 
0.9161 
0.9574 
0.9799 
0.9912 
0.9964 
0.9986 
0.9995 
0.9998 
0.9999 
1.0000 
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\: 6.1 



6.2 



6.3 



6.4 



6.5 



6.6 



6.7 



6.8 



6.9 



7.0 



TABLE A.4 



(continued) 



k:0 0 



8 0 

9 0 

10 0 

11 0 

12 0 

13 0 

14 0 

15 0 

16 0 

17 0 

18 1 
19 



0022 
0159 
0577 
1425 
2719 
4298 
5902 
7301 
8367 
9090 
9531 
9776 
9900 
9958 
9984 
9994 
9998 
9999 
0000 



0.0020 
0.0146 
0.0536 
0.1342 
0.2592 
0.4141 
0.5742 
0.7160 
0.8259 
0.9016 
0.9486 
0.9750 
0.9887 
0.9952 
0.9981 
0.9993 
0.9997 
0.9999 
1.0000 



0.0018 
0.0134 
0.0498 
0.1264 
0.2469 
0.3988 
0.5582 
0.7017 
0.8148 
0.8939 
0.9437 
0.9723 
0.9873 
0.9945 
0.9978 
0.9992 
0.9997 
0.9999 
1.0000 



0.0017 
0.0123 
0.0463 
0.1 189 
0.2351 
0.3837 
0.5423 
0.6873 
0.8033 
0.8858 
0.9386 
0.9693 
0.9857 
0.9937 
0.9974 
0.9990 
0.9996 
0.9999 
1.0000 



0.0015 
0.0113 
0.0430 
0.1118 
0.2237 
0.3690 
0.5265 
0.6728 
0.7916 
0.8774 
0.9332 
0.9661 
0.9840 
0.9929 
0.9970 
0.9988 
0.9996 
0.9998 
0.9999 
1.0000 



0.0014 
0.0103 
0.0400 
0.1052 
0.2127 
0.3547 
0.5108 
0.6581 
0.7796 
0.8686 
0.9274 
0.9627 
0.9821 
0.9920 
0.9966 
0.9986 
0.9995 
0.9998 
0.9999 
1.0000 



0.0012 
0.0095 
0.0371 
0.0988 
0.2022 
0.3406 
0.4953 
0.6433 
0.7673 
0.8596 
0.9214 
0.9591 
0.9801 
0.9909 
0.9961 
0.9984 
0.9994 
0.9998 
0.9999 
1.0000 



0.0011 
0.0087 
0.0344 
0.0928 
0.1920 
0.3270 
0.4799 
0.6285 
0.7548 
0.8502 
0.9151 
0.9552 
0.9779 
0.9898 
0.9956 
0.9982 
0.9993 
0.9997 
0.9999 
1.0000 



0.0010 
0.0080 
0.0320 
0.0871 
0.1823 
0.3137 
0.4647 
0.6136 
0.7420 
0.8405 
0.9084 
0.9510 
0.9755 
0.9885 
0.9950 
0.9979 
0.9992 
0.9997 
0.9999 
1.0000 



0.0009 
0.0073 
0.0296 
0.0818 
0.1730 
0.3007 
0.4497 
0.5987 
0.7291 
0.8305 
0.9015 
0.9467 
0.9730 
0.9872 
0.9943 
0.9976 
0.9990 
0.9996 
0.9999 
1.0000 



\: 7.1 



7.2 



7.3 



7.4 



7.5 



7.6 



7.7 



7.8 



7.9 



8.0 



k:0 0 



8 0 

9 0 

10 0 

11 0 

12 0 

13 0 

14 0 

15 0 

16 0 

17 0 

18 0 

19 0 

20 1 
21 



0008 
0067 
0275 
0767 
1641 
2881 
4349 
5838 
7160 
8202 
8942 
9420 
9703 
9857 
9935 
9972 
9989 
9996 
9998 
9999 
0000 



0.0007 
0.0061 
0.0255 
0.0719 
0.1555 
0.2759 
0.4204 
0.5689 
0.7027 
0.8096 
0.8867 
0.9371 
0.9673 
0.9841 
0.9927 
0.9969 
0.9987 
0.9995 
0.9998 
0.9999 
1.0000 



0.0007 
0.0056 
0.0236 
0.0674 
0.1473 
0.2640 
0.4060 
0.5541 
0.6892 
0.7988 
0.8788 
0.9319 
0.9642 
0.9824 
0.9918 
0.9964 
0.9985 
0.9994 
0.9998 
0.9999 
1.0000 



0.0006 
0.0051 
0.0219 
0.0632 
0.1395 
0.2526 
0.3920 
0.5393 
0.6757 
0.7877 
0.8707 
0.9265 
0.9609 
0.9805 
0.9908 
0.9959 
0.9983 
0.9993 
0.9997 
0.9999 
1.0000 



0.0006 
0.0047 
0.0203 
0.0591 
0.1321 
0.2414 
0.3782 
0.5246 
0.6620 
0.7764 
0.8622 
0.9208 
0.9573 
0.9784 
0.9897 
0.9954 
0.9980 
0.9992 
0.9997 
0.9999 
1.0000 



0.0005 
0.0043 
0.0188 
0.0554 
0.1249 
0.2307 
0.3646 
0.5100 
0.6482 
0.7649 
0.8535 
0.9148 
0.9536 
0.9762 
0.9886 
0.9948 
0.9978 
0.9991 
0.9996 
0.9999 
1.0000 



0.0005 
0.0039 
0.0174 
0.0518 
0.1 181 
0.2203 
0.3514 
0.4956 
0.6343 
0.7531 
0.8445 
0.9085 
0.9496 
0.9739 
0.9873 
0.9941 
0.9974 
0.9989 
0.9996 
0.9998 
0.9999 
1.0000 



0.0004 
0.0036 
0.0161 
0.0485 
0.1117 
0.2103 
0.3384 
0.4812 
0.6204 
0.7411 
0.8352 
0.9020 
0.9454 
0.9714 
0.9859 
0.9934 
0.9971 
0.9988 
0.9995 
0.9998 
0.9999 
1.0000 



0.0004 
0.0033 
0.0149 
0.0453 
0.1055 
0.2006 
0.3257 
0.4670 
0.6065 
0.7290 
0.8257 
0.8952 
0.9409 
0.9687 
0.9844 
0.9926 
0.9967 
0.9986 
0.9994 
0.9998 
0.9999 
1.0000 



0.0003 
0.0030 
0.0138 
0.0424 
0.0996 
0.1912 
0.3134 
0.4530 
0.5925 
0.7166 
0.8159 
0.8881 
0.9362 
0.9658 
0.9827 
0.9918 
0.9963 
0.9984 
0.9993 
0.9997 
0.9999 
1.0000 
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TABLE A.4 



(continued) 



\: 8.1 



8.2 



8.3 



8.4 



8.5 



8.6 



8.7 



8.8 



8.9 



9.0 



k:0 0 



8 0 

9 0 

10 0 

11 0 

12 0 

13 0 

14 0 

15 0 

16 0 

17 0 

18 0 

19 0 

20 0 

21 1 
22 
23 



0003 
0028 
0127 
0396 
0940 
1822 
3013 
4391 
5786 
7041 
8058 
8807 
9313 
9628 
9810 
9908 
9958 
9982 
9992 
9997 
9999 
0000 



0.0003 
0.0025 
0.01 18 
0.0370 
0.0887 
0.1736 
0.2896 
0.4254 
0.5647 
0.6915 
0.7955 
0.8731 
0.9261 
0.9595 
0.9791 
0.9898 
0.9953 
0.9979 
0.9991 
0.9997 
0.9999 
1.0000 



0.0002 
0.0023 
0.0109 
0.0346 
0.0837 
0.1653 
0.2781 
0.4119 
0.5507 
0.6788 
0.7850 
0.8652 
0.9207 
0.9561 
0.9771 
0.9887 
0.9947 
0.9977 
0.9990 
0.9996 
0.9998 
0.9999 
1.0000 



0.0002 
0.0021 
0.0100 
0.0323 
0.0789 
0.1573 
0.2670 
0.3987 
0.5369 
0.6659 
0.7743 
0.8571 
0.9150 
0.9524 
0.9749 
0.9875 
0.9941 
0.9973 
0.9989 
0.9995 
0.9998 
0.9999 
1.0000 



0.0002 
0.0019 
0.0093 
0.0301 
0.0744 
0.1496 
0.2562 
0.3856 
0.5231 
0.6530 
0.7634 
0.8487 
0.9091 
0.9486 
0.9726 
0.9862 
0.9934 
0.9970 
0.9987 
0.9995 
0.9998 
0.9999 
1.0000 



0.0002 
0.0018 
0.0086 
0.0281 
0.0701 
0.1422 
0.2457 
0.3728 
0.5094 
0.6400 
0.7522 
0.8400 
0.9029 
0.9445 
0.9701 
0.9848 
0.9926 
0.9966 
0.9985 
0.9994 
0.9998 
0.9999 
1.0000 



0.0002 
0.0016 
0.0079 
0.0262 
0.0660 
0.1352 
0.2355 
0.3602 
0.4958 
0.6269 
0.7409 
0.8311 
0.8965 
0.9403 
0.9675 
0.9832 
0.9918 
0.9962 
0.9983 
0.9993 
0.9997 
0.9999 
1.0000 



0.0002 
0.0015 
0.0073 
0.0244 
0.0621 
0.1284 
0.2256 
0.3478 
0.4823 
0.6137 
0.7294 
0.8220 
0.8898 
0.9358 
0.9647 
0.9816 
0.9909 
0.9957 
0.9981 
0.9992 
0.9997 
0.9999 
1.0000 



0.0001 
0.0014 
0.0068 
0.0228 
0.0584 
0.1219 
0.2160 
0.3357 
0.4689 
0.6006 
0.7178 
0.8126 
0.8829 
0.931 1 
0.9617 
0.9798 
0.9899 
0.9952 
0.9978 
0.9991 
0.9996 
0.9998 
0.9999 
1.0000 



0.0001 
0.0012 
0.0062 
0.0212 
0.0550 
0.1157 
0.2068 
0.3239 
0.4557 
0.5874 
0.7060 
0.8030 
0.8758 
0.9261 
0.9585 
0.9780 
0.9889 
0.9947 
0.9976 
0.9989 
0.9996 
0.9998 
0.9999 
1.0000 



X: 9.1 



9.2 



9.3 



9.4 



9.5 



9.6 



9.7 



9.8 



9.9 



10.0 



k:0 0 



8 0 

9 0 

10 0 

11 0 

12 0 

13 0 

14 0 

15 0 

16 0 

17 0 

18 0 

19 0 

20 0 

21 0 

22 0 

23 1 
24 



0001 
001 1 
0058 
0198 
0517 
1098 
1978 
3123 
4426 
5742 
6941 
7932 
8684 
9210 
9552 
9760 
9878 
9941 
9973 
9988 
9995 
9998 
9999 
0000 



0.0001 
0.0010 
0.0053 
0.0184 
0.0486 
0.1041 
0.1892 
0.3010 
0.4296 
0.5611 
0.6820 
0.7832 
0.8607 
0.9156 
0.9517 
0.9738 
0.9865 
0.9934 
0.9969 
0.9986 
0.9994 
0.9998 
0.9999 
1.0000 



0.0001 
0.0009 
0.0049 
0.0172 
0.0456 
0.0986 
0.1808 
0.2900 
0.4168 
0.5479 
0.6699 
0.7730 
0.8529 
0.9100 
0.9480 
0.9715 
0.9852 
0.9927 
0.9966 
0.9985 
0.9993 
0.9997 
0.9999 
1.0000 



0.0001 
0.0009 
0.0045 
0.0160 
0.0429 
0.0935 
0.1727 
0.2792 
0.4042 
0.5349 
0.6576 
0.7626 
0.8448 
0.9042 
0.9441 
0.9691 
0.9838 
0.9919 
0.9962 
0.9983 
0.9992 
0.9997 
0.9999 
1.0000 



0.0001 
0.0008 
0.0042 
0.0149 
0.0403 
0.0885 
0.1649 
0.2687 
0.3918 
0.5218 
0.6453 
0.7520 
0.8364 
0.8981 
0.9400 
0.9665 
0.9823 
0.991 1 
0.9957 
0.9980 
0.9991 
0.9996 
0.9999 
0.9999 
1.0000 



0.0001 
0.0007 
0.0038 
0.0138 
0.0378 
0.0838 
0.1574 
0.2584 
0.3796 
0.5089 
0.6329 
0.7412 
0.8279 
0.8919 
0.9357 
0.9638 
0.9806 
0.9902 
0.9952 
0.9978 
0.9990 
0.9996 
0.9998 
0.9999 
1.0000 



0.0001 
0.0007 
0.0035 
0.0129 
0.0355 
0.0793 
0.1502 
0.2485 
0.3676 
0.4960 
0.6205 
0.7303 
0.8191 
0.8853 
0.9312 
0.9609 
0.9789 
0.9892 
0.9947 
0.9975 
0.9989 
0.9995 
0.9998 
0.9999 
1.0000 



0.0001 
0.0006 
0.0033 
0.0120 
0.0333 
0.0750 
0.1433 
0.2388 
0.3558 
0.4832 
0.6080 
0.7193 
0.8101 
0.8786 
0.9265 
0.9579 
0.9770 
0.9881 
0.9941 
0.9972 
0.9987 
0.9995 
0.9998 
0.9999 
1.0000 



0.0001 
0.0005 
0.0030 
0.0111 
0.0312 
0.0710 
0.1366 
0.2294 
0.3442 
0.4705 
0.5955 
0.7081 
0.8009 
0.8716 
0.9216 
0.9546 
0.9751 
0.9870 
0.9935 
0.9969 
0.9986 
0.9994 
0.9997 
0.9999 
1.0000 



0.0000 
0.0005 
0.0028 
0.0103 
0.0293 
0.0671 
0.1301 
0.2202 
0.3328 
0.4579 
0.5830 
0.6968 
0.7916 
0.8645 
0.9165 
0.9513 
0.9730 
0.9857 
0.9928 
0.9965 
0.9984 
0.9993 
0.9997 
0.9999 
1.0000 
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TABLE A. 5 


1-10 


11-20 


21-30 


31-40 


41-50 



! 1 ! 1 | 1 | 1 | 1 Table of Random Digits 

Row 



1 


37220 


84861 


59998 


7731 -| 


87754 


04899 


75699 


22423 


01 200 


79606 


2 


31618 


06840 


451 67 


1 3747 


74382 


39250 


37038 


4281 0 


81 1 83 


27226 


3 


53778 


71813 


50306 


4741 5 


21 682 


74984 


69038 


80269 


1 8955 


47908 


4 


82652 


581 62 


90352 


10751 


49968 


0661 1 


23573 


29523 


78542 


49685 


5 


40372 


531 09 


76995 


24681 


4271 1 


06899 


14208 


63897 


60834 


46618 


6 


24952 


3071 1 


22535 


51 397 


37277 


04343 


84920 


1 1 970 


28098 


41 367 


7 


94953 


96367 


87143 


71 467 


31 194 


39897 


56605 


21810 


72064 


1 1 296 


8 


98972 


1 2203 


90759 


561 57 


40644 


1 6283 


8081 6 


07379 


871 71 


8901 6 


g 


61 759 


32900 


05299 


56687 


92355 


1 1684 


23278 


82492 


00366 


81 502 


10 


14338 


44401 


22630 


06918 


861 57 


02229 


68995 


88455 


3521 0 


84775 


11 


03990 


64484 


50620 


04134 


04132 


35323 


39948 


53446 


1 31 50 


01 522 


12 


04003 


21 357 


21 560 


26820 


97543 


1 5607 


75220 


27896 


29722 


45389 


13 


75942 


36578 


85584 


55496 


88476 


01255 


75094 


26800 


78580 


86252 


14 


29945 


1 7580 


17770 


56883 


90165 


54800 


47262 


14471 


66496 


66618 


15 


05675 


07093 


47456 


58942 


1 3685 


58422 


50987 


46420 


60965 


07193 


16 


38770 


60956 


63383 


54619 


28131 


43657 


25686 


30869 


44495 


68087 


17 


93578 


84503 


8491 1 


86482 


45267 


23662 


591 18 


35926 


1 1854 


1 2602 


18 


54526 


35249 


68501 


02206 


77793 


34752 


67528 


51 706 


94351 


65383 


19 


6601 9 


99264 


8341 0 


1 6479 


55742 


75660 


24750 


1 6698 


1 5630 


33348 


20 


30995 


81658 


22563 


34561 


29004 


64725 


801 37 


97704 


63540 


29097 


21 


54873 


00592 


1 6042 


46776 


31 730 


87258 


1 5861 


031 1 5 


95167 


1 7410 


22 


69350 


23570 


89681 


00936 


22521 


8891 1 


10908 


04029 


21081 


42224 


23 


43775 


21 148 


77549 


86081 


02143 


29803 


17758 


81 909 


09509 


64255 


24 


94238 


08655 


67935 


26234 


1 701 1 


44426 


1 8663 


93938 


1 2767 


38048 


25 


90329 


65940 


30771 


99446 


08749 


86843 


72979 


03701 


72777 


1 7442 


26 


49606 


51896 


1 3569 


581 72 


37655 


22835 


26546 


08220 


22221 


62889 


27 


85876 


62869 


92898 


06662 


89382 


20703 


70312 


34032 


40788 


07235 


28 


99021 


1 7433 


31031 


92399 


28930 


46651 


81288 


32707 


50799 


59045 


29 


43672 


461 99 


83184 


90585 


32998 


82557 


82994 


70297 


7991 0 


86498 


30 


7791 5 


14650 


48304 


1 0282 


1 6356 


50667 


23469 


12116 


42141 


20876 


31 


6281 5 


20732 


36276 


1 941 6 


09552 


52494 


85690 


61 579 


99879 


80023 


32 


35431 


28351 


1 0559 


52307 


49924 


97564 


68685 


49850 


62185 


36563 


33 


2621 2 


06143 


84840 


76247 


08208 


681 91 


7351 1 


51271 


28635 


33390 


34 


03772 


70603 


28430 


72720 


70275 


87396 


54825 


60139 


48572 


90762 


35 


22831 


69776 


07506 


32013 


51906 


49757 


41867 


1 5688 


53729 


44526 


36 


91 527 


1 581 0 


88468 


47664 


28231 


751 04 


24370 


63076 


53040 


01 302 


37 


98889 


36471 


23606 


86428 


36786 


79805 


1 7891 


92903 


52891 


03626 


38 


80596 


42027 


72309 


71805 


39628 


59368 


15410 


41 685 


05732 


87997 


39 


06160 


85702 


5061 7 


52953 


85886 


9901 3 


52135 


35314 


1 3276 


64765 


40 


07145 


80861 


37672 


03747 


76227 


52880 


65131 


92937 


32659 


44959 


41 


25756 


9461 6 


7561 1 


70478 


82600 


89252 


99473 


75367 


36786 


08165 


42 


31698 


84269 


82241 


05714 


68314 


46971 


27440 


12794 


51833 


88879 


43 


65292 


54687 


65808 


38799 


44635 


83610 


83257 


79134 


29222 


1 1974 


44 


55947 


27034 


40437 


52112 


94440 


36443 


89401 


58176 


74072 


57605 


45 


21916 


48584 


21519 


80149 


38375 


27493 


94284 


94171 


65668 


45875 


46 


38532 


95321 


76039 


97791 
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33428 


37024 


73345 
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48764 


47 


33462 


61546 
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6801 1 


47564 


77265 


91710 


89205 


04352 


48 


13526 


46548 


09115 


99063 


45898 
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70038 


88042 


85648 


47177 


49 


34828 


14870 


70575 


24312 


79631 


75697 


72404 


31103 


00016 


42082 


50 


89895 


82390 


89421 


56029 


61854 


05548 


21777 


51553 


17064 


33859 



Source: Random digits generated by Minitab. 
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Appendix A: Statistical Tables 



TABLE A.5 












1-10 


11-20 


21-30 


31-40 


41-50 



(continued) 

Row 



51 


96018 


27032 


35948 


81017 


87632 


68779 


98468 


82882 


58835 


61583 


52 


08657 


49477 


20792 


55367 


27957 


21669 


25807 


01687 


06853 


87214 


53 


05183 


80239 


20082 


32806 


84896 


36574 


99899 


02874 


58987 


07423 


54 


30558 


18243 


96761 


22184 


05203 


62794 


49547 


62406 


82189 


38127 


55 


85574 


37307 


36048 


72513 


49070 


82728 


86719 


96008 


85102 


66717 


56 


30554 


02449 


74975 


54343 


32643 


06124 


18521 


63560 


51 187 


23974 


57 


08408 


1591 1 


13377 


74630 


53089 


1 1912 


93262 


10961 


33576 


60466 


58 


62944 


23357 


49036 


41265 


88109 


02667 


75931 


89523 


53776 


14953 


59 


93160 


32676 


91 167 


63054 


46459 


76406 


02739 


44198 


16693 


85504 


60 


99581 


93196 


04941 


28933 


30480 


81685 


14525 


34978 


68211 


18591 


61 


71317 


85120 


37427 


57745 


78399 


49835 


67713 


10736 


70576 


81235 


62 


88100 


72776 


19450 


63109 


23254 


45376 


89233 


40445 


09325 


73602 


63 


70782 


68998 


78042 


28590 


94522 


49559 


77357 


16808 


45458 


35337 


64 
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34011 


47582 
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32166 
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72272 


26486 


65 


96757 
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55382 
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19294 


66 
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67 
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16636 


6121 1 


15809 


48807 


68 
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17354 
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24965 
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83896 


39712 
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95621 


81415 


69 


43638 


5481 1 


63054 


08267 


02125 


17892 


50132 


01293 


90524 


30664 


70 


67436 


14892 


65654 


88989 


63574 


39532 


26906 


72282 


03567 


54279 


71 


08336 
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04056 


15208 


42517 


97140 
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55647 


76220 


70525 


72 


72678 
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64702 


25685 
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25217 


65463 


65810 


73 


13063 


46035 


35899 


22682 


86499 


28475 


89917 


1 1856 


11202 


45920 


74 


70022 


46043 


37033 


04781 


53014 


68215 


46567 


83996 


51586 


00305 


75 


72803 


61399 


52929 


75693 


38989 


14261 


39257 


381 12 


24227 


15175 


76 


1 1566 


26680 


54965 


42491 


37382 


1 1800 


49989 


96604 


71890 


62676 


77 


63085 


07137 


63898 


54353 


44912 


00010 


04212 


51 192 


86710 


38272 


78 


43029 


84646 


21389 


71859 


85939 


95973 


79164 


23010 


21267 


39741 


79 


25172 


64085 


98417 


45034 


55105 


15375 


50465 


75316 


92495 


93023 


80 


22565 


88294 
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45035 
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46818 


73424 
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89830 


66504 


81 
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82 
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79124 
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85417 
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83 
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90514 
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96627 


21865 


84 
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24366 


96567 


21296 
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46323 


35922 
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82847 
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85 
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37743 
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32722 
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88391 
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14786 


86 


49847 
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00710 


87 
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62403 
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491 15 


91855 


85141 


88 
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90 
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18701 


39495 
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1 2488 
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46531 
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TABLE A.7 



The Chi-Square Distribution 




For a = Right-Tail Area of 



d.f. 


0.99 


0.975 


0.95 


0.90 


0.10 


0.05 


0.025 


0.01 


1 


0 


00016 


0.00098 


0.00039 


0.0158 


Z 


1 UD 


3.841 


5.024 


6.635 


2 


0 


0201 


0.0506 


0.103 


0.211 


A 

4 


DUj 


5.991 


7.378 


9.210 


3 


0 


115 


0.216 


0.352 


0.584 


6 


251 


7.815 


9.348 


11.345 


4 


0 


297 


0.484 


0.711 


1.064 


7 


779 


9.488 


1 1.143 


13.277 


5 


0 


554 


0.831 


1.145 


1.610 


9 


236 


1 1.070 


12.833 


15.086 


6 


0 


872 


1.237 


1.635 


2.204 


10 


645 


12.592 


14.449 


16.812 


7 


1 


239 


1.690 


2.167 


2.833 


12 


017 


14.067 


16.013 


18.475 


8 


1 


646 


2.180 


2.733 


3.490 


13 


362 


15.507 


17.535 


20.090 


9 


2 


088 


2.700 


3.325 


4.168 


14 


684 


16.919 


19.023 


21.666 


10 


2 


558 


3.247 


3.940 


4.865 


15 


987 


18.307 


20.483 


23.209 


11 


3 


053 


3.816 


4.575 


5.578 


17 


275 


19.675 


21.920 


24.725 


12 


3 


571 


4.404 


5.226 


6.304 


18 


549 


21.026 


23.337 


26.217 


13 


4 


107 


5.009 


5.892 


7.042 


19 


812 


22.362 


24.736 


27.688 


14 


4 


660 


5.629 


6.571 


7.790 


21 


064 


23.685 


26.1 19 


29.141 


15 


5 


229 


6.262 


7.261 


8.547 


22 


307 


24.996 


27.488 


30.578 


16 


5 


812 


6.908 


7.962 


9.312 


23 


542 


26.296 


28.845 


32.000 


17 


6.408 


7.564 


8.672 


10.085 


24 


769 


27.587 


30.191 


33.409 


18 


7 


015 


8.231 


9.390 


10.865 


25 


989 


28.869 


31.526 


34.805 


19 


7 


633 


8.907 


10.1 17 


11.651 


27 


204 


30.144 


32.852 


36.191 


20 


8 


260 


9.591 


10.851 


12.443 


28.412 


31.410 


34.170 


37.566 


21 


8 


897 


10.283 


11.591 


13.240 


29 


615 


32.671 


35.479 


38.932 


22 


9 


542 


10.982 


12.338 


14.042 


30 


813 


33.924 


36.781 


40.290 


23 


10 


916 


1 1.689 


13.091 


14.848 


32 


007 


35.172 


38.076 


41.638 


24 


10 


856 


12.401 


13.848 


15.659 


33 


196 


36.415 


39.364 


42.980 


25 


11 


524 


13.120 


14.611 


16.473 


34 


382 


37.652 


40.647 


44.314 


26 


12 


198 


13.844 


15.379 


17.292 


35 


563 


38.885 


41.923 


45.642 


27 


12 


879 


14.573 


16.151 


18.114 


36 


741 


40.113 


43.195 


46.963 


28 


13 


565 


15.308 


16.928 


18.939 


37 


916 


41.337 


44.461 


48.278 


29 


14 


256 


16.047 


17.708 


19.768 


39 


087 


42.557 


45.722 


49.588 


30 


14 


953 


16.791 


18.493 


20.599 


40 


256 


43.773 


46.979 


50.892 


40 


22 


164 


24.433 


26.509 


29.051 


51 


805 


55.759 


59.342 


63.691 


50 


29 


707 


32.357 


34.764 


37.689 


63 


167 


67.505 


71.420 


76.154 


60 


37 


485 


40.482 


43.188 


46.459 


74 


397 


79.082 


83.298 


88.381 


70 


45 


442 


48.758 


51.739 


55.329 


85 


527 


90.531 


95.023 


100.42 


80 


53 


540 


57.153 


60.391 


64.278 


96 


578 


101.88 


106.63 


1 12.33 


90 


61 


754 


65.647 


69.126 


73.291 


107 


57 


113.15 


1 18.14 


124.12 


100 


70 


065 


74.222 


77.930 


82.358 


118 


50 


124.34 


129.56 


135.81 



Source: Chi-square values generated by Minitab, then rounded as shown. 
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Two-Tail Test" 

One-Tail Test: 


(V — 


0.20 
0.10 


(V — 


0.10 
0.05 


OL = 


0.05 
0.025 


(V — 


0.02 
0.01 


(Y = 


0.01 
0.005 


n = 4 


1, 


9 




10 


0, 


10 


0, 


10 


0, 


10 


5 


3, 


12 


1 


14 


0, 


15 


0, 


15 


0, 


15 


6 


4, 


17 


3, 


18 


1, 


20 


0, 


21 


0, 


21 


7 


6, 


22 


4, 


24 


3, 


25 


1, 


27 


0, 


28 


8 


9, 


27 


6, 


30 


4, 


32 


2, 


34 


1, 


35 


9 


11, 


34 


9, 


36 


6, 


39 


4, 


41 


2, 


43 


10 


15, 


40 


11, 


44 


9, 


46 


6, 


49 


4, 


51 


11 


18, 


48 


14, 


52 


11, 


55 


8, 


58 


6, 


60 


12 


22, 


56 


18, 


60 


14, 


64 


10, 


68 


8, 


70 


13 


27, 


64 


22, 


69 


18, 


73 


13, 


78 


10, 


81 


14 


32, 


73 


26, 


79 


22, 


83 


16, 


89 


13, 


92 


15 


37 


83 


31, 


89 


26, 


94 


20 


1 00 


1 5 


1 04 


16 


43, 


93 


36, 100 


30, 106 


24, 


112 


20, 


116 


17 


49, 


104 


42, 


1 1 1 


35, 


118 


28, 


125 


24, 


129 


18 


56, 


115 


48, 123 


41, 


130 


33, 


138 


28, 


143 


19 


63, 


127 


54, 136 


47, 143 


38, 


152 


33, 


157 


20 


70, 


140 


61, 


149 


53, 


157 


44, 


166 


38, 


172 



TABLE A.8 



Wilcoxon Signed Rank Test, 
Lower and Upper Critical 
Values 



Source: Adapted from Roger C. Pfaffenberger and James H. Patterson, Statistical Methods for Business and 
Economics (Homewood, III.: Richard D. Irwin, Inc., 1987), p. 1 10, and R. L. McCornack, "Extended Tables of 
the Wilcoxon Matched Pairs Signed Rank Statistics, " Journal of the American Statistical Association 60 (1 965), 
864-871. 
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TABLE A.9 



Wilcoxon Rank Sum Test, 
Lower and Upper 
Critical Values 



a = 0.025 (one-tail) or a = 0.05 (two-tail) 



n,: 




3 


4 


5 


6 


7 


8 




9 


10 


n 2 :3 


5, 


16 


6, 18 


6, 21 


7, 23 


7, 26 


8, 28 


8, 


31 


9, 


33 


4 


6, 


18 


11, 25 


12, 28 


12, 32 


13, 35 


14, 38 


15, 


41 


16, 


44 


5 


6, 


21 


12, 28 


18, 37 


19, 41 


20, 45 


21, 49 


22, 


53 


24, 


56 


6 


7, 


23 


12, 32 


19, 41 


26, 52 


28, 56 


29, 61 


31, 


65 


32, 


70 


7 


7, 


26 


13, 35 


20, 45 


28, 56 


37, 68 


39, 73 


41, 


78 


43, 


83 


8 


8, 


28 


14, 38 


21, 49 


29, 61 


39, 73 


49, 87 


51, 


93 


54, 


98 


9 


8, 


31 


15, 41 


22, 53 


31, 65 


41, 78 


51, 93 


63, 


108 


66, 


114 


10 


9, 


33 


16, 44 


24, 56 


32, 70 


43, 83 


54, 98 


66, 


1 14 


79, 


131 



(Note: n 1 is the smaller of the two samples — i.e., n, s n 2 .) 



a = 0.05 (one-tail) or a = 0.10 (two-tail) 
n.,: 3 4 5 6 7 8 9 10 



n 2 :3 6, 15 7, 17 

4 7, 17 12, 24 

5 7, 20 13, 27 

6 8, 22 14, 30 

7 9, 24 15, 33 

8 9, 27 16, 36 

9 10, 29 17, 39 
10 11, 31 18, 42 



7, 20 


8, 


22 


13, 27 


14, 


30 


19, 36 


20, 40 


20, 40 


28, 


50 


22, 43 


30, 


54 


24, 46 


32, 


58 


25, 50 


33, 


63 


26, 54 


35, 


67 



9, 24 9, 27 

15, 33 16, 36 

22, 43 24, 46 

30, 54 32, 58 

39, 66 41, 71 

41, 71 52, 84 

43, 76 54, 90 

46, 80 57, 95 



10, 


29 


11, 


31 


17, 


39 


18, 


42 


25, 


50 


26, 


54 


33, 


63 


35, 


67 


43, 


76 


46, 


80 


54, 


90 


57, 


95 


66, 


105 


69, 


111 


69, 


111 


83, 


127 



(Note: n ] is the smaller of the two samples — i.e., s n 2 .) 



Source: F. Wilcoxon and R. A. Wilcox, Some Approximate Statistical Procedures (New York: American 
Cyanamid Company, 1964), pp. 20-23. 
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TABLE A. 10 



Significance Level a r ... tr . t 

. = Critical Values of D for the 

bampie Kolmogorov-Smirnov Test 

Sizen 0.20 0.15 0.10 0.05 0.01 of Normality 



4 


.300 


.319 


.352 


.381 


.41 7 


5 


.285 


.299 


.31 5 


.337 


.405 


6 


.265 


.277 


.294 


.319 


.364 


7 


.247 


.258 


.276 


.300 


.348 


8 


.233 


.244 


.261 


.285 


.331 


9 


.223 


.233 


.249 


.271 


.31 1 


10 


.21 5 


.224 


.239 


.258 


.294 


11 


.206 


.217 


.230 


.249 


.284 


12 


.199 


.212 


.223 


.242 


.275 




1 on 


.zuz 


91/1 
.Z 1 4- 


93/1 
.Z 34 


.Zoo 


14 


.183 


.194 


.207 


.227 


.261 


15 


.177 


.187 


.201 


.220 


.257 


16 


.173 


.182 


.195 


.213 


.250 


17 


.169 


.177 


.189 


.206 


.245 


18 


.166 


.173 


.184 


.200 


.239 


19 


.163 


.169 


.179 


.195 


.235 


20 


.160 


.166 


.174 


.190 


.231 


25 


.142 


.147 


.158 


.173 


.200 


30 


.131 


.136 


.144 


.161 


.187 




.736 


.768 


.805 


.886 


1.031 


r30 


Vn 


Vn 


Vn 


Vn 


Vn 



Source: From H. W. Lilliefors, "On the Kolmogorov-Smirnov Test for Normality with Mean and Variance 
Unknown," Journal of the American Statistical Association, 62 (1 967), pp. 399-402. As adapted by Conover, 
Practical Nonparametric Statistics (New York: John Wiley, 1971), p. 398. 
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TABLE A.11 



Critical Values of 
Spearman's Rank Correla- 
tion Coefficient, r s , One-Tail 
Test. (For a two-tail test, the 
listed values correspond to 
the 2a level of significance.) 



a = 0.05 



a = 0.025 



a = 0.01 



a = 0.005 



5 


0.900 








6 


0.829 


0.886 


0.943 




7 


0.714 


0.786 


0.893 




8 


0.643 


0.738 


0.833 


0.881 


9 


0.600 


0.683 


0.783 


0.833 


10 


0.564 


0.648 


0.745 


0.794 


11 


0.523 


0.623 


0.736 


0.818 


12 


0.497 


0.591 


0.703 


0.780 


13 


0.475 


0.566 


0.673 


0.745 


14 


0.457 


0.545 


0.646 


0.716 


15 


0.441 


0.525 


0.623 


0.689 


16 


0.425 


0.507 


0.601 


0.666 


17 


0.41 2 


0.490 


0.582 


0.645 


18 


0.399 


0.476 


0.564 


0.625 


19 


0.388 


0.462 


0.549 


0.608 


20 


0.377 


0.450 


0.534 


0.591 


21 


0.368 


0.438 


0.521 


0.576 


22 


0.359 


0.428 


0.508 


0.562 




U. D D I 


U.H 1 o 




U. J43 


24 


0.343 


0.409 


0.485 


0.537 


25 


0.336 


0.400 


0.475 


0.526 


26 


0.329 


0.392 


0.465 


0.515 


27 


0.323 


0.385 


0.456 


0.505 


28 


0.317 


0.377 


0.448 


0.496 


29 


0.31 1 


0.370 


0.440 


0.487 


30 


0.305 


0.364 


0.432 


0.478 



Source: E. G. Olds, "Distribution of Sums of Squares of Rank Differences for Small Samples, 
Annals of Mathematical Statistics 9 (1 938). 
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n = number of observations 

k = number of independent variables 





k = 


1 


k = 


2 


k = 


3 


k = 


4 


k = 


5 


n 


d L 


d u 


d L 


d u 


d L 


d u 


d L 


d u 


d i 


d U 


15 


1.08 


1.36 


0.95 


1.54 


0.82 


1.75 


0.69 


1.97 


0.56 


2.21 


16 


1.10 


1.37 


0.98 


1.54 


0.86 


1.73 


0.74 


1.93 


0.62 


2.15 


17 


1.13 


1.38 


1.02 


1.54 


0.90 


1.71 


0.78 


1.90 


0.67 


2.10 


18 


1.16 


1.39 


1.05 


1.53 


0.93 


1.69 


0.82 


1.87 


0.71 


2.06 


19 


1.18 


1.40 


1.08 


1.53 


0.97 


1.68 


0.86 


1.85 


0.75 


2.02 


20 


1.20 


1.41 


1.10 


1.54 


1.00 


1.68 


0.90 


1.83 


0.79 


1.99 


21 


1.22 


1.42 


1.13 


1.54 


1.03 


1.67 


0.93 


1.81 


0.83 


1.96 


22 


1.24 


1.43 


1.15 


1.54 


1.05 


1.66 


0.96 


1.80 


0.86 


1.94 


23 


1.26 


1.44 


1.17 


1.54 


1.08 


1.66 


0.99 


1.79 


0.90 


1.92 


24 


1.27 


1.45 


1.19 


1.55 


1.10 


1.66 


1.01 


1.78 


0.93 


1.90 


25 


1.29 


1.45 


1.21 


1.55 


1.12 


1.66 


1.04 


1.77 


0.95 


1.89 


26 


1.30 


1.46 


1.22 


1.55 


1.14 


1.65 


1.06 


1.76 


0.98 


1.88 


27 


1.32 


1.47 


1.24 


1.56 


1.16 


1.65 


1.08 


1.76 


1.01 


1.86 


28 


1.33 


1.48 


1.26 


1.56 


1.18 


1.65 


1.10 


1.75 


1.03 


1.85 


29 


1.34 


1.48 


1.27 


1.56 


1.20 


1.65 


1.12 


1.74 


1.05 


1.84 


30 


1.35 


1.49 


1.28 


1.57 


1.21 


1.65 


1.14 


1.74 


1.07 


1.83 


31 


1.36 


1.50 


1.30 


1.57 


1.23 


1.65 


1.16 


1.74 


1.09 


1.83 


32 


1.37 


1.50 


1.31 


1.57 


1.24 


1.65 


1.18 


1.73 


1.11 


1.82 


33 


1.38 


1.51 


1.32 


1.58 


1.26 


1.65 


1.19 


1.73 


1.13 


1.81 


34 


1.39 


1.51 


1.33 


1.58 


1.27 


1.65 


1.21 


1.73 


1.15 


1.81 


35 


1.40 


1.52 


1.34 


1.58 


1.28 


1.65 


1.22 


1.73 


1.16 


1.80 


36 


1.41 


1.52 


1.35 


1.59 


1.29 


1.65 


1.24 


1.73 


1.18 


1.80 


37 


1.42 


1.53 


1.36 


1.59 


1.31 


1.66 


1.25 


1.72 


1.19 


1.80 


38 


1.43 


1.54 


1.37 


1.59 


1.32 


1.66 


1.26 


1.72 


1.21 


1.79 


39 


1.43 


1.54 


1.38 


1.60 


1.33 


1.66 


1.27 


1.72 


1.22 


1.79 


40 


1.44 


1.54 


1.39 


1.60 


1.34 


1.66 


1.29 


1.72 


1.23 


1.79 


45 


1.48 


1.57 


1.43 


1.62 


1.38 


1.67 


1.34 


1.72 


1.29 


1.78 


50 


1.50 


1.59 


1.46 


1.63 


1.42 


1.67 


1.38 


1.72 


1.34 


1.77 


55 


1.53 


1.60 


1.49 


1.64 


1.45 


1.68 


1.41 


1.72 


1.38 


1.77 


60 


1.55 


1.62 


1.51 


1.65 


1.48 


1.69 


1.44 


1.73 


1.41 


1.77 


65 


1.57 


1.63 


1.54 


1.66 


1.50 


1.70 


1.47 


1.73 


1.44 


1.77 


70 


1.58 


1.64 


1.55 


1.67 


1.52 


1.70 


1.49 


1.74 


1.46 


1.77 


75 


1.60 


1.65 


1.57 


1.68 


1.54 


1.71 


1.51 


1.74 


1.49 


1.77 


80 


1.61 


1.66 


1.59 


1.69 


1.56 


1.72 


1.53 


1.74 


1.51 


1.77 


85 


1.62 


1.67 


1.60 


1.70 


1.57 


1.72 


1.55 


1.75 


1.52 


1.77 


90 


1.63 


1.68 


1.61 


1.70 


1.59 


1.73 


1.57 


1.75 


1.54 


1.78 


95 


1.64 


1.69 


1.62 


1.71 


1.60 


1.73 


1.58 


1.75 


1.56 


1.78 


100 


1.65 


1.69 


1.63 


1.72 


1.61 


1.74 


1.59 


1.76 


1.57 


1.78 



TABLE A.12 



Values of d L and d u for 
the Durbin-Watson Test 



for c 



0.05 



Source: From J. Durbin and G.S. Watson, "Testing for Serial Correlation in Least Squares Regression," 
Biometrika, 38 June, 1951. 
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(continued) 
Values of d L and d u for 
the Durbin-Watson Test 
for a = 0.025 



n = 
k = 


number of observations 
number of independent variables 














n 


k = 


1 


k = 


2 


k = 


3 


k = 


4 


k = 


5 




d u 




d u 


d L 


d u 


d L 


du 


d L 


d u 


15 


0.95 


1.23 


0.83 


1.40 


0.71 


1.61 


0.59 


1.84 


0.48 


2.09 


16 


0.98 


1.24 


0.86 


1.40 


0.75 


1.59 


0.64 


1.80 


0.53 


2.03 


17 


1.01 


1.25 


0.90 


1.40 


0.79 


1.58 


0.68 


1.77 


0.57 


1.98 


18 


1.03 


1.26 


0.93 


1.40 


0.82 


1.56 


0.72 


1.74 


0.62 


1.93 


19 


1.06 


1.28 


0.96 


1.41 


0.86 


1.55 


0.76 


1.72 


0.66 


1.90 


20 


1.08 


1.28 


0.99 


1.41 


0.89 


1.55 


0.79 


1.70 


0.70 


1.87 


21 


1.10 


1.30 


1.01 


1.41 


0.92 


1.54 


0.83 


1.69 


0.73 


1.84 


22 


1.12 


1.31 


1.04 


1.42 


0.95 


1.54 


0.86 


1.68 


0.77 


1.82 


23 


1.14 


1.32 


1.06 


1.42 


0.97 


1.54 


0.89 


1.67 


0.80 


1.80 


24 


1.16 


1.33 


1.08 


1.43 


1.00 


1.54 


0.91 


1.66 


0.83 


1.79 


25 


1.18 


1.34 


1.10 


1.43 


1.02 


1.54 


0.94 


1.65 


0.86 


1.77 


26 


1.19 


1.35 


1.12 


1.44 


1.04 


1.54 


0.96 


1.65 


0.88 


1.76 


27 


1.21 


1.36 


1.13 


1.44 


1.06 


1.54 


0.99 


1.64 


0.91 


1.75 


28 


1.22 


1.37 


1.15 


1.45 


1.08 


1.54 


1.01 


1.64 


0.93 


1.74 


29 


1.24 


1.38 


1.17 


1.45 


1.10 


1.54 


1.03 


1.63 


0.96 


1.73 


30 


1.25 


1.38 


1.18 


1.46 


1.12 


1.54 


1.05 


1.63 


0.98 


1.73 


31 


1.26 


1.39 


1.20 


1.47 


1.13 


1.55 


1.07 


1.63 


1.00 


1.72 


32 


1.27 


1.40 


1.21 


1.47 


1.15 


1.55 


1.08 


1.63 


1.02 


1.71 


33 


1.28 


1.41 


1.22 


1.48 


1.16 


1.55 


1.10 


1.63 


1.04 


1.71 


34 


1.29 


1.41 


1.24 


1.48 


1.17 


1.55 


1.12 


1.63 


1.06 


1.70 


35 


1.30 


1.42 


1.25 


1.48 


1.19 


1.55 


1.13 


1.63 


1.07 


1.70 


36 


1.31 


1.43 


1.26 


1.49 


1.20 


1.56 


1.15 


1.63 


1.09 


1.70 


37 


1.32 


1.43 


1.27 


1.49 


1.21 


1.56 


1.16 


1.62 


1.10 


1.70 


38 


1.33 


1.44 


1.28 


1.50 


1.23 


1.56 


1.17 


1.62 


1.12 


1.70 


39 


1.34 


1.44 


1.29 


1.50 


1.24 


1.56 


1.19 


1.63 


1.13 


1.69 


40 


1.35 


1.45 


1.30 


1.51 


1.25 


1.57 


1.20 


1.63 


1.15 


1.69 


45 


1.39 


1.48 


1.34 


1.53 


1.30 


1.58 


1.25 


1.63 


1.21 


1.69 


50 


1.42 


1.50 


1.38 


1.54 


1.34 


1.59 


1.30 


1.64 


1.26 


1.69 


55 


1.45 


1.52 


1.41 


1.56 


1.37 


1.60 


1.33 


1.64 


1.30 


1.69 


60 


1.47 


1.54 


1.44 


1.57 


1.40 


1.61 


1.37 


1.65 


1.33 


1.69 


65 


1.49 


1.55 


1.46 


1.59 


1.43 


1.62 


1.40 


1.66 


1.36 


1.69 


70 


1.51 


1.57 


1.48 


1.60 


1.45 


1.63 


1.42 


1.66 


1.39 


1.70 


75 


1.53 


1.58 


1.50 


1.61 


1.47 


1.64 


1.45 


1.67 


1.42 


1.70 


80 


1.54 


1.59 


1.52 


1.62 


1.49 


1.65 


1.47 


1.67 


1.44 


1.70 


85 


1.56 


1.60 


1.53 


1.63 


1.51 


1.65 


1.49 


1.68 


1.46 


1.71 


90 


1.57 


1.61 


1.55 


1.64 


1.53 


1.66 


1.50 


1.69 


1.48 


1.71 


95 


1.58 


1.62 


1.56 


1.65 


1.54 


1.67 


1.52 


1.69 


1.50 


1.71 


100 


1.59 


1.63 


1.57 


1.65 


1.55 


1.67 


1.53 


1.70 


1.51 


1.72 
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n = 
k = 


number of observations 
number of independent variables 














n 


k = 


1 


k = 


2 


k = 


3 


k = 


4 


k = 


5 




d u 




d u 


d L 


d u 


d L 


du 


d L 


d u 


15 


0.81 


1.07 


0.70 


1.25 


0.59 


1.46 


0.49 


1.70 


0.39 


1.96 


16 


0.84 


1.09 


0.74 


1.25 


0.63 


1.44 


0.53 


1.66 


0.44 


1.90 


17 


0.87 


1.10 


0.77 


1.25 


0.67 


1.43 


0.57 


1.63 


0.48 


1.85 


18 


0.90 


1.12 


0.80 


1.26 


0.71 


1.42 


0.61 


1.60 


0.52 


1.80 


19 


0.93 


1.13 


0.83 


1.26 


0.74 


1.41 


0.65 


1.58 


0.56 


1.77 


20 


0.95 


1.15 


0.86 


1.27 


0.77 


1.41 


0.68 


1.57 


0.60 


1.74 


21 


0.97 


1.16 


0.89 


1.27 


0.80 


1.41 


0.72 


1.55 


0.63 


1.71 


22 


1.00 


1.17 


0.91 


1.28 


0.83 


1.40 


0.75 


1.54 


0.66 


1.69 


23 


1.02 


1.19 


0.94 


1.29 


0.86 


1.40 


0.77 


1.53 


0.70 


1.67 


24 


1.04 


1.20 


0.96 


1.30 


0.88 


1.41 


0.80 


1.53 


0.72 


1.66 


25 


1.05 


1.21 


0.98 


1.30 


0.90 


1.41 


0.83 


1.52 


0.75 


1.65 


26 


1.07 


1.22 


1.00 


1.31 


0.93 


1.41 


0.85 


1.52 


0.78 


1.64 


27 


1.09 


1.23 


1.02 


1.32 


0.95 


1.41 


0.88 


1.51 


0.81 


1.63 


28 


1.10 


1.24 


1.04 


1.32 


0.97 


1.41 


0.90 


1.51 


0.83 


1.62 


29 


1.12 


1.25 


1.05 


1.33 


0.99 


1.42 


0.92 


1.51 


0.85 


1.61 


30 


1.13 


1.26 


1.07 


1.34 


1.01 


1.42 


0.94 


1.51 


0.88 


1.61 


31 


1.15 


1.27 


1.08 


1.34 


1.02 


1.42 


0.96 


1.51 


0.90 


1.60 


32 


1.16 


1.28 


1.10 


1.35 


1.04 


1.43 


0.98 


1.51 


0.92 


1.60 


33 


1.17 


1.29 


1.11 


1.36 


1.05 


1.43 


1.00 


1.51 


0.94 


1.59 


34 


1.18 


1.30 


1.13 


1.36 


1.07 


1.43 


1.01 


1.51 


0.95 


1.59 


35 


1.19 


1.31 


1.14 


1.37 


1.08 


1.44 


1.03 


1.51 


0.97 


1.59 


36 


1.21 


1.32 


1.15 


1.38 


1.10 


1.44 


1.04 


1.51 


0.99 


1.59 


37 


1.22 


1.32 


1.16 


1.38 


1.1 1 


1.45 


1.06 


1.51 


1.00 


1.59 


38 


1.23 


1.33 


1.18 


1.39 


1.12 


1.45 


1.07 


1.52 


1.02 


1.58 


39 


1.24 


1.34 


1.19 


1.39 


1.14 


1.45 


1.09 


1.52 


1.03 


1.58 


40 


1.25 


1.34 


1.20 


1.40 


1.15 


1.46 


1.10 


1.52 


1.05 


1.58 


45 


1.29 


1.38 


1.24 


1.42 


1.20 


1.48 


1.16 


1.53 


1.11 


1.58 


50 


1.32 


1.40 


1.28 


1.45 


1.24 


1.49 


1.20 


1.54 


1.16 


1.59 


55 


1.36 


1.43 


1.32 


1.47 


1.28 


1.51 


1.25 


1.55 


1.21 


1.59 


60 


1.38 


1.45 


1.35 


1.48 


1.32 


1.52 


1.28 


1.56 


1.25 


1.60 


65 


1.41 


1.47 


1.38 


1.50 


1.35 


1.53 


1.31 


1.57 


1.28 


1.61 


70 




1 AO, 


1 /in 

I 


I .Z)Z 


1.37 


1.55 


1.34 


1.58 


1.31 


1.61 


75 


1.45 


1.50 


1.42 


1.53 


1.39 


1.56 


1.37 


1.59 


1.34 


1.62 


80 


1.47 


1.52 


1.44 


1.54 


1.42 


1.57 


1.39 


1.60 


1.36 


1.62 


85 


1.48 


1.53 


1.46 


1.55 


1.43 


1.58 


1.41 


1.60 


1.39 


1.63 


90 


1.50 


1.54 


1.47 


1.56 


1.45 


1.59 


1.43 


1.61 


1.41 


1.64 


95 


1.51 


1.55 


1.49 


1.57 


1.47 


1.60 


1.45 


1.62 


1.42 


1.64 


100 


1.52 


1.56 


1.50 


1.58 


1.48 


1.60 


1.46 


1.63 


1.44 


1.65 



TABLE A.12 



(continued) 
Values of d L and d u for 
the Durbin-Watson Test 
for a = 0.01 
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TABLE A.13 



Factors for Determining Number of Factor for Determining 

3-Sigma Control Limits, Observations Control Limits, Control 

Mean and Range Control ,„ £ach Samp|e chart for the Mean _ 



2 


1.880 


0 


3.267 


3 


1.023 


0 


2.575 


4 


0.729 


0 


2.282 


5 


0.577 


0 


2.1 15 


6 


0.483 


0 


2.004 


7 


0.419 


0.076 


1.924 


8 


0.373 


0.136 


1.864 


9 


0.337 


0.184 


1.816 


10 


0.308 


0.223 


1.777 


11 


0.285 


0.256 


1.744 


12 


0.266 


0.284 


1.716 


13 


0.249 


0.308 


1.692 


14 


0.235 


0.329 


1.671 


15 


0.223 


0.348 


1.652 



Source: From E. S. Pearson, "The Percentage Limits for the Distribution of Range in Samples from a Normal 
Population," Biometrika 24 (1932): 416. 



Factors for Determining 
Control Limits, Control 
Chart for the Range 

~dT D, 



Answers to Selected 
Odd-Numbered Exercises 



Chapter 2 

2.3 a. 40.88 million b. lower limit is 35, upper limit is 
under 45 c. 10 years d. 40 years 2.5 a. 172.21 thousand 
b. lower limit is 45, upper limit is under 55 c. 10 years 
d. 50 years 2.45 a. y = 323.53 + 0.8112* b. yes, direct 
2.55 a. 1290 cities b. 2729 cities c. 242 cities; 8.65% 
d. 175,000 2.67 a. yes b. lawjudge = 0.381 + 0.90*acad 
2.69 a. yes b. Canada = 1.0894 + 1.754*U.S. 
2.71 c. highest: mechanical 2 and electrical 1; lowest: 
mechanical 1 and electrical 3 

Chapter 3 

3.1 x = $20.45, median = $20,495 3.3 x = 57.05 visitors, 
median = 57.50, mode = 63 3.5 x = 8.35, median = 8.60 

3.7 x =37.7, median = 35.0 3.9 83.2 3.11 a. mean 

b. median 3.13 x = 398.86, median = 396.75 3.15 females: 
x = 40.62, median = 39.00; males: x = 41.08, median = 41.50 
3.17 range = 39, MAD = 19.71 visitors, s = 12.40, s 2 = 153.84 
3.19 a. ^i = 29.11million,median = 22.4million,range = 44.5 
million, midrange = 39.45 million b. MAD = 12.99 million 

c. <t= 15.44 million, a 2 = 238.396 3.21 a. x=27.2mpg, 
median = 28 mpg, range = 30 mpg, midrange = 25 mpg 

b. MAD = 5.6 mpg c. s = 8.052, s 2 = 64.84 3.23 Q 1 = 7, 
Q 2 = 18, Q 3 = 30, interquartile range = 23, quartile deviation = 
11.5 3.25 a. x = 23.35, median = 22.86, range = 26.13, 
midrange = 26.465 b. MAD = 4.316 c. s = 5.49, s 2 = 30.14 
3.27 a. x = 90.771, median = 91.4, range = 40.4, midrange = 
92.0 b. MAD = 6.70 c. s = 8.36, s 2 = 69.90 3.29 a. 84% 
b. 88.89% c. 96% 3.31 90%, yes 3.33 a. 68% 
b. 2.5% c. 84% d. 13.5% 3.35 Barnsboro 
3.39 a. approximately 8.40 and 0.912 3.41 approximately 
21.75 and 15.65 3.43 r= -0.8 3.45 lawjudge = 0.3805 + 
0.9*acad,r 2 = 0.9454,r = 0.972 3.47generic= -3.5238 + 
0.377*brand,r 2 = 0.7447, r = 0.863 3.49 $4.69 3.51 x = 
117.75, median = 117.5, no 3.53 no, positively skewed 
3.55 a. x becomes 3.1 lbs., s remains 0.5 lbs. b. 4. libs. 
3.57 a. x = 2. 10,median = 2. 115, range = 0.46, midrange = 

2.08 b. MAD = 0.137 c. s = 0.156, s 2 = 0.02446 

3.59 120, 116, 124, 20, symmetrical 3.61 greater variation 
for data in exercise 3.60 



Chapter 4 

4.3 a. secondary b. secondary 4.11 response error 

4.13 telephone 4.39 systematic 4.43 a. sample b. sample 

c. sample d. census 4.69 a. 56 b. 144 

Chapter 5 

5.1 subjective 5.7 decrease for men, increase for women 
5.9 0.36 5.13 a. 0 b. 196 c. 147 d. 372 5.19 0.945, 
0.035 5.21 0.35,0.65 5.23 0.71 5.25 b. 0.13 c. 0.35 

d. 0.83 5.27 0.91 5.29 a. 0.947 b. 0.710 c. 0.351 

d. 0.992 5.31 no, no 5.33 a. 0.184 b. 0.123 c. 0.034 
d. 0.659 5.35 0.851 5.37 a. 0.019 b. 0.745 c. 0.236 
5.39 0.175 5.43 0.154 5.45 a. 0.51 b. 0.85 
c. 0.77 5.47 a. 0.5 b. 0.455 5.49 a. 0.1 b. 0.233 
5.51 256 5.53 36 5.57 20 5.59 7.9019 a '10 27 
5.61 a. 0.000000005 b. 0.0625 c. 0.062500005 d. 0 
5.63 a. 0.001 b. classical c. yes 5.65 a. 3 b. 3 c. 1 
5.67 a. 0.005 b. 0.855 c. 0.14 d. 0.995 5.69 a. 0.85 
b. 0.278 c. 0.046 5.71 a. 0.7 b. 0.9 c. 0.667 
5.73 a. 0.974 b. 4.147 a 10- 12 c. 10 5.75 22 
5.77 0.296, 0.963 5.79 a. 0.9 b. 0.429 5.81 120 
5.83 1320 5.85 10,000 

Chapter 6 

6.3 a. discrete b. continuous c. discrete d. continuous 
6.5 fj, = 7.2, (t = 2.18, o 2 = 4.76 6.7 fj, = 1.9, a = 1.14, a 2 = 
1.29 6.9 $10 6.11 E(x) = 3.1 6.13 $0.50 6.15 $37.9 
million, $41.9 million, $40.3 million, minor 6.19 a. 3.6 
b. 1.59 c. 0.2397 d. 0.9133 e. 0.5075 6.21 a. 0.0102 
b. 0.2304 c. 0.3456 d. 0.0778 6.23 a. 0.9703 
b. 0.0294 c. 0.0003 d. 0.0000 6.25 a. 0.9375 
b. 0.6875 6.27 0.1250, 0.1250, 0.3750 6.29 0.3439 
6.31 a. 0.00187 b. 0.0185 c. yes 6.35 a. 0.1000 
b. 0.6000 c. 0.3000 6.37 0.9000 6.39 a. 0.1353 

b. 0.2707 c. 0.8571 d. 0.5940 6.41 a. 5.3 b. 0.1239 

c. 0.1740 d. 0.9106 e. 0.8784 6.43 a. 1.9 b. 0.2842 
c. 0.0812 d. 0.9966 e. 0.5531 6.45 0.0047, should 
consider slight decrease 6.47 a. 10.0 b. 0.0901 

c. 0.1251 d. 0.7916 e. 0.7311 6.49 0.2275, $45,000 
6.51 not merely a coincidence 6.55 0.7174 6.57 no 
6.61 87.95% 6.63 0.3684 6.65 0.1837 6.67 0.6065 
6.69 0.1323 6.71 0.1904 6.73 0.0839 6.75 0.9872 
6.77 0.7748 6.79 0.0012, not believable 6.81 0.1667, 
0.2853, 0.0327 
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Chapter 7 

7.9 a. 0.5 b. approx 0.683 c. approx 0.8415 d. 0 
e. approx 0.4775 f. approx 0.9775 7.11 a. 0.5 b. approx 
0.955 c. approx 0.683 d. approx 0.9985 7.13 a. approx 
0.1585 b. approx 0.683 c. approx 0.0225 d. approx 
0.819 7.15 a. approx 0.0015 b. approx 0.1585 
7.17 a. -0.67,0.67 b. -1.28,1.28 c. -0.74,0.74 
7.19 a. -2.00 b. -0.80 c. 0.00 d. 3.40 e. 4.60 
7.21 a. 0.3643 b. 0.1357 c. 0.9115 7.23 a. 0.8730 

b. 0.2272 c. 0.1091 7.25 a. 0.52 b. 2.05 c. 0.10 
d. 0.52 7.27 a. 0.0874 b. 0.8790 c. 0.8413 

7.29 a. 0.4207 b. 0.3050 c. 0.3446 7.31 $343,350 
7.33 a. 0.4207 b. $9545 7.35 3.22 minutes 7.37 70,500 
7.41 a. 10.0,2.739 b. 0.1098,0.2823,0.3900,0.1003 
7.43 a. 12.0,2.19 b. 0.1797 c. 0.1820 d. 0.8729 
7.45 0.6198 7.47 0.6037 7.53 a. 0.5488 b. 0.4493 

c. 0.3679 d. 0.3012 7.55 0.2865 7.57 0.619,0.383, 
8779.7 hours 7.61 0.1151 7.63 0.3911 7.65 no 

7.67 0.0808 7.69 0.1970 7.71 0.8023 7.73 not credible 
7.75 0.6604 7.77 0.4584, 0.2101 7.79 0.4307 
7.81 0.0918,0.0446 7.83 20.615 7.85 a. 0.4628 
b. 0.7772 c. 12,900 7.87 0.4724 

Chapter 8 

8.3 0.18,0.15,1000 8.7 a. 25 b. 10 c. 5 d. 3.162 
8.9 a. 0.8413 b. 0.6853 c. 0.9938 8.11 0.8849, 0.8823 
8.13 concern is justified 8.15 0.1056 8.17 0.1949 
8.19 0.9881 8.21 tt= 0.265, a < 0.079 8.23 a. 0.9616 
b. 0.5222 c. 0.9616 8.25 a. 0.426 b. 0.35 c. 0.035 

d. 0.9850 8.27 0.9938 8.29 0.0000; district's claim is much 
more credible 8.35 0.1170 8.37 0.8023 8.43 a. 0.0183 

b. not credible 8.45 b. 0.038 8.47 a. 2.40 b. 0.0082 

c. no 8.51 a. 0.10 b. 0.0062 c. no 8.53 0.0643 
8.55 0.0000 8.57 0.0228 

Chapter 9 

9.7 a. 2.625 b. 4.554 9.11 a. 0.45 b. (0.419,0.481) 
c. 95%, 0.95 9.15 90%: (236.997, 243.003); 95%: 
(236.422, 243.578) 9.17 90%: (82.92, 87.08); 95%: (82.52, 
87.48) 9.19(149.006,150.994) 9.23(99.897,100.027); 
yes 9.27 1.313 9.29 a. 1.292 b. -1.988 c. 2.371 
9.31 95%: (46.338, 54.362); 99%: (44.865, 55.835) 
9.33 a. (22.16, 27.84) b. yes 9.35 (28.556, 29.707) 
9.37 (1520.96,1549.04) 9.39 (10.15, 13.55); no 
9.41 (21.92, 22.68); yes 9.43 (0.429,0.491) 9.45 (0.153, 
0.247); not credible 9.47 (0.37,0.43) 9.49 (0.5794, 
0.6246) 9.51 a. (0.527,0.613) 9.53 (0.925,0.975) 
9.55 (0.566, 0.634); no; may not succeed 9.57(0.311,0.489) 
9.61 92 9.63 1068 9.65 863 9.67 601 9.71 95%: 
(0.522, 0.578); 99%: (0.513, 0.587) 9.73 458 9.75 92 
9.77 462 9.79 1226 9.81 0.01 9.83 (135.211, 138.789) 
9.85 246 9.87 90%: (44.91, 49.96); 95%: (44.39, 50.47) 
9.89 411 9.91 $1200 9.93 95%: (0.347, 0.433); 99%: 
(0.334, 0.446) 9.95 4161 9.97 722 9.99 1068 
9.101 1469 9.103 3745 9.105 90%: (0.375, 0.425); 95%: 
(0.370,0.430) 9.107(0.018,0.062) 9.109 ($24.33, $25.67) 
9.111 (64.719, 68.301); funds are not endangered 

Chapter 10 

10.3 a. no b. yes c. yes d. no e. yes f. no 

10.5 type I 10.7 type II 10.13 type I 10.17 a. numerically 



high b. numerically low 10.23 no; do not reject H 0 
10.25 a. 0.0618 b. 0.1515 c. 0.0672 10.27 reject H 0 ; 
p-value = 0.021 10.29 do not reject H Q ; p-value = 0.035 
10.31 no; do not reject H 0 ; p-value = 0.052 10.33 p-value = 
0.035; do not reject H 0 10.35 a. do not reject H 0 b. reject 
H Q c. do not reject H Q d. reject H Q 10.37 (1.995,2.055); 
do not reject H a ; same 10.41 do not reject H 0 10.43 no; do 
not reject H 0 10.45 no; reject H 0 10.47 yes; reject H 0 
10.49 do not reject H 0 10.51 yes; reject H 0 10.53 (86.29, 
92.71); yes; yes 10.55 (35.657, 37.943); no; yes 10.57 0.03; 
reject H fl 10.61 reject H 0 10.63 reject H Q 10.65 no; reject 
H 0 10.67 yes; reject H Q ; p-value = 0.005 10.69 reject H 0 ; 
p-value = 0.023 10.71 yes; p-value = 0.118 10.73 no; do 
not reject H fl ; p-value = 0.079 10.75 (0.735, 0.805); yes; yes 
10.77 (0.402, 0.518); yes; yes 10.81 alpha unchanged, beta 
decreases 10.83 0.9871 10.87 a. 2.33 b. 0.036 
10.93 reject H Q , has increased 10.95 a. 0.05 level: reject H Q 

b. 95% CI: (179,278; 198,322) 10.97 yes 10.99 reject H 0 
10.101 reject H 0 ; statement not credible 10.103 do not reject 
H 0 ; claim is credible 10.105 a. 0.9505 b. 0.8212 

c. 0.5753 d. 0.2946 e. 0.1020 10.107 b. e.g., 0.005 
c. e.g., 0.02 d. 0.024 10.109 yes; reject H 0 ; p-value = 
0.007 10.111 no; do not reject H 0 ; p-value = 0.059 
10.113 do not reject H 0 ; p-value = 0.282 

Chapter 11 

11.3 do not reject H 0 , > 0.20 11.5 yes; reject H 0 11.7 do 
not reject H Q ; between 0.05 and 0.10; (-8.872, 0.472) 
11.9 reject H Q ; between 0.10 and 0.05; 90% CI: (-1.318, 
-0.082) 11.11 claim could be valid; reject H 0 ; between 0.01 
and 0.025 11.13 yes; p-value = 0.0000 11.15(0.639, 
3.961); no; yes 11.17 (-3.33, 31.20); yes; yes 11.19 do not 
reject H Q 11.21 Yes, do not reject H„; > 0.20; 95% CI: 
(-23.43, 6.43) 11.23 do not reject H Q ; 0.1400; (-117.18, 
17.18); yes; yes 11.25 do not reject H 0 ; (—2.03, —1.37); yes; 
yes 11.27 reject H 0 ; 0.000 11.29 do not reject H Q ; 0.095 
11.33 reject H Q 11.35 Yes, do not reject H 0 ; 0.2538; 95% 
CI: (-23.10, 6.10) 11.37 do not reject H 0 ; 0.1967; (-7.77, 
37.77) 11.39 reject H 0 ; 0.0122 11.41 reject H„; 0.03 
11.43 do not reject H Q ; 0.12 11.45 dependent 11.47 reject 
H 0 11.49 reject H 0 ; between 0.005 and 0.01 11.51 do not 
reject H 0 ; 0.170; (-4.663, 1.774) 11.53 reject H 0 11.55 no; 
do not reject H 0 11.57 do not reject H 0 11.59 yes; reject 
H 0 ; 0.0607; (-0.150, -0.010) 11.61 reject H 0 ; 0.0583 
11.63 do not reject H 0 ; 0.077; (-0.032, 0.172) 11.65 do not 
reject H 0 ; 0.1866; (-0.055, 0.005) 11.67 do not reject H 0 ; 
no, no 11.69 yes; do not reject H 0 ; no 11.71 yes; reject H 0 
11.73 reject H Q 11.75 suspicion confirmed; reject H Q ; 
between 0.025 and 0.05 11.77 do not reject H 0 ; (-505.32, 
65.32) 11.79 do not reject H 0 ;> 0.10 11.81 yes; reject H Q ; 
0.0141 11.83 reject H 0 ; table: < 0.01; computer: 0.0075; 
(0.13,6.67) 11.85 do not reject H 0 ; (-1.13, 0.13) 
11.87 yes; reject H 0 11.89 no; do not reject H 0 11.91 yes; 
reject H Q ; < 0.005 11.93 yes; reject H 0 ; 0.001 11.95 yes; do 
not reject H 0 ; no 11.97 do not reject H„; 0.140; (-1.95, 
12.96) 11.99 not supported; do not reject H„; 0.135 
11.101 do not reject H 0 ; 0.414; (-0.1204, 0.0404) 

Chapter 12 

12.7 designed 12.21 do not reject ff 0 ; > 0.05 12.23 do not 
reject H Q ; > 0.05 12.25 do not reject H 0 12.27 b. reject 
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H 0 12.29 b. do not reject H 0 c. ^: (48.914, 61.086); fi 2 : 
(42.714, 54.886); /i, 3 : (39.514, 51.686) 12.35 fi^. (16.271, 
19.009); fji 2 : (13.901, 16.899); (15.690, 18.060) 
12.47 reject H Q ; between 0.025 and 0.05 12.49 do not reject 
H 0 ; do not reject H 0 12.51 do not reject H 0 ; do not reject H 0 
12.53 reject H 0 12.55 reject H 0 12.59 yes; reject H 0 
12.69 Factor A, do not reject ff 0 ; Factor B, reject ff 0 ; 
Interaction, reject H 0 12.71 Factor A, do not reject H 0 ; Factor 
B, reject H 0 ; Interaction, do not reject H 0 12.73 Factor A, 
reject H 0 ; Factor B, reject H 0 ; Interaction, reject H 0 
12.75 Assembly, do not reject H 0 ; Music, reject H 0 ; Interaction, 
reject H Q ; Method 1, (35.846, 41.654); Method 2, (34.096, 
39.904); Classical, (29.096, 34.904); Rock, (40.846, 46.654) 
12.77 Bag, do not reject H 0 ; Dress, reject H 0 ; Interaction, reject 
H 0 ; Carry, (24.798, 30.536); Don't Carry, (27.298, 33.036); 
Sloppy, (41.736, 48.764); Casual, (17.736, 24.764); Dressy, 
(16.736, 23.764) 12.79 Keyboard, reject H Q ; Wordpack, do 
not reject H 0 ; Interaction, reject H 0 12.83 randomized block 
12.85 independent: faceplate design; dependent: time to 
complete task; designed 12.87 no, randomized block 
procedure should be used 12.89 reject H 0 12.91 do not 
reject H Q 12.93 reject H 0 12.95 do not reject H 0 
12.97 reject H 0 12.99 Style, do not reject H 0 ; Darkness, 
reject H Q ; Interaction, do not reject H Q ; Style 1, (25.693, 
30.307); Style 2, (23.693, 28.307); Light, (27.425, 33.075); 
Medium, (22.175, 27.825); Dark, (22.925, 28.575) 
12.101 Position, reject H 0 ; Display, do not reject H 0 ; 
Interaction, reject H 0 ; Position 1, (42.684, 47.982); Position 2, 
(47.129, 52.427); Display 1, (42.089, 48.577); Display 2, 
(43.923, 50.411); Display 3, (46.923, 53.411) 

Chapter 13 

13.7 a. 3.490 b. 13.362 c. 2.733 d. 15.507 e. 17.535 
f. 2.180 13.9 a. A = 8.547, B = 22.307 b. A = 7.261, B = 
24.996 c. A = 6.262, B = 27.488 d. A = 5.229, B = 
30.578 13.13 4 13.15 a. 6 b. 12.592 c. do not reject 
Hq 13.17 yes; reject H 0 13.19 do not reject H Q 
13.21 do not reject H 0 13.23 do not reject H 0 13.25 reject 
H 0 13.29 a. 6 b. 2 c. 12 d. 8 e. 12 f. 4 
13.31 a. 12.833 b. 15.507 c. 16.812 d. 7.779 
13.33 no, reject H 0 ; between 0.025 and 0.01 13.35 no; reject 
H 0 ; between 0.05 and 0.025 13.37 no; reject H Q ; less than 
0.01 13.39 yes; do not reject H Q 13.41 yes; reject H Q 
13.43 do not reject H 0 13.45 do not reject H 0 13.47 reject 
H 0 13.49 yes; do not reject H Q 13.53(15.096,43.011) 
13.55(2.620,9.664) 13.57(0.1346,0.2791) 13.59 do 
not reject H 0 13.61 do not reject H 0 ; between 0.05 and 
0.025 13.63 (0.0329,0.0775) 13.65 do not reject H 0 
13.67 reject H 0 13.69 yes; do not reject H 0 13.71 do not 
reject H Q ; between 0.10 and 0.05 13.73 no; do not reject ff 0 ; 
between 0.10 and 0.05 13.75 (0.02001,0.05694) 
13.77 reject H 0 13.79 do not reject H Q 13.81 reject H Q 
13.83 reject H 0 13.85 do not reject H 0 13.87 probably not; 
reject H Q 

Chapter 14 

14.7 reject H Q ; < 0.005 14.9 yes; reject H 0 ; 0.017 14.11 do 
not reject H Q ; 0.700 14.13 reject H Q 14.15 yes, reject H 0 
14.17 do not reject H 0 ; between 0.05 and 0.10 14.19 no; do 
not reject H 0 ; 0.137 14.23 do not reject H 0 14.25 yes; do 
not reject H 0 ; 0.2443 14.29 reject H 0 14.31 no; reject H 0 



14.33 no; reject H Q ; 0.058 14.37 yes; do not reject H 0 ; 
between 0.10 and 0.90 14.39 no; do not reject H Q ; 0.393 
14.43 do not reject H 0 14.45 reject H 0 14.49 do not reject 
H 0 14.51 do not reject H 0 14.53 do not reject H 0 
14.55 0.83, yes 14.57 0.343, no 14.59 reject H 0 ; 0.0287 
14.61 no, 0.736 14.63 do not reject H 0 14.65 yes; reject 
H 0 ; 0.01 14.67 no; do not reject H 0 ; between 0.05 and 0.10 
14.69 yes; do not reject H 0 ; between 0.025 and 0.05 
14.71 yes; do not reject H 0 ; between 0.025 and 0.05 
14.75 claim not credible, reject H 0 ; between 0.05 and 0.025 
14.77 0.868, yes 14.79 no; do not reject H 0 ; 0.343 
14.81 do not reject H 0 ; 0.210 14.83 they can tell the 
difference; reject H Q ; 0.016 14.85 do not reject H Q ; 0.486 

Chapter 15 

15.5 second 15.7 second 15.9 a. Shares = 44.3 + 
38.756' f Years b. 431.9 15.11 Totgross = 106.28 + 
1.474' f Twowks; 253.7 15.13 Acres = 349,550 - 7851 * Rain; 
208,223 acres 15.21 a. y = 21.701 - 1.354x b. 3.617 

c. (8.107,16.339) d. (4.647,14.383) e. interval d is wider 
15.23 a. Rating = 48.59 + 8.327*TD% b. 90.22 c. 1.765 

d. (85.228,95.214) e. (88.134,92.308) 15.25 91.48; (-33.9, 
510.1) 15.27 Revenue = -1.66' f 10 8 + 64,076,803 "Dealers; CI: 
(5.88"'10 9 , 6.60*10 9 ); PL (4.92*10 9 , 7.57*10 9 ) 15.29 CI: 
(89,209; 295,832); PL (-155,578; 540,619) 15.33 0.81 
15.37 a. y = 95.797 + 0.09788* b. 0.679,0.461 c. 106.56 
15.39 a. Millsocks = -8845 + 44. 94 * Population b. 0.904 

c. 3737.2millionpairs 15.41 r= -0.2791; r 2 = 0.0779 
15.43 Forgross = -69.382 + 1.5266' t Domgross; r = 0.7356; 
^ = 0.5411 15.45 no 15.47 > 0.10 15.49 a. reject H Q 
b. reject H Q c. (0.023, 0.172) 15.51 a. reject H 0 b. reject 
H 0 c. (0.392,9.608) 15.53 61.2,58.8 15.55 774.80, 
536.01,238.79; 0.692; do not reject H 0 15.57 Gallons = 
5,921,560.92 + 10.44806 ,f Hours; r = 0.993; r 2 = 0.986; 
no; (8.746, 12.150) 15.59 Netlncome = 59.8006 + 
0.0382' f Revenue; r = 0.7492; r 2 = 0.5612; no; (0.031, 0.045) 
15.71 Netlncome = 0.211 + 0.0999 :f TotRev; $2,009 billion 
15.73 a. Retired = 33.08 -2.893 s ' New b. 12.83 c. 7.04 
15.75 Netlncome = -0.1821 + 0.0551*Revenue; 
$0,369 million 15.77 a. Fuel = 6.089 + 77.412*Hours 

b. r = 0.995; r 2 = 0.99 c. 160.9 15.79 a. Fuel = 15.084 + 
0.0367*Miles b. 0.985, 0.971 c. 73.131 billion gallons 
15.81 a. Rear = 1855.9 - 0.3057"Front b. -0.104, 0.011 

c. $1550 15.83 a. Strength = 60.02 + 10.507' > Temptime 

b. 53.0% c. 0.017 d. 0.017 e. (2.445, 18.569) 

15.85 a. Rolresis = 9.450 - 0.08113' s psi b. 23.9% c. 0.029 

d. 0.029 e. (-0.15290, -0.00936) 15.87 CI: (8.26, 28.87); 
PI: (-6.66, 43.78) 15.89 CL (9.389, 10.076); PI: (8.844, 
10.620) 15.91 a. GPA = -0.6964 + 0.0033282' f SAT; 
2.965 b. 69.5% c. CI: (2.527, 3.402); PL (1.651, 4.278) 
15.93 a. Pay% = 7.0202 + 1.51597*Rate% b. 98.0%; yes 

c. CI: (18.9874, 19.3084); PI: (18.6159, 19.6799) 
15.95 a. Estp/e = 52.56 - 0.0959 :t Revgrow%; 38.17 

b. 0.2%; no c. CL (-0.74, 77.07); PL (-94.67, 171.01) 

Chapter 16 

16.9 a. 300, 7, 13 b. 399 16.11 a. y = 10.687 + 2.157xj 
+ 0.0416x 2 c. 24.59 16.13 a. y = -127.19 + 7.611^ + 
0.3567x 2 c. -17.79 16.15 b. 454.42 16.17 a. 130.0 
b. 3.195 c. (98.493, 101.507) d. (93.259, 106.741) 
16.19 a. CalcFin = -26.6 + 0.776*MathPro + 0.0820' f SATQ; 
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Appendix B: Answers to Selected Odd-Numbered Exercises 



90% CI: (64.01, 73.46) b. 90% PI: (59.59, 77.88) 
16.21 a. (79.587, 88.980) b. (75.562, 93.005) 
16.27 0.716 16.29 a. yes b. p v reject H 0 ; /3 2 , do not reject 
H 0 d. p v (0.54, 3.77); p 2 , (-0.07, 0.15) 16.31)35,(0.0679, 
0.4811); p 2 , (0.1928, 0.5596); j3 3 , (0.1761, 0.4768) 
16.33 a. yes; both are significant 16.35 a. y = -40,855,482 + 
44,281.6xj + 152,760.2x 2 b. yes c. jSj, reject H 0 ; p 2 , do 
not reject H 0 d. ^, (41,229.4, 47,333.8); /3 2 , (-4,446,472, 
4,751,992) 16.45 a. y = -0.5617 + 0.0003550*! + 
0.011248* 2 - 0.02116* 3 b. p v (-0.0011, 0.0018); 
P 2 , (-0.0083, 0.0308); P 3 , (-0.0847, 0.0424) c. 0.465 
16.51 Speed = 67.6 - 3. 21 "-Occupants - 6.63*Seatbelt 
16.53 a. y = 99.865 + 1.236*j + 0.822* 2 b. 125.8161bs. 
c. (124.194,127.438) d. (124.439,127.193) e. /3 X ,(0.92, 
l.55);P 2 , (0.09, 1.56) 16.55 a. gpa = -1.984+ 0.00372*sat + 
0.00658 s "rank b. 2.634 c. (1.594, 3.674) d. (2.365,2.904) 
e. p v (0.000345, 0.007093); /3 2 , (-0.010745, 0.023915) 
16.57 $38,699 



Chapter 17 

17.3 negative, negative 17.5 positive, negative, positive 

17.7 $Avgrate = 286.094 - 8.239*%Occup + 

0.07709* %Occup 2 ; 49.1% 17.9 0to60 = 26.8119 - 

0.153866 s "hp + 0.0003083 * hp 2 ; 8.396 seconds; yes 

17.11 Forgross = 860.8 - 4.152*Domgross + 

0.007689*Domgross 2 ; $430.4 million; yes 

17.13 second-order with interaction 17.15 $percall = 

61.2 + 25.63' f yrs + 6.41*score -1.82*yrs 2 - 0.058 s " score 2 + 

0.29 s yrs s score;R 2 = 0.949; yes 17.17 a. oprev = -231.2 + 

0.129 s ' employs + 0.00565 s ' departs; yes b. oprev = 

399.2 + 0.0745 s " employs + 0.00087 s " departs + 

0.00000014 s " employs* departs; R 2 increases from 0.958 

to 0.986 17.19 0to60 = 25.4 - 0.161*hp-0.00030*curbwt 

+ 0.000028 s " hp s curbwt;R 2 = 0.734; yes 17.23 two 

17.25 550 customers 17.27 price = -30.77 + 4.975 s " gb + 

54.20 s "highrpm; $54.20 17.29 productivity = 75.4 + 

1.59*yrsexp - 7.36' f metha + 9.73*methb; R 2 = 0.741 

17.31 y = 0.66(1.38)* 17.33 y = 14.5066(1.026016)* ; R 2 = 

0.509;$87.57 17.35 logrevenue = -0.1285 + 1.0040 log 

employs — 0.1121 log departs; revenue = 

0.7439 s " employs 1 0040s "departs-° n21 ; $546.2 million 

17.41 yes 17.43 multicollinearity may be present 

17.45 multicollinearity may be present 17.49 a. * 5 , x 2 , * 9 

b. y = 106.85 - 0.35* 5 - 0.33* 2 c. 0.05 level: x s , x z , * 9 ; 0.01 

level:* 2 17.59 Pages = -53.9 + 313.99 s "x-26.335 s "xSq; 

-422.6 pages 17.61 y = 10.705 + 0.974* - 0.015* 2 ; yes; 

83.6% 17.63 appartmp = -10.4 + 1.06 s 'roomtmp + 

0.0925 *relhumid; appartmp = 3.90 + 0.847*roomtmp - 

0.194 s 'relhumid + 0.00425*roomtmp*relhumid; R 2 increases 

from 0.982 to 0.994 17.65 productivity = 19.1 + 

0.211*backlog + 0.577 s " female; R 2 = 0.676; yes 

17.67 log appartmp = -0.28048 + 1.09898 log 

roomtmp + 0.054483 log relhumid; appartmp = 

0.524228*roomtmp L09898 *relhumid 0 054483 ; 71.3 degrees 

17.69 yes; opcost/hr = 697.8 + 1.80 s 'gal/hr; 87.69% 

17.71 a. yesb. final = 14.79 + 0.885' f testl; R 2 = 0.8568 



17.73 RATING = 1.955 + 4.189 s " YDS/ATT - 4.1649'TNT% 
+3.3227*TD% + 0.8336 S COMP%; 100.00% (rounded) 

Chapter 18 

18.3 369,600 gallons 18.5 with x = 1 for 2001, Earnings = 
14.17 + 0.382x; $17.99 18.7 a. subs = -12.4227 + 
15.3458x; 263.8 million b. subs = 6.5023 + 7.2351* + 
0.6239* 2 ; 338.9 million c. quadratic 18.17 the 0.4 curve; 
the 0.7 curve 18.2130% 18.23 b. 1 , 74.720; II, 103.978; 
III, 123.761; IV, 97.540 18.25 J, 100.497; F, 94.334; M, 
103.158; A, 103.596; M, 98.062; J, 100.547; J, 98.166; A, 
98.385; S, 96.864; O, 108.841; N, 93.196; D, 104.353 
18.27 $192.0 thousand, $201.6 thousand 18.29 1213.2; 
1541.2 18.31 $61.8023 billion 18.33 36.2742 quadrillion 
Btu 18.35 189,000 gallons 18.39 quadratic, quadratic 
18.41 quadratic, quadratic 18.45 0.58, positive 
autocorrelation 18.47 a. inconclusive b. inconclusive 
18.49 y t = -1.51 + 1.119y f _ i; 288.5 18.51 y t = 525.6 + 
0.812y t _j; 2895.5; MAD = 100.5 18.53 100 
18.55 best: services; worst: mining 18.57 $104,213 
18.59 Military = 1.6086 - 0.0379x; 0.964 million 
18.61 Restaurants = -1408.05 + 1206.50* - 9.92905* 2 ; 
23,249 restaurants 18.63 a. AvgBill = 16.8086 + 1.15786%; 
$41,124 b. AvgBill = 16.6486 + 1.26452* - 0.0133* 2 ; 
$37,324 c. quadratic, quadratic 18.67 I, 95.14; II, 89.69; 
ffl, 95.64; IV, 119.53 18.69 I, 182.240; II, 68.707; 

III, 34.210; IV, 114.843 18.71 I, 94.68; H, 93.54; ffl, 111.05; 

IV, 100.73 18.73 I, 72.50; II, 83.11; ffl, 111.16; IV, 132.77 
18.75 I, 71; II, 91; III, 104; IV, 66 18.77 Cost = 38.3268 + 
0.794341* + 0.0538258* 2 ; Cost = 37.1427 + 1.38642*; 
quadratic 18.83 DW statistic = 0.48; positive autocorrelation 
18.85 CPI ( = 2.1176 + 1.01668*CPI ( _ 1 ; MAD = 1.429; 
194.17 18.87 5.65% increase 

Chapter 19 

19.9 0, 2000, 2000 shirts 19.11 a. maximax b. maximax 
c. maximin 19.13 purchase, do not purchase, purchase 
19.17 make claims, $400 19.19 design C, $5.6 million 
19.21 direct, 21.5 minutes, 0.5 minutes 19.25 A or C, 
$12,800, $12,800 19.27 direct, 0.5 minutes; longer, 9.0 
minutes 19.29 287 cod 19.31 3545 programs 
19.33 7 units 19.35 current; Dennis; Dennis 
19.37 DiskWorth; ComTranDat; ComTranDat 

Chapter 20 

20.35 0.0026 20.39 in control 20.41 out of control 
20.43 out of control 20.45 LCL = 0, UCL = 0.088 
20.47 LCL = 0, UCL = 15.72 20.49 centerline = 0.197; 
LCL = 0.113; UCL = 0.282; in control 20.51 centerline = 
4.450; LCL = 0; UCL = 10.779; out of control 20.53 in 
control 20.55 yes; mean chart failed test #1 at sample 6 
20.57 yes; mean chart failed tests #1 and #5 at sample 5 
20.63 c-chart 20.65 centerline = 0.05225; LCL = 0.00504; 
UCL = 0.09946; in control 20.67 out of control 
20.69 centerline = 0.0988; LCL = 0.0355; 
UCL = 0.1621; in control 



A 

Acceptance Sampling The use of a 
sampling plan to determine whether 
a shipment should be accepted or 
rejected, 759 
Accuracy The difference between the 
observed sample statistic and the 
actual value of the population pa- 
rameter being estimated. This may 
also be referred to as estimation 
error or sampling error, 274 
Addition Rules Formulas for deter- 
mining the probability of the union 
of two or more events, 143-144 
Alpha (a) The probability of making 
a Type I error by rejecting a true 
null hypothesis, 313-314 
Alternative Hypothesis (Hj) An 
assertion that holds if the null 
hypothesis (H 0 ) is false. If the null 
hypothesis is not rejected, H 1 must 
be rejected, and vice versa, 311 
formulating, 311-313, 315 
Analysis of Variance (ANOVA) 
A procedure for comparing the 
means of two or more independent 
samples at the same time. Also used 
in testing the overall strength of the 
relationship between the dependent 
variable and the independent 
variable(s) in regression analysis, 
409-452 

basic concepts of, 410-412 
between-group variation in, 

412-413 
experimentation and, 410-412 
multiple regression analysis and, 

616-617 
one-way, 414-424 
randomized block, 427-437 
regression analysis and, 574 



two-way, 439-452 
within-group variation in, 
412-413 
ANOVA, See Analysis of variance 
A priori probability, See Classical 

probability 
Arithmetic Mean Also called the 
average or mean, this measure of 
central tendency is the sum of data 
values divided by the number of 
observations, 7, 59-60 
from grouped data, 84-85 
Assignable Variation In a process, 
variability that occurs due to 
nonrandom causes. Also known 
as special-cause variation, 758 
Attributes, See Control charts for 

attributes 
Autocorrelation Correlation between 
successive error terms, or residuals, 
in a regression and correlation 
analysis, 580, 713-716 
Autoregressive Forecasting 
A forecasting model in which each 
independent variable represents 
previous values of the dependent 
variable, 716-721 
Average, 59-60 

B 

Balanced Experiment A design in 
which an equal number of persons 
or test units receives each 
treatment, 411 

Bar Chart For qualitative variables, 
a graphical method in which the 
number of values in a category is 
proportional to the length of a 
rectangle. Unlike the histogram, 
there is a gap between adjacent 
rectangles, 28-29 



Bayesian Decision Making The use 

of a decision criterion in which the 
probabilities of the states of nature 
are considered, with the alternative 
having the greatest expected payoff 
being selected, 743-746 

expected payoff and, 743-744 
expected value of perfect 
information and, 744-746 
Bayes' Theorem A theorem dealing 
with sequential events, using infor- 
mation obtained about a second 
event in order to revise the proba- 
bility that a first event has occurred, 
151-155 

Bernoulli Process A process in which 
there are two or more consecutive 
trials, each of which has just two 
possible outcomes (success or fail- 
ure), and in which the probability 
of a success remains the same from 
one trial to the next; i.e., the trials 
are statistically independent from 
each other, 175, 223 
Beta (P) The probability of making a 
Type II error by failing to reject a 
false null hypothesis, 313-314 
Between-group variation, in analysis 

of variance, 412-413 
Bimodal Distribution A distribution 
in which two values occur with the 
greatest frequency, 62 
Binomial Distribution A discrete 
probability distribution involving 
consecutive trials of a Bernoulli 
process, 175-181 

Bernoulli process and, 178 
description and applications of, 

175-178 
mean of, 175, 224 
normal approximation to, 
223-226, 336 
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Binomial Distribution (continued) 

Poisson approximation to, 
192-193 

standard deviation of, 224 

variance of, 175 
Block In the randomized block 
ANOVA design, a set of persons or 
test units having similar characteris- 
tics in terms of the blocking 
variable, 428 
Box-and- Whisker Plot A graphical 
method that simultaneously 
displays the median, quartiles, and 
highest and lowest values within 
the data, 77-78 
Business Statistics The collection, 
summarization, analysis, and 
reporting of numerical findings 
relevant to a business decision or 
situation, 2-14 

attitudes toward, 2 

decision making and, 11-12 

history of, 3-4 

as tool, 11-12 

See also Statistics 

c 

Calculated Value A value of the test 
statistic that is calculated from sam- 
ple data. It is compared to the critical 
value(s) in deciding whether or not to 
reject the null hypothesis, 315-317 

Causal Research An investigation for 
the purpose of determining the ef- 
fect of one variable on another; 
typically carried out by means of 
experimentation, 103 

Cause-and-Effect Diagram A diagram 
that assists the solution of quality 
problems by visually depicting 
various causes at the left that feed 
directionally toward the ultimate 
effect at the far right. Because of its 
appearance, it is also known as a 
fishbone diagram, 767 

Cause-and-effect problems, in 
experiments, 110 

c-Chart A control chart displaying 
the number of defects per unit of 
output, 784-788 

Census The actual measurement or 
observation of all possible elements 
from the population, this can be 
viewed as a "sample" that includes 
the entire population, 7, 117 
sample vs., 118-119 



Centered Moving Average 
A smoothing technique in which 
the original time series is replaced 
with another series, each point of 
which is the center of and the aver- 
age of N points from the original 
series, 691-695 

Centerline A control chart line 
representing the central tendency 
of a process when it is in control, 
771-772 

Central Limit Theorem For large, 
simple random samples from a 
population that is not normally 
distributed, the sampling distri- 
bution of the mean will be 
approximately normal, with 
[L- = [l and (J— = cr/Vn. The sam- 
pling distribution will more closely 
approach the normal distribution 
as the sample size is increased, 208, 
251-252 

Central tendency, measures of. See 
Measures of central tendency 

Charts, See Control charts; Visual 
representation of data 

Chebyshev's Theorem For either a 
sample or a population, the 
percentage of observations that 
fall within k (for k > 1) standard 
deviations of the mean will be at 
least [1 - (l/fe 2 )](100)%, 78-79 

Check Sheet In total quality manage- 
ment, a sheet or other mechanism 
for recording frequency counts or 
product measurements, 768 

Chi-Square Distribution A family of 
continuous probability distributions, 
with a different curve for each pos- 
sible value of the number of degrees 
of freedom, df. It is the sampling 
distribution of (w — ljs 2 /^ 2 when- 
ever samples of size « are repeatedly 
drawn from the same, normal 
population, 466-469 

confidence interval for population 

variance and, 487^489 
hypothesis tests for population 
variance and, 489-491 

Chi-square test for equality of 
proportions, 484-486 

Chi-square test for goodness of fit, 
469^174 
procedure for, 470-471 
purpose of, 469 

Chi-square test for normality, 
474-475 



Chi-square test of variable indepen- 
dence, 477-482 
procedure for, 477-478 
purpose of, 477 

Class Each category of a frequency 
distribution, 18 
exhaustive, 18 
mutually exclusive, 18 
open-end, 18 

Classical Probability Probability de- 
fined in terms of the proportion of 
times that an event can be theoreti- 
cally expected to occur, 136 

Classical time series model, 687 

Class Interval The width of each 
class in a frequency distribution; 
this is the difference between the 
lower class limit of a class and the 
lower class limit of the next higher 
class, 18 

Class Limits The boundaries for 
each class in a frequency distri- 
bution; these determine which 
data values are assigned to the 
class, 18 

Class Mark For each class in a 
frequency distribution, the 
midpoint between the upper 
and lower class limits, 18 

Cluster analysis, 114 

Cluster Sample A probability 
sampling technique in which 
the population is divided into 
groups, then members of a random 
selection of these groups are 
collected by either a sample or 
a census, 124 

Coefficient of Correlation Expres- 
sed as r (with — l<r<+l), a 
number describing the strength and 
direction of the linear relationship 
between the dependent variable (y) 
and the independent variable (x), 
86-89, 565-566 
testing, 570-571 

Coefficient of Determination, t 2 In 
simple regression, the proportion of 
the variation in y that is explained by 
the regression line y = b 0 + b t x, 
88, 566-567 

Coefficient of Multiple Determination 
Expressed as R 2 , the proportion of 
the variation in y that is explained 
by the multiple regression equation. 
Its positive square root is the 
coefficient of multiple correlation, 
R, 614 
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Coefficient of Variation (CV) The 
relative amount of dispersion 
in a set of data; it expresses the 
standard deviation as a percentage 
of the mean, 79-81 

COLAs (cost-of-living adjustments), 
724 

Combinations The number of 
unique groups for n objects, r at 
a time. The order of arrangement 
is not considered, 158-159 
Common-Cause Variation In a 
process, variation that occurs by 
chance. Also known as random 
variation, 757 
Competitive Benchmarking The 
practice of studying, and attempt- 
ing to emulate, the strategies and 
practices of organizations already 
known to generate world-class 
products and services, 764 
Compiled List A list or sample 
consisting of persons who share 
a common characteristic, such 
as age, occupation, or state of 
residence, 107 
Complement Either an event will 
occur (A) or it will not occur (A). 
A is the complement of A, and 
P(A) + P(A') = 1, 135 
Conditional Probability The proba- 
bility that an event will occur, 
given that another event has 
already happened, 146 
Confidence Coefficient For a confi- 
dence interval, the proportion of 
such intervals that would include 
the population parameter if the 
process leading to the interval were 
repeated a great many times, 274 
Confidence Interval An interval esti- 
mate for which there is a specified 
degree of certainty that the actual 
value of the population parameter 
will fall within the interval, 
270, 274 

with finite population, 297-298 
hypothesis testing and, 327 
for mean, with population 
standard deviation known, 
275-279 
for mean, with population stan- 
dard deviation unknown, 
280-285 
multiple regression and, 608-610 
for population proportion, 
287-290 



for population variance, 

487-489 
regression analysis and, 

560-561, 608-610 
for slope of regression line, 

571-573 

Confidence Level Like the confidence 
coefficient, this expresses the degree 
of certainty that an interval will in- 
clude the actual value of the popu- 
lation parameter, but it is stated as 
a percentage. For example, a 0.95 
confidence coefficient is equivalent 
to a 95% confidence level, 274 
Consumer Price Index (CPI) An 
index produced by the U.S. Bureau 
of Labor Statistics, it describes the 
change in prices from one time pe- 
riod to another for a fixed "market 
basket" of goods and services, 
723-724, 726 
Contingency Table A tabular method 
showing either frequencies or rela- 
tive frequencies for two variables at 
the same time, 43-45, 140, 477 
Continuity, correction for, 224. See 
also Correction for continuity 
Continuous Probability Distribution 
A probability distribution describ- 
ing probabilities associated with 
random variables that can take on 
any value along a given range or 
continuum, 206-231 

exponential distribution, 227-231 
normal approximation to bino- 
mial distribution, 223-226 
normal distribution, 208-226 
simulating observations from, 

232-234 
standard normal distribution, 
212-221 

Continuous quantitative variables, 9 
Continuous Random Variable A 
random variable that can take a 
value at any point along an interval. 
There are no gaps between the 
possible values, 168, 206 
Control Chart A chart displaying a 
sample statistic for each of a series 
of samples from a process. It has 
upper and lower control limits and 
is used in determining whether the 
process is being influenced by 
assignable variation, 758-759, 
771-791 

format of, 771-772 
types of, 771 



Control Chart for Attributes A con- 
trol chart in which the measure of 
product quality is a count, 771, 
782-788 

c-charts, 784-788 
p-charts, 783-784 
Control Chart for Variables A con- 
trol chart in which the measure of 
product quality is a measurement, 
772-781 
mean, 772-777 
range, 777-781 
Control group, 109 
Control Limit In a control chart, 
a boundary beyond which the 
process is judged to be under the 
influence of nonrandom, or 
assignable, variation, 771-772 
Convenience Sample A nonprobabil- 
ity sampling technique where mem- 
bers are chosen primarily because 
they are both readily available and 
willing to participate, 125 
Correction for Continuity In the 
normal approximation to the 
binomial distribution, the "expan- 
ding" of each possible value of the 
discrete variable, x, by 0.5 in each 
direction. This correction becomes 
less important when n is large and 
we are determining the probability 
that x will lie within or beyond a 
given range of values, 224 
Correlation Analysis The measure- 
ment of the strength of the linear 
relationship between two variables, 
550, 565-568 

causation confusion and, 580-581 
coefficient of correlation and, 

86-89, 565-566 
coefficient of determination and, 

88, 565-567 
extrapolating beyond range of 

data in, 581 
Kolmogorov-Smirnov test 

and, 577 
normal probability plot and, 577 
residual analysis and, 576-580 
types of relationships 
between variables and, 
37-38, 86-87, 550 
See also Multiple correlation 
analysis 

Correlation Matrix A matrix listing 
the correlation of each variable 
with each of the other variables, 
665-666 
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Cost-of-living adjustments 
(COLAs), 724 

Costs of Conformance In total quality 
management, the costs associated 
with making the product meet 
requirements or expectations — i.e., 
the costs of preventing poor 
quality, 757 

Costs of Nonconformance In total 
quality management, the costs 
associated with the failure of a 
product to meet requirements or ex- 
pectations — i.e., the costs that occur 
as the result o/poor quality, 757 

Counting, 156 

Critical Value A value of the test 
statistic that serves as a boundary 
between the nonrejection region and 
a rejection region for the null 
hypothesis. A hypothesis test will 
have either one (one-tail test) or two 
(two-tail test) critical values, 317 

Cross-tabulation Also known as the 
crosstab or contingency table, 
shows how many people or items 
are in combinations of categories, 
43-45, 139, 477 

Cumulative Frequency Distribution 
A frequency distribution showing 
the number of observations within 
or below each class, or above or 
within each class, 19 

Cumulative relative frequency 
distribution, 19 

Curvilinear Relationship A relation- 
ship between variables that is best 
described by a curved line, 37 

Cyclical Variation in a time series 
due to fluctuations that repeat over 
time, with the period being more 
than a year from one peak to the 
next, 687 

D 

Data Mining Analyzing a large col- 
lection of data to discern patterns 
and relationships, 113-115 

Data Warehouse A large collection 
of data from inside and outside 
a company to be analyzed for 
patterns and relationships, 
113-114 

Deciles Quantiles dividing data 
into 10 parts of equal size, with 
each comprising 10% of the 
observations, 70 



Decision Alternatives The choices 
available to the decision maker, 
with one choice to be selected, 737 
expected opportunity loss for, 

747-748 
expected payoff for, 743-744 
payoff table and, 738 
Decision making, 735-751 
Bayesian, 743-746 
business statistics and, 11 
in hypothesis-testing 

procedure, 317 
incremental analysis and, 

749-751 
inventory, 749-751 
non-Bayesian, 740-742 
opportunity loss approach to, 

747-748 
payoff table in, 738 
regret table and, 741-742 
structuring decision situation in, 

736-739 

Decision Rule A statement specifying 
calculated values of the test statistic 
for which the null hypothesis 
should or should not be rejected 
(e.g., "Reject H 0 if z > 1.96, other- 
wise do not reject."), 317 

Decision Tree A graphical method 
of describing the decision situation, 
including alternatives, states of na- 
ture, and probabilities associated 
with the states of nature, 739 

Degrees of Freedom The number of 
values that remain free to vary once 
some information about them is 
already known, 280 

Deming Cycle Also known as the 
Deming Wheel and the PDCA 
(plan-do-check-act) Cycle, this 
model stresses the importance of 
continuous and systematic quality- 
improvement efforts at all levels of 
the organization, 760 

Deming 's 14 Points, 763 

Dependent Events Events for which 
the occurrence of one affects the 
probability of occurrence for the 
other, 146 

Dependent Samples Samples for 
which the selection process for one 
is related to the selection process 
for the other — e.g., before-after 
measurements for the same people 
or other test units, 363 

analysis of variance and, 437 
hypothesis testing and, 383-387 



sign test for comparing, 528-533 
Wilcoxon signed rank test for 
comparing, 511-514 
Dependent Variable In an experiment, 
the variable for which a value is 
measured or observed. In regression 
analysis, the y variable in the 
regression equation, 37, 109, 
411, 550 
Descriptive Research A study for 
the purpose of describing some 
phenomenon, 103 
Descriptive Statistics A branch of 
statistics in which the emphasis is 
on the summarization and des- 
cription of data that have been 
collected, 5 

from grouped data, 83-85 
Deseasonalized time series A time se- 
ries from which seasonal influences 
have been removed. Such a series is 
also described as seasonally 
adjusted, 702-703 
Designed Experiment An experiment 
in which treatments are randomly 
assigned to the participants or test 
units, 411 
Determination, coefficient of, 88, 

566-567 
Dichotomous Question In a 
questionnaire, a question having 
just two alternatives from which 
the respondent can choose, 
105-106 
Directional testing, 311. See also 

One-tail tests 
Direct Relationship A relationship in 
which the variables y and x increase 
and decrease together, 37 
Discrete Probability Distribution 
The probability distribution for a 
discrete random variable, 169-173 
binomial distribution, 175-181 
hypergeometric distribution, 

183-186 
Poisson distribution, 187-193 
uniform, 173 
Discrete quantitative variables, 8 
Discrete Random Variable A random 
variable that can take on only cer- 
tain values along an interval, with 
these values having gaps between 
them, 168 
Discriminant analysis, 1 14 
Dispersion, 58, 67-82 

Chebyshev's theorem and, 78-79 
coefficient of variation and, 82 
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relative, 82 

standardized data and, 79-81 
Distribution shape, and measures of 

central tendency, 63-64 
Dotplot A visual display represent- 
ing each data value with a dot, and 
allowing us to readily see the shape 
of the distribution as well as the 
high and low values, 27 
Dummy Variable A variable that 
takes on a value of either 1 or 0, 
depending on whether a given char- 
acteristic is present or absent. It al- 
lows qualitative data to be included 
in a regression analysis, 631-632, 
657-660 

Durbin- Watson Test A test for auto- 
correlation of the residuals in a 
regression analysis. It is especially 
applicable when the data represent 
a time series, 713-716 

E 

Empirical Rule For distributions that 
are bell-shaped and symmetrical, 
about 68% of the observations will 
fall within 1 standard deviation of 
the mean, with about 95% falling 
within 2 standard deviations, and 
practically all of the observations 
falling within 3 standard deviations 
of the mean, 79 

EMV (expected monetary value), 744 

EOL (expected opportunity loss), 
747-748 

Equality of proportions. See Chi- 
square test for equality of 
proportions 

Errors 

estimation, 270 

in hypothesis testing, 313-314, 

343-344 
multiple standard error of 

estimate, 608 
nonresponse, 108 
nonsampling, 119 
nonsystematic, 108 
power of test and, 343-344 
response, 108 

sampling, 108, 119, 292,417 
standard error of estimate, 

559-560 
standard error of the mean, 246 
systematic, 108 
Type I, 313-314 
Type II, 313-314 



Estimation error, 270 
Estimator, unbiased, 270 
Event One or more of the possible 
outcomes of an experiment; a 
subset of the sample space, 135 
dependent events, 146 
exhaustive events, 140 
independent events, 146 
intersection of events, 140 
mutually exclusive events, 140 
union of events, 140 
EVPI (expected value of perfect 
information), 744-746. See 
also Expected value of perfect 
information 
Excel applications, See front endsheet 
of text 

Exhaustive In a frequency distribu- 
tion, the condition in which the set 
of classes includes all possible data 
values. In an experiment, the condi- 
tion in which a set of outcomes in- 
cludes all of the possible outcomes, 
18, 140 

Exhaustive events, 140 

Expected Frequency In chi-square 
analysis, is the frequency of oc- 
currence that would be expected for 
a given cell in the table if the null 
hypothesis were really true, 470 

Expected monetary value (EMV), 

744. See also Expected payoff 

Expected opportunity loss (EOL), 
747-748 

Expected Payoff For a decision alter- 
native, the sum of the products of 
the payoffs and the state of nature 
probabilities, 743-744 

Expected value, 171 

Expected Value of Perfect Information 
(EVPI) The difference between the 
expected payoff with perfect infor- 
mation and the expected payoff of 
the best alternative with current 
information, 744-746 

Experiment An activity or measure- 
ment that results in an outcome. 
Also, a causal study for determining 
the influence of one or more inde- 
pendent variables on a dependent 
variable, 135, 410-412 
balanced, 411 
designed, 411 
factorial, 439 

Experimental group, 110 

Experimentation, and analysis of 
variance, 410-412 



Exploratory Research An investiga- 
tion for the purpose of gaining 
familiarity with the problem situa- 
tion, identifying key variables, and 
forming hypotheses that can be 
tested in subsequent research, 103 

Exponential Distribution A continu- 
ous distribution that, for a Poisson 
process, describes probabilities 
associated with the continuous 
random variable, x = the amount 
of time, space, or distance between 
occurrences of the events of 
interest, 227-231 

description and applications 

of, 227-228 
Poisson distribution and, 
227-228 

Exponential model, 662 

Exponential Smoothing A method 
that dampens the fluctuations in a 
time series; each "smoothed" value 
is a weighted average of current 
and past values observed in the 
series, 695-698 

forecasting with, 707-709 

External Secondary Data Secondary 
data that have been generated by 
someone outside the researcher's 
firm or organization, 104, 112-116 

External Validity In experimental 
research, the extent to which the 
results can be generalized to other 
individuals or settings, 110 

Extraneous variables, 109 

F 

Factor An independent variable 
in an experiment, 411 

Factor analysis, 114 

Factorial For a positive integer, n, 
the factorial («!) is the product of 
n X (n - 1) X (n - 2) X ■•■ X 1. 
The value of 0! is defined as 1, 157 

Factorial Experiment A design in 
which there are two or more factors 
and the treatments represent all 
possible combinations of their lev- 
els. The two-way ANOVA is the 
most basic form of this design, 439 

Factorial rule of counting, 157 

Factor Level A specific value for 
a factor. When an experiment 
involves just one factor, each 
factor level can be referred to 
as a treatment, 411 
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Finite Population Correction When 
sampling is without replacement 
and from a finite population, the 
use of a correction term is used 
to reduce the standard error 
according to the relative sizes 
of the sample and the population, 
297-301 
Finite population correction 

factor, 257 
Fishbone diagram, 767. See also 
Cause-and-effect diagram 
Focus group interview, 103 
Forecasting, 706-721 

autoregressive, 716-721 
exponential smoothing and, 

707-709 
seasonal indexes in, 710 
trend equation and, 707 
Frequency The number of data 
values falling within each class 
of a frequency distribution, 18 
expected, 470 
observed, 470 
Frequency Distribution A display 
table dividing observed values into 
a set of classes and showing the 
number of observations falling into 
each class, 16-19 
constructing, 16-19 
cumulative, 19 
guidelines for, 18 
relative, 19 
Frequency Polygon A graphical dis- 
play with line segments connecting 
points formed by intersections of 
class marks with class frequencies. 
May also be based on relative fre- 
quencies or percentages, 20 
Friedman Test For the randomized 
block design, a nonparametric test 
that compares more than two 
dependent samples at the same 
time. It is the nonparametric 
counterpart to the randomized 
block analysis of variance, 
523-526 
F statistic 

for comparing variances of 
two independent samples, 
394-398 
in one-way analysis of 

variance, 417 
in randomized block analysis of 

variance, 431 
in two-way analysis of variance, 
444, 447^148 



G 

Goodness-of-Fit Test A test that 
examines whether a sample could 
have been drawn from a population 
having a specified probability 
distribution — e.g., the normal 
distribution, 469-475 

chi-square test for, 469-475 
Kolmogorov-Smirnov test for, 
535-538, 577 
Grand Mean The mean of all obser- 
vations or measurements, 415 
Graphs, See Visual representation 
of data 

Grouped Data Data summarized in a 
frequency distribution, 83-85 

H 

Histogram For quantitative 
variables, a graphical method that 
describes a frequency distribution 
by expressing either the frequency 
or relative frequency within each 
class as the length of a rectangle. In 
a histogram, adjacent rectangles 
share a common side, 20-22 

Homoscedasticity The assumption 
of equal standard deviations of 
y values about the population 
regression line, regardless of 
the value of x, 551 

Hypergeometric Distribution A dis- 
crete probability distribution in 
which there are consecutive trials, 
but in which the probability of suc- 
cess is not the same from one trial 
to the next; i.e., the trials are not 
statistically independent from each 
other. 183-186 

Hyperplane A mathematical 
counterpart to the two-dimensional 
surface fitted to data points when 
there are just two x variables. 
The multiple regression equation is 
either a plane (two x variables) or a 
hyperplane (more than two x 
variables), 603 

Hypothesis, See Alternative hypo- 
thesis; Null hypothesis 

Hypothesis tests, 310-341, 363-393 
basic procedures of, 315-317 
comparing means of two depen- 
dent samples, 383-387 
comparing means of two 

independent samples, 363-377 



comparing two sample 

proportions, 388-393 
comparing variances of two inde- 
pendent samples, 394-398 
confidence intervals and, 327 
errors in, 313-314, 343-344 
for population variance, 489-491 
nature of, 310-313 
of a mean with population 

standard deviation known, 

318-321 
of a mean, with population 

standard deviation unknown, 

328-334 
of a proportion, 336-441 
power of a test, 313-314, 

343-347 
p-value and, 321-324 
Wests, 328-334, 363-376 
z-tests, 318-321, 378-381, 

388-393 



Ignorance A decision situation in 
which neither the possible states of 
nature nor their exact probabilities 
are known, 737 
Incremental Analysis An inventory 
decision method where items are 
sequentially considered, with each 
unit in the sequence being stocked 
only if the expected payoff for 
stocking it exceeds the expected 
payoff for not stocking it, 749-751 
Independent Events Events for which 
the occurrence of one has no effect 
on the probability of occurrence for 
the other, 146 
Independent Samples Samples for 
which the selection process for 
one is not related to the selection 
process for the other, 362 

comparing variances of, 394-398 
Kruskal-Wallis test for compar- 
ing more than two, 519-521 
t-test for comparing means of, 

363-376 
Wilcoxon rank sum test for 

comparing, 515-518 
Z-test for comparing means of, 

378-381 
Z-test for comparing proportions 
of, 388-393 
Independent Variable In regression 
analysis, an x variable in the regres- 
sion equation. In an experiment, 
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a variable that is observed or 
controlled for the purpose of deter- 
mining its effects on the value of 
the dependent variable, 37, 109, 
411, 550 

Indexes, 722-727 

Consumer Price, 723-724, 726 
Employment Cost, 726 
Industrial Production, 725 
New York Stock Exchange 

Composite, 726 
Producer Price, 725 
shifting base of, 726-727 
Standard & Poor's Com- 
posite, 726 

Index Number A percentage express- 
ing a measurement or combination 
of measurements in a given period 
compared to a selected base period, 
722-727 

Inferential Statistics A branch of 
statistics that goes beyond mere 
description and, based on sample 
data, seeks to generalize from the 
sample to the population from 
which the sample was drawn, 5-6, 
102, 244, 270 

Interactive Effect In a two-factor ex- 
periment, an effect that results from 
the combination of a level for one 
factor with a level for the other. In 
the randomized block design, inter- 
action is present when the effect of 
a treatment depends on the block to 
which it has been administered, 
429, 440^141 

Internal Secondary Data Secondary 
data that have been generated from 
within the researcher's firm or orga- 
nization, 104, 112 

Internal Validity In experimental 
research, the extent to which the 
independent variable was actually 
responsible for changes measured 
in the dependent variable, 109-110 

Internet data sources, 115-116 

Interquartile Range The distance be- 
tween the first and third quartiles, 71 

Intersection Two or more events 
occur at the same time; e.g., "A and 
B", 140 

Interval Estimate A range of values 
within which the actual value of the 
population parameter may fall, 
270, 272-290 

with finite population, 297-298 
in multiple regression, 608-611 



in simple linear regression, 
559-564, 571-573 
Interval Limits The lower and upper 
values of the interval estimate, 273 
Interval Scale of Measurement 
A scale that has a constant unit of 
measurement, but in which the 
zero point is arbitrary, 10 
Interview 

focus group, 103 
personal, 105 
telephone, 105 
Inverse Relationship A relationship in 
which y and x increase or decrease 
in opposite directions, 37-38 
Irregular Variation in a time series 
due to random fluctuations or 
unusual events, 687 



Joint Probability The probability 
that two or more events will all 
occur, 146 

Judgment Sample A nonprobability 
sampling technique where the re- 
searcher believes the members to be 
representative of the population, 126 

Juran diagram, 768 

Just-in-Time Manufacturing A 
manufacturing system in which 
materials and components literally 
arrive "just in time" to enter the 
production process. It is a produc- 
tion philosophy that has favorable 
effects on both inventory costs 
and product quality, 764-765 

K 

Kaizen A philosophy that focuses 
on small, ongoing improvements 
rather than relying on large-scale 
(and typically, more abrupt) inno- 
vations, such as might come from 
advances in technology or 
equipment, 762 

Kolmogorov-Smirnov Test A 
nonparametric test to examine 
whether a set of data may have 
come from a given distribution. It is 
applied in the text to test whether 
data or residuals could have come 
from a normal population, 
535-538, 577 

Kruskal-Wallis Test A nonparamet- 
ric test that compares the medians 



of more than two independent 
samples at the same time, 
519-521 



Law of Large Numbers Over a 
large number of trials, the relative 
frequency with which an event oc- 
curs will approach the probability 
of its occurrence for a single 
trial, 136 

Leaf In a stem-and-leaf display of 
raw data, the rightmost digit on 
which classification of an observa- 
tion is based, 25 

Least-Squares Criterion For fitting a 
regression line, a criterion which 
requires that the sum of the squared 
deviations between actual y values 
and y values predicted by the equa- 
tion be minimized. The resulting 
equation is the least squares 
regression line, 552-554 

Least-squares regression line, 552-554. 
See also Regression line 

Left-Tail Test A one-tail test in 
which the rejection region for the 
null hypothesis is in the left tail of 
the sampling distribution, 312 

Linear regression, See Regression 
analysis 

Linear Relationship A relationship 
between variables that is best 
described by a straight line, 37-38 
Linear trend equations, 688-690 
Line Graph A display in which lines 
connect points representing values 
of one quantitative variable (y, or 
vertical axis) versus another (x, or 
horizontal axis). Two or more y 
variables may be plotted for the 
same x, 28-29 

M 

MAD (mean absolute deviation), 
71-72, 711-712. See also 
Mean absolute deviation 
Mailing lists, 107 
Mail Survey A survey technique in 
which the medium of communica- 
tion is a mail questionnaire, 105 
errors in, 108 
questionnaire design for, 

105-106 
sampling process for, 107 
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Main Effect An effect caused by one 
of the factors in an experiment, 440 

Marginal Loss In incremental analy- 
sis, the cost if one more unit is 
stocked and not sold, 749 

Marginal Probability The probabil- 
ity that a given event will occur, 
with no other events taken into 
consideration, 146 

Marginal Profit In incremental anal- 
ysis, the profit if one more unit is 
stocked and sold, 749 

Matched pairs, 383 

Maximax A non-Bayesian criterion 
specifying selection of the decision 
alternative having the highest 
maximum payoff, 741 

Maximin A non-Bayesian criterion 
specifying selection of the decision 
alternative having the highest 
minimum payoff, 740-741 

Mean, 7, 59-60 

arithmetic, 7, 59-60 
from grouped data, 84-85 
grand, 415 

median and mode vs., 62-63 

sampling distribution of, 243, 
243-252 

standard error of, 246 

trimmed, 65 

weighted, 61 
Mean Absolute Deviation (MAD) 
As a measure of dispersion, this is 
the average of the absolute values 
of the differences between observa- 
tions and the mean. In forecasting, 
a criterion for measuring the fit of 
an estimation equation to an actual 
time series. The best-fit model is the 
one having the lowest mean value 
for |y,-y ,1,71-72, 711-712 

time series and, 711-712 
Mean Chart A control chart display- 
ing a series of sample means from 
the output of a process, 772-777 
Mean of a Probability Distribution 
Expressed as /x or E(x), this is the 
expected value of the random vari- 
able, x. It is a weighted average in 
which the possible values of x are 
weighted according to their proba- 
bilities of occurrence, 171 
Mean Square A sum of squares 
value that has been divided by the 
number of degrees of freedom asso- 
ciated with the source of variation 
being examined, 417 



Mean Squared Error (MSE) In fore- 
casting, a criterion for measuring 
the fit of an estimation equation to 
an actual time series. The best-fit 
model is the one having the 
lowest mean value for (y i — y 
711-712 
Measurement, scales of, 9-10 
Measure of Central Tendency A 
numerical measure describing 
typical values in the data, 58-66 
arithmetic mean, 7, 59-60 
distribution shape and, 63-64 
median, 7, 61-62 
mode, 7, 62 
weighted mean, 61 
Measure of Dispersion A numerical 
measure describing the amount of 
scatter, or spread, in the data values, 
58, 67-82 
mean absolute deviation (MAD), 

71-72, 711-712 
quantiles, 70-71 
range, 7, 68 

standard deviation, 7, 72-74 

variance, 11-1 A 
Median A value that has just as 
many observations that are higher 
as it does observations that are 
lower. It is the second quartile, 7, 
61-62 

Midrange The average of the lowest 
and highest data values, 70 

Minimax Regret A non-Bayesian 
criterion specifying selection of the 
decision alternative having the 
lowest maximum regret, 741-742 

Minitab applications, See front end- 
sheet of text 

Mode The value that occurs with the 
greatest frequency in a set of data, 
7, 62 

Moving average, centered, 691-695 
MSE (mean squared error), 
711-712 

Multicollinearity A condition where 
two or more of the independent 
variables are highly correlated 
with each other, making the partial 
regression coefficients both statisti- 
cally unreliable and difficult to 
interpret, 632-633, 665-666 
Multidimensional scaling, 114 
Multiple Choice Question In a 
questionnaire, a question having 
several alternatives from which the 
respondent may choose, 105 



Multiple correlation analysis, 
613-615 

coefficient of multiple determina- 
tion and, 614 

dummy variable and, 631-632, 
657-660 

multicollinearity and, 632-633, 
665-666 

significance tests in, 615-619 

stepwise regression and, 668-672 
Multiple determination, coefficient 
of, 614 

Multiple Regression Regression 
analysis in which there is one 
dependent variable and two or 
more independent variables, 
599-632 

assumptions for, 602 
confidence intervals and, 

608-610 
dummy variable and, 631-632, 

657-660 
interval estimation in, 608-611 
model for, 601-602 
multicollinearity and, 632-633, 

665-666 
multiple standard error of 

estimate and, 608 
prediction intervals and, 

610-611 
residual analysis and, 624-629 
significance tests in, 615-619 
stepwise regression and, 
632-633,668-672 
Multiple Standard Error of Estimate 
The amount of dispersion of 
data points about the plane (or 
hyperplane) represented by the 
multiple regression equation; this 
is also the standard deviation of 
the residuals, 608 
Multiplication, principle of, 156 
Multiplication Rules Formulas for 
determining the probability of the 
intersection of two or more events, 
146-149 
Multiplicative model, 662-663 
Mutually Exclusive Classes The 
classes in a frequency distribution 
are mutually exclusive when each 
data value can fall into only one 
category, 18 
Mutually Exclusive Events The con- 
dition in which two events cannot 
occur at the same time. An event 
(A) and its complement (A') are 
always mutually exclusive, 140 
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N 

Negatively Skewed Distribution 

A nonsymmetrical distribution that 
tails off to the left; it has a mean 
that is less than the median, 64 
Nominal Scale of Measurement The 
most primitive scale of measurement; 
it employs numbers only to identify 
membership in a group or category, 9 
Non-Bayesian decision making, 

740-742 
Nondirectional testing, 311 

confidence intervals and, 327 
See also Two-tail tests 
Nonparametric Test A hypothesis 
test that makes no assumptions 
about the specific shape of the 
population from which a sample 
is drawn, 503-541 

advantages and disadvantages of, 

504-505 
corresponding parametric tests 

for, 505 
Friedman test, 523-526 
Kolmogorov-Smirnov test, 

535-538, 577 
Kruskal-Wallis test, 519-521 
runs test, 533-535 
sign test, 528-533 
Wilcoxon rank sum test, 

515-518 
Wilcoxon signed rank test, 
506-514 

Nonprobability Sampling Any sam- 
pling technique where not everyone 
in the population has a chance of 
being included in the sample. The 
selection process will involve at 
least some degree of personal 
subjectivity, 120, 124-126 

Nonresponse Error In survey 
research, error that occurs when per- 
sons who respond to the survey are 
different from those who don't, 108 

Nonsampling Error The error, or 
difference between the population 
parameter and its corresponding 
sample statistic, that occurs because 
of a directional tendency, or bias, 
in the measuring process, 119 

Nonsystematic Error Also referred to 
as "random," an error that occurs 
when a measured value differs from 
the actual value of the population 
parameter, but is just as likely to be 
too high as too low, 108 



Normal Distribution A family of 
distributions, each member of which 
is bell-shaped and symmetrical, and 
has the mean, median, and mode all 
located at the midpoint. The curve 
approaches the horizontal axis at 
both ends, but never intersects with 
it. The specific member of the fam- 
ily depends on just two descriptors: 
the mean (fi) and the standard 
deviation (a), 208-220 

approximation to binomial 

distribution, 223-226, 336 
areas beneath normal curve, 

209-221 
description and applications of, 

208-209 
standard, 212-221 
Normal probability plot, 577 
Null Hypothesis (H 0 ) A statement 
about the value of a population 
parameter, H Q is put up for testing 
in the face of numerical evidence. It 
is not necessarily identical to the 
claim or assertion that led to the 
test, 310-311 

errors with, 313-314 
formulating, 310-311, 315 

o 

Observation Method of primary 
data collection in which behavior is 
observed and measured, 110-111 

Observed Frequency In chi-square 
analysis, O- is the observed, or 
actual, frequency of occurrence 
for a given cell in the table, 470 

Odds A method in which a ratio is 
used as a way of expressing the 
likelihood that an event will 
happen versus the likelihood that 
it will not, 138-139 

Ogive A graphical display providing 
cumulative values for frequencies, 
relative frequencies, or percentages. 
May be either "less than" or 
"greater than", 21 

One-Tail Test Test in which the null 
hypothesis can be rejected by an ex- 
treme result in one direction only. 
The test arises from a directional 
claim or assertion about the popu- 
lation parameter, 311 

directional assertions and, 311 

One-Way Analysis of Variance An 
ANOVA procedure that examines 



two or more independent samples to 
determine whether their population 
means could be equal. Also referred 
to as the one-factor, completely 
randomized design, 414^424 

assumptions for, 414-415 

between-sample variation in, 415 

model for, 414-415 

pooled-variances f-test and, 424 

procedure for, 415-418 

purpose of, 414 

within-sample variation in, 415 
Open-End Class In a frequency dis- 
tribution, a class having either no 
lower limit or no upper limit, 18 
Open-Ended Question In a question- 
naire, a question that allows the 
respondent to answer in his own 
words and to express whatever 
thoughts he feels are relevant to the 
subject matter of the question, 106 
Operating Characteristic (OC) 
Curve The complement to the 
power curve, the OC curve 
plots the probability (j8) that the 
hypothesis test will make a Type II 
error (fail to reject a false null 
hypothesis) for a range of values of 
the population parameter for which 
the null hypothesis is false, 346 
Opportunity Loss Equivalent to 

regret, 747-748 
Ordinal Scale of Measurement A 
scale that allows the expression 
of "greater than" and "less than" 
relationships, but which has no 
unit of measurement, 9-10 
Outlier A data value that is very 
different from most of the other 
data values, 25 



Paired Observations Data collected 
from dependent samples, such as 
before-and-after measurements 
recorded for the same individuals 
or test units, 383 
Paired samples 

sign test for comparing, 528-533 
Wilcoxon signed rank test for 

comparing, 511-514 
See also Dependent samples 
Parameter A characteristic of the 
population, such as the population 
mean (//,), standard deviation (cr), 
or proportion (it), 7, 117 
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Parametric tests, corresponding to 
nonparametric tests, 505. 
See also names of specific 
parametric tests 

Pareto Diagram Also known as the 
Juran diagram, a bar chart describ- 
ing the relative frequencies with 
which various kinds of defects, 
or nonconformities, have 
occurred, 768 

Partial Regression Coefficient In the 
multiple regression equation, the 
coefficient for each independent 
variable. It is the slope of the equa- 
tion when all other x variables are 
held constant, 601 

interval estimation for, 619-620 
testing, 617-619 

Payoff The outcome at the intersec- 
tion of a decision alternative and a 
state of nature, 738 
expected, 743-744 

Payoff Table A method of structur- 
ing the decision situation. The 
rows of the table are decision 
alternatives, the columns are states 
of nature, and the entry at each 
intersection of a row and column 
is a numerical consequence such as 
a profit or loss. The table may also 
include estimated probabilities for 
the states of nature, 738 

p-Chart A control chart that moni- 
tors a process by showing the pro- 
portion of defectives in each of a 
series of samples, 783-784 

Percentiles Quantiles dividing 
data into 100 parts of equal size, 
with each comprising 1 % of the 
observations, 70 

Perfect information, expected value 
of, 744-746 

Periodicity A potential problem with 
the systematic sample; this can 
occur when the order in which the 
population appears happens to in- 
clude a cyclical variation in which 
the length of the cycle is the same 
as the value of k being used in 
selecting the sample, 123 

Permutations The number of unique 
arrangements for n objects, r at a 
time, 157 

Personal Interview A survey 
technique in which an interviewer 
personally administers the question- 
naire to the respondent, 105 



Pictogram A visual display using 
symbols to represent frequencies or 
other values, 32-33 

Pie Chart A circular display divided 
into sections based on either the 
number of observations within, or 
the relative values of, the segments, 
31-32 

Point Estimate A single number 
(e.g., x or p) that estimates the 
value of a population parameter 
(e.g., /x or it), 270-271 
using multiple regression 

equation, 604 
using regression line, 556-557 
Poisson Distribution A discrete prob- 
ability distribution applied to events 
for which the probability of occur- 
rence over a given span of time, 
space, or distance is extremely small. 
It can be used to approximate the 
binomial distribution when the 
number of trials is relatively large 
and the probability of a success in 
any given trial is very small, 
187-193 

approximation to binomial 

distribution, 191-193 
description and applications 

of, 187-189 
exponential distribution and, 
228-229 
Polynomial models extend the 
linear regression model to include 
terms in which the independent 
variable(s) are raised to a power 
greater than 1. They enable the 
model to fit data from relationships 
that are curvilinear, 644-656 
with one quantitative predictor 

variable, 644-650 
with two quantitative predictor 
variables, 652-656 
Pooled-variances f-test 

and analysis of variance, 424 
in hypothesis testing, 363-369 
Population The set of all possible el- 
ements that could theoretically be 
observed or measured; sometimes 
referred to as the universe, 6, 117 
Population Parameter The character- 
istic of the population that 
corresponds to the sample statistic. 
The true value of the population 
parameter is typically unknown, 
but is estimated by the sample 
statistic, 7, 117 



Positively Skewed Distribution A 

nonsymmetrical distribution that 
tails off to the right; it has a mean 
that is greater than the median, 64 

Posterior Probability A revised prob- 
ability based on additional 
information, 151 

Power Curve A graph in which the 
power (1 — j8) of the hypothesis 
test is plotted against a range of 
population parameter values for 
which the null hypothesis is false, 
346-350 

Power of a Test (1 — /3) The proba- 
bility that the hypothesis test will 
correctly reject a null hypothesis 
that is false, 313-314, 343-347 
Prediction Interval In regression 
analysis, the estimation interval 
for an individual y observation, 
561-562 

multiple regression and, 610-611 
Predictive Research An investigation 
for the purpose of obtaining a fore- 
cast or prediction of some value 
that will occur in the future, 104 
Primary Data Data that have been 
collected by the researcher for the 
problem or decision at hand, 
105-111 

from experimentation, 109-110 

from observation, 110-111 

from survey research, 105-108 
Principle of Multiplication If, 
following a first event that can hap- 
pen in Hj ways, a second event can 
then happen in n 2 ways, the total 
number of ways both can happen is 
«j«2- Alternatively, if each of k in- 
dependent events can occur in n 
different ways, the total number 
of possibilities is « , 156 
Prior Probability An initial 
probability based on current 
information, 151 
Probability A number between 0 and 
1 which expresses the chance that 
an event will occur, 135-156 

addition rules for, 143-144 

Bayes' theorem and, 151-155 

classical, 136 

conditional, 146 

joint, 146 

marginal, 146 

multiplication rules for, 146-149 
odds and, 138-139 
posterior, 151 
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prior, 151 

relative frequency, 136-137 
subjective, 137-138 
Probability Density Function Curve 
describing the shape of a continuous 
probability distribution, 206 
Probability Distribution For a 
random variable, a description 
or listing of all possible values it 
can assume, along with their 
respective probabilities of 
occurrence, 168 
mean of, 171 
variance of, 171-172 
See also Continuous probability 
distributions; Discrete proba- 
bility distributions 
Probability Sampling Any sampling 
technique where each member or 
element in the population has some 
(nonzero) known or calculable 
chance of being included in the 
sample. This type of sample is nec- 
essary if it is desired to statistically 
generalize from the sample to the 
population. 107, 120-124 
Process Flow Chart A chart visually 
depicting the operations (or subpro- 
cesses) involved in a process, show- 
ing the sequential relationships 
between them, 766 
Process orientation, 761 
Proportion, 7 

hypothesis testing of 
one sample proportion, 

336-341 
two sample proportions, 
388-393 
sampling distribution of, 

253-255 
See also Chi-square test for 
equality of proportions; Popu- 
lation proportion estimation 
Proportionate stratified sample, 
123-124 

Purposive Sample A nonprobability 
sampling technique where members 
are chosen specifically because 
they're not typical of the 
population, 126 

p-value The level of significance 
where the calculated value of the 
test statistic is exactly the same as 
a critical value. It is the lowest 
level of significance at which the 
null hypothesis can be rejected, 
321-324 



Q 

Quadratic trend equations, 690 

Qualitative Variable A variable that 
indicates whether a person or 
object possesses a given attribute, 8 

Quality Audit The evaluation of the 
quality capabilities of a company or 
a supplier, 764 

Quantiles Values that divide the data 
into groups containing an equal 
number of observations, 70-71 

Quantitative Variable A variable 
that expresses how much of an 
attribute is possessed by a person 
or object; may be either discrete 
or continuous, 8 

Quartile Deviation Half of the 
interquartile range, 71 

Quartiles Quantiles dividing data 
into 4 parts of equal size, with each 
comprising 25% of the 
observations, 70-71 

Questionnaire In survey research, the 
data collection instrument, 105-106 
design of, 105-106 
errors in, 108 

Quota Sample A nonprobability 
sampling technique that is similar 
to the stratified sample, but where 
members of the various strata are 
not chosen through the use of a 
probability sampling method, 125 

R 

Random digit dialing, 107 
Randomized Block Design An 

ANOVA procedure in which persons 
or test units are first arranged into 
similar groups (or blocks) before the 
treatments are randomly assigned. 
This reduces error variation by 
ensuring that treatment groups will 
be comparable in terms of the block- 
ing variable, 427-437 

assumptions for, 428-429 
dependent-samples J- test and, 437 
Friedman test for, 523-526 
model for, 428-429 
procedure for, 429^34 
purpose of, 427 
Randomness, runs test for, 533-535 
Random Variable A variable that 
can take on different values 
depending on the outcome of 
an experiment, 168-169, 206 



Random Variation In a process, 

variation that occurs by chance. 

Also known as common-cause 

variation, 757 
Range The difference between the 

highest and lowest values in a set of 

data, 7, 68 

interquartile, 71 
Range Chart A control chart 

displaying a series of sample ranges 

from the output of a process, 

777-781 

Ratio Scale of Measurement A scale 
that has a constant unit of measure- 
ment and an absolute zero value, 10 

Ratio to Moving Average A method 
for determining seasonal indexes, 
700-702 

Raw Data Data that have not been 
manipulated or treated in any way 
beyond their original collection, 16 
Regression Analysis Fitting an equa- 
tion to a set of data in order to de- 
scribe the relationship between the 
variables, 550-564 

analysis of variance perspective 

and, 574, 616-617 
assumptions for, 551-552 
causation confusion and, 580-581 
confidence intervals and, 

560- 561 
extrapolating beyond range of 

data in, 581 
interval estimation and, 559-564 
Kolmogorov-Smirnov test 

and, 577 
model, 551-552 

normal probability plot and, 577 
prediction intervals and, 

561- 562 

residual analysis and, 576-580 
types of relationships between 

variables and, 37-38, 88, 550 
See also Multiple regression 
Regression coefficient, partial. See 
Partial regression coefficient 
Regression Line In simple regression 
this is y = b Q + b^x, the equation 
for estimating the value of y based 
on a known or assumed value of x, 
552-553 
Regret For a given state of nature, 
the difference between each payoff 
and the best possible payoff if that 
state of nature were to occur. 
Regret is the same as opportunity 
loss, 741-742 
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Relationships between variables, 
37-38, 86-88, 550 

Relative Frequency Distribution A 
display table showing the propor- 
tion or percentage of total observa- 
tions falling into each class, 19 

Relative Frequency Probability Prob- 
ability defined in terms of the pro- 
portion of times an event occurs 
over a large number of trials, 
136-137 

Representative Sample A sample in 
which members tend to have the 
same characteristics as the popula- 
tion from which they were 
selected, 7 
Research, 105-117 

causal, 103 

descriptive, 103 

experimentation, 109-110 

exploratory, 103 

observational, 110-111 

predictive, 104 

primary data in, 105-112 

process of, 104 

secondary data in, 104, 112-116 
survey, 105-108 

Residual The difference between an 
observed y value and the estimated 
value (y ) predicted by the 
regression equation, 551 

Residual analysis 

multiple regression and, 624-629 
regression analysis and, 576-580 

Response Error In survey research, 
error that occurs whenever respon- 
dents report information that is not 
truthful, 108 

Response List A list or sample con- 
sisting of persons who share a com- 
mon activity or behavior, such as 
subscribing to a magazine, making 
a mail-order purchase, contributing 
to a charity, or applying for a credit 
card, 108 

Right-Tail Test A one-tail test in 
which the rejection region for the 
null hypothesis is in the right tail of 
the sampling distribution, 312 

Risk A decision situation in which 
both the possible states of nature 
and their exact probabilities of oc- 
currence are known, 737 

Runs Test A nonparametric test that 
evaluates the randomness of a series 
of observations by analyzing the 
number of runs it contains. A run is 



the consecutive appearance of one 
or more observations that are 
similar in some way, 533-535 



Sample A smaller number, or subset, 
of the people or objects that exist 
within the population, 6, 117 
census vs., 118-119 
cluster, 124 
convenience, 125 
dependent, 363. See also 

Dependent samples 
independent, 362. See also 

Independent samples 
judgment, 126 

paired, 383. See also Dependent 

samples 
purposive, 126 
quota, 125 

simple random, 120-122 

stratified, 123-124 

systematic, 122-123 

variance for, 72-74 
Sample size 

estimating population mean and, 
292-293 

estimating population proportion 
and, 293-295 

with finite population, 298-300 

selecting test statistic for hypoth- 
esis tests and, 315 

Type I and Type II errors and, 
347-348 

Sample Space All possible outcomes 

of an experiment, 135 
Sample Statistic A characteristic of 
the sample that is measured; it is 
often a mean, median, mode, pro- 
portion, or a measure of dispersion 
such as the standard deviation, 7 
Sampling, 117-126 

nonprobability, 120, 124-126 
probability, 107, 120-124 
process in research, 107 
Sampling Distribution The probabil- 
ity distribution of a sample statistic, 
such as the sample mean (x) or pro- 
portion (p), 243-258 

with finite population, 256-258 
of the mean, 243-252 
of the proportion, 253-255 
Sampling Error The error, or differ- 
ence between the population 
parameter and its corresponding 
sample statistic, that occurs because 



a sample has been taken instead of 
a census. This is a nondirectional, 
or random, error. This type of error 
is reduced by increasing the size of 
a probability sample, 108, 119, 
292,417 

Sampling with Replacement Sampling 
in which the item selected from 
the population is observed or mea- 
sured, then put back into the 
population, 176 

Sampling without Replacement Sam- 
pling in which the item selected 
from the population is not returned 
to the population after being 
observed or measured, 149, 176 

Scales of measurement, 9-10 

Scatter Diagram A visual summariza- 
tion of the data in which each point 
is a pair of observed values for the 
dependent and independent 
variables, 37—41 

Seasonal Variation in a time series 
due to periodic fluctuations that 
repeat over a period of one year 
or less, 687 

Seasonal Index The relative strength 
of each season (week, month, quar- 
ter) in contributing to the values of 
annual observations within the time 
series, 699-705 

deseasonalized time series, 

702-703 
in forecasting, 710 
ratio to moving average method, 
700-702 

Secondary Data Data that have been 
collected by someone other than the 
researcher for some purpose other 
than the problem or decision at 
hand, 104, 112-116 

evaluating, 116 

external, 104, 112-116 

internal, 104, 112 
Seeing Statistics applets, See front 

endsheet of text 
Significance Level (a) The probabil- 
ity that the test will commit a 
Type I error by rejecting a true 
null hypothesis, 313-314 

p-values and, 321-324 

selecting, 315 
Sign Test Used for the same 
purposes as the Wilcoxon signed 
rank test, this nonparametric test 
can be applied to ordinal data as 
well as interval or ratio. The sign 
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test expresses each difference only 
in terms of being a plus ( " + " ) or a 
minus, 528-533 
Simple linear regression model, 

550-564. See also Regression 

analysis 

Simple Random Sample A probabil- 
ity sample in which every person or 
element in the population has an 
equal chance of being included in 
the sample. The more theoretical 
definition is that, for a given 
sample size, each possible sample 
constituency has the same chance of 
being selected as any other sample 
constituency, 120-122 

Simulations 

from continuous probability 

distributions, 232-234 
from discrete probability 
distributions, 194-198 

Sketch A relevant symbol the size of 
which reflects a frequency or other 
numerical descriptor, 34 

Skewness The condition in which a 
distribution tails off to either the 
left or the right, exhibiting a lack 
of symmetry, 64 

Smoothing techniques 

exponential smoothing, 695-698 
moving average, 691-695 

Spearman Coefficient of Rank 
Correlation When paired observa- 
tions are at least of the ordinal 
scale, this nonparametric method 
measures the strength and 
direction of the relationship 
between them, 538-541 

Special-Cause Variation In a process, 
variation that occurs due to identifi- 
able causes that have changed the 
process. Also known as assignable 
variation, 758 

Standard Deviation A measure 
of dispersion, calculated as the 
positive square root of the variance, 
7, 72-74 
from grouped data, 85 

Standard Error of Estimate A mea- 
sure of the amount of scatter, or 
dispersion, of observed data points 
about the regression equation, 
559-560 
multiple, 608 

Standard Error of the Mean The 
standard deviation of the sampling 
distribution of the mean, 246 



Standardized data The expression of 
each data value in terms of its dis- 
tance from the mean in standard 
deviation units. The resulting distri- 
bution will have a mean of 0 and a 
standard deviation of 1, 79-81 

Standard Normal Distribution Also 
called a standardized normal dis- 
tribution, a normal distribution 
that results from the original x 
values being expressed in terms 
of their number of standard 
deviations away from the mean. 
The distribution has a mean of 0 
and a standard deviation of 1. 
Areas beneath the curve may be 
determined by computer statistical 
package or from the standard 
normal distribution table, 
212-221 

description and applications 

of, 212 
z-score for, 212 

Standard normal distribution table, 
See rear endsheet of text 

State of Nature A future 
environment that has a chance of 
occurring; it is not under the con- 
trol of the decision maker, 737 

Statistic A characteristic of the sam- 
ple, such as the sample mean (x), 
standard deviation (s), or 
proportion (p), 7, 117 

Statistical process control, 758-759, 

771- 791. See also Control 
chart 

control charts for attributes, 771, 

782-788 
control charts for variables, 

772- 781 
Statistics 

descriptive, 5, 102 
inferential, 5-6, 102, 244, 270 
sample, 6 

See also Business statistics 
Stem In a stem-and-leaf display of 
raw data, the leftmost digits on 
which classification of an observa- 
tion is based, 25 
Stem-and-Leaf Display A variant of 
the frequency distribution that uses 
a subset of the original digits in the 
raw data as class descriptors 
(stems) and class members (leaves), 
24-26 

Stepwise Regression A method in 
which independent variables enter 



the regression analysis one at a 
time. The first x variable to enter 
is the one explaining the most 
variation in y. At each step, 
the variable entered explains the 
greatest amount of the remain- 
ing variation in y, 632-633, 
668-672 

Stratified Sample A probability 
sample in which the population is 
divided into layers, or strata, then a 
simple random sample of members 
from each stratum is selected. 
Strata members have the same 
percentage representation in 
the sample as they do in the 
population, 123-124 

Subjective Probability Probability 
based on personal judgment or 
belief, 137-138 

Survey Research An investigation in 
which individuals are contacted by 
mail, telephone, or personal inter- 
view and their responses measured 
through the use of a questionnaire 
or data collection instrument, 
105-108 

errors in, 108 

questionnaire design in, 105-106 
sampling process in, 107-108 
types of, 105 

Symmetrical A distribution in which 
the left half and the right half are 
mirror images of each other, 64 

Systematic Error Also called 
directional error, or bias, a 
tendency to either overestimate or 
underestimate the actual value of a 
population parameter, 108 

Systematic Sample Probability sam- 
ple in which a random starting 
point between 1 and k is selected, 
then every kth element in the pop- 
ulation is sampled, 122-123 

T 

Tabulation 

simple, 43^44 
cross-tabulation, 43-45 

Taguchi Methods In total quality 
management, a method that 
surpasses traditional specification 
limits to recognize the loss in qual- 
ity that occurs as a dimension or 
measurement deviates farther from 
the target, 769-770 
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t Distribution A family of con- 
tinuous, unimodal, bell-shaped 
distributions, the t distribution is the 
probability distribution for the ran- 
dom variable t = {x — fi)/(s/vn). It 
has a mean of zero and a shape that 
is determined by the number of de- 
grees of freedom (df). The t distri- 
bution tends to be flatter and more 
spread out than the normal distri- 
bution, especially for small sample 
sizes, but it approaches the shape of 
the normal curve as the sample size 
(and df) increases, 280-283 
confidence intervals and, 

283-287 
See also f-tests 
Telephone Interview A survey 
research technique in which the 
telephone serves as the medium 
of communication between 
interviewer and respondent, 105 
Test for Equality of Proportions 
A chi-square application for examin- 
ing, based on independent samples 
from two or more populations, 
whether the population proportions 
could be the same. This is a special 
case of the chi-square test for vari- 
able independence, 484-486 
Test for Variable Independence A 
chi-square application for examin- 
ing whether two nominal-scale 
(category) variables might be 
related, 477-482 
Test Statistic Either a sample statistic 
or based on a sample statistic, a 
quantity used in deciding whether a 
null hypothesis should be rejected. 
Typical test statistics in the text are 
z and t, 315 
Time Series A sequence of observa- 
tions over regular intervals of time, 
686-690 
components of, 686-687 
deseasonalized, 702-703 
mean absolute deviation (MAD) 

criterion and, 711-712 
mean squared error (MSE) 

criterion and, 711-712 
ratio to moving average method 

and, 700-702 
seasonal indexes and, 699-705 
smoothing techniques and 
exponential smoothing, 
695-698 

moving average, 691-695 



trend equations and 
linear, 688-690 
quadratic, 690 
Total Quality Management A philos- 
ophy that (1) integrates quality into 
all facets of the organization and 
(2) focuses on improving the quality 
of the product by systematically im- 
proving the process through which 
the product is designed, produced, 
delivered, and supported, 756-791 
emergence, 760-762 
pioneers, 760 
practicing, 762-765 
process orientation, 761 
Six Sigma, 761-762 
statistical tools, 766-770 
Treatment When an experiment in- 
volves two factors, the combination 
of a level for one factor with a level 
for the second factor. When there is 
just one factor, each of its levels 
can be referred to as a treatment, 
109,411 
Tree Diagram A diagram that 
visually depicts sequential events 
and their marginal, conditional, 
and joint probabilities, 146, 739 
Trend Variation in a time series due 
to the overall upward or downward 
tendency of the series, 686-687 
Trend equations 

forecasts using, 707 
linear, 688-690 
quadratic, 690 
Trimmed Mean A mean calculated 
after dropping an equal number or 
percentage of data values from both 
the high end and the low end of the 
data values, 65 
f-distribution table, See inside rear 

cover of the text 
f-Test A hypothesis test using t as a 
test statistic and relying on the t 
distribution for the determination 
of calculated and critical values, 
for coefficient of correlation, 

570-571 
comparing means of two depen- 
dent samples, 383-387 
comparing means of two 

independent samples, 363-376 
of mean, population standard 

deviation unknown, 328-334 
for partial regression coefficients, 

617-619 
pooled-variances, 363-369 



for slope of regression line, 

571-573 
unequal-variances, 371-376 
Two-Tail Test A test in which the null 
hypothesis can be rejected by an ex- 
treme result in either direction. The 
test arises from a nondirectional 
claim or assertion about the popula- 
tion parameter, 311 
Two-Way Analysis of Variance An 
ANOVA procedure in which there 
are two factors, each operating on 
two or more levels. This design is 
able to examine both main effects 
and interactive effects, 431-452 
assumptions of, 440-441 
hypotheses tested, 440-441 
model for, 440-441 
procedure for, 441-444 
purpose of, 439-440 
Two-way stratified sample, 123-124 
Type I Error The rejection of a null 
hypothesis that is true, 313-314 
power of test and, 343-344 
sample size and, 347-348 
Type II Error The failure to reject a 
null hypothesis that is false, 313-314 
power of test and, 343-344 
sample size and, 347-348 

u 

Unbiased Estimator An estimator 
for which the expected value is 
the same as the actual value of the 
population parameter it is intended 
to estimate, 271 

Uncertainty A decision situation in 
which the possible states of nature 
are known, but their exact proba- 
bilities are unknown, 737 

Unequal-variances i-test, 371-376 

Union At least one of a number of 
possible events occurs; e.g., "A or 
B", 140 

Unpooled t-test. See Unequal- 
variances t-test 



Validity 

external, 110 

internal, 109-110 
Variable independence test. See 
Chi-square test of variable 
independence; Test of variable 
independence 
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Variables 

continuous quantitative, 9 
continuous random, 168, 206 
control charts for, See Control 

charts, for variables 
dependent, 37, 109, 411, 550 
discrete quantitative, 8 
discrete random, 168 
dummy, 631-632, 657-660 
extraneous, 109 

independent, 37, 109, 411, 550. 
See also Chi-square test of 
variable independence 
qualitative, 8 
quantitative, 8 
random, 168, 206 
relationships between, 37-38, 
83-88, 550 
Variance A measure of dispersion 
based on the squared differences 
between observed values and the 
mean, 72-74 

from grouped data, 85 
of two independent samples, 
comparing, 394-398 
Variation 

assignable, 758 
coefficient of, 79-8 1 
random, 757 
Venn Diagram For an experiment, a 
visual display of the sample space 
and the possible events within, 
139-141 

Visual representation of data, 16-47 
abuse of, 34-35 
bar chart, 28-29 
dotplot, 27 



frequency polygon, 20 
histogram, 20-22 
line graph, 28-29 
pictogram, 32-33 
pie chart, 31-32 
scatter diagram, 37-41 
sketch, 33 

w 

Weighted Mean Also called the 
weighted average, a measure of 
central tendency that weights each 
data value according to its relative 
importance, 61 

Wilcoxon Rank Sum Test A 
nonparametric test comparing the 
medians of two independent 
samples, 515-518 

Wilcoxon Signed Rank Test For one 
sample, a nonparametric test of the 
hypothesis that the population me- 
dian could be a given value. For 
paired samples, it tests whether the 
population median of the difference 
between paired observations could 
be zero, 506-514 

for comparing paired samples, 

511-514 
for one sample, 506-509 

Within-group variation, in analysis 
of variance, 412^-13 

Worker Empowerment Also called 
quality at the source, a practice 
where the individual worker is 
(1) responsible for the quality of 
output from his or her own 



workstation, and (2) given 
authority to make changes he or 
she deems necessary for quality 
improvements, 765 

z 

z-Score Also referred to as the z 
value, the number of standard 
deviation units from the mean of 
a normal distribution to a given x 
value, 212 

for sampling distribution of the 

mean, 247 
for sampling distribution of the 

proportion, 253-255 
for standard normal distribution, 
212-221 
z-Test A hypothesis test using z 
as a test statistic and relying on 
the normal distribution for the 
determination of calculated and 
critical values, 318 

approximation to Wilcoxon rank 

sum test, 517 
approximation to Wilcoxon 
signed rank test, 508-509 
comparing means of two 
independent samples, 
378-381 
comparing two sample 

proportions, 388-393 
of mean, population standard 
deviation known, 318-321 
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a = right-tail area, (e.g., for 
a right-tail area of 0.025 and 
d.f. = 15, the (value is 2.131.) 
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Source: t-values generated by Minitab, then rounded to three decimal places. 



e.g., for z= 1.34, refer to The Standard Normal Distribution 

the 1.3 row and the .04 column 
to find the area, .4099. 
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Source: Cumulative standard normal probabilites from z = 0.00 to z = 3.09, generated by Minitab, 
then rounded to four decimal places. 



